%A Yu Chuanming,Zhong Yunci,Lin Aochen,An Lu %T Author Name Disambiguation with Network Embedding %0 Journal Article %D 2020 %J Data Analysis and Knowledge Discovery %R 10.11925/infotech.2096-3467.2019.0644 %P 48-59 %V 4 %N 2/3 %U {https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/abstract/article_4793.shtml} %8 2020-02-25 %X

[Objective] The paper tries to eliminate the ambiguity of author names in the document system, aiming to solve the problem of incorrect document aggregation.[Methods] First, we constructed three types of networks for authors, documents and author-documents, with structured document data. Then we combined different network embedding methods to obtain the representation of document nodes. Finally, we employed the unsupervised learning model and the hierarchical agglomerative clustering to process the documents.[Results] We conducted empirical studies on datasets from ArnetMiner, CiteSeerX and DBLP. Our method performed well on sparse networks and the macro-F1 value increased by 6%.[Limitations] We only explored author name disambiguation in English.[Conclusions] The proposed method could effectively reduce the ambiguity of author names. It is of great significance for scientific collaboration and citation recommendation, as well as knowledge network related research.