[Objective] This paper identifies the basic vocabulary of a specific domain from academic papers, in order to capture the domain's knowledge structure and development context. [Methods] We combined citation network analysis with co-word analysis to construct a citation co-word network, and then used the PageRank algorithm to evaluate the importance of candidate words. We evaluated the proposed method on 110,360 computer science articles. [Results] We compared the proposed method with the word frequency method and with co-word analysis, both qualitatively and quantitatively. The proposed method performed well, and its average precision in a blind selection experiment reached 72.6%. [Limitations] The proposed method was only examined with computer science articles. [Conclusions] The proposed method could improve the performance of basic vocabulary discovery in a specific domain.
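The general idea in [Methods] can be illustrated with a minimal sketch. The abstract does not specify how the citation co-word network is built, so the construction here is an assumption: each keyword of a citing paper is linked to each keyword of the paper it cites, with edge weights counting how often that happens. The function names, the toy papers and the damping factor of 0.85 (the conventional PageRank default) are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def build_citation_coword_edges(keywords, citations):
    """Build weighted directed word-to-word edges from citation links.

    keywords:  dict mapping paper id -> list of keywords
    citations: iterable of (citing_id, cited_id) pairs
    Assumption: a citation links every keyword of the citing paper
    to every keyword of the cited paper (self-links are skipped).
    """
    weights = defaultdict(float)
    for citing, cited in citations:
        for src in set(keywords.get(citing, [])):
            for dst in set(keywords.get(cited, [])):
                if src != dst:
                    weights[(src, dst)] += 1.0
    return dict(weights)


def pagerank(weights, damping=0.85, iterations=100, tol=1e-10):
    """Weighted PageRank by power iteration over a dict of (src, dst) -> weight."""
    nodes = sorted({node for edge in weights for node in edge})
    if not nodes:
        return {}
    out_weight = defaultdict(float)
    for (src, _dst), w in weights.items():
        out_weight[src] += w

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by words with no outgoing edges is spread uniformly.
        dangling = sum(rank[node] for node in nodes if out_weight[node] == 0)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for (src, dst), w in weights.items():
            new_rank[dst] += damping * rank[src] * w / out_weight[src]
        delta = sum(abs(new_rank[node] - rank[node]) for node in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Toy data (hypothetical): three papers and their citation links.
keywords = {
    "p1": ["neural network", "image classification"],
    "p2": ["neural network", "backpropagation"],
    "p3": ["backpropagation", "gradient descent"],
}
citations = [("p1", "p2"), ("p2", "p3"), ("p1", "p3")]

edges = build_citation_coword_edges(keywords, citations)
ranks = pagerank(edges)

# List candidate words from most to least important.
for word, score in sorted(ranks.items(), key=lambda item: -item[1]):
    print(f"{word}: {score:.4f}")
```

In this sketch, words that are repeatedly reached through citations from other words accumulate a higher score, which is the intuition behind ranking candidate words by PageRank rather than by raw frequency.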