|
|
Theme Identification Empirical Study on Technical Documentation in Full-text |
Ye Chunlei1,2,3, Leng Fuhai1 |
1. National Science Library, Chinese Academy of Sciences, Beijing 100190, China;
2. Graduate University of Chinese Academy of Sciences, Beijing 100049, China;
3. Information Department, Beijing City University, Beijing 100094, China |
|
|
Abstract This paper proposes an improved DT method to identify the theme of the NASA 2011-2020 strategic plan based on multi-word phrases frequency analysis and phrases proximity analysis, adding the term identification of subject. Experiment proves that the improved DT method can identify the theme of documentation in full-text effectively and simply the work of intelligences to some extent.
|
Received: 03 November 2011
Published: 26 February 2012
|
|
[1] 靖继鹏, 马费成, 张向先. 情报科学理论[M]. 北京: 科学出版社, 2009: 231-236.[2] 冷伏海, 吴霞. 基于文献的知识挖掘:概念、关键技术与应用[M].北京:国防工业出版社, 2008:271-306.[3] 张晗, 崔雷. 生物信息学的共词分析研究[J]. 情报学报 , 2003, 22(5): 613-617.[4] Kostoff R N, Miles D L, Eberhart H J. System and Method for Database Tomography: U.S., 5440481[P].1995-08-08.[5] Kostoff R N,Eberhart H J,Toothman D R.Database Tomography for Information Retrieval[J].Journal of Information Science,1997,23(4):301-311.[6] Kostoff R N, DeMarco R A. Science and Technology Text Mining: Analytical Chemistry[R]. Arlington: Office of Naval Research, 2003:10,20-22.[7] Kostoff R N, Eberhart H J,Toothman D R,et al.Database Tomography for Technical Intelligence: Comparative Roadmaps of the Research Impact Assessment Literature and the Journal of the American Chemical Society[J].Scientometrics,1997,40(1):103-138.[8] Kostoff R N, Shlesinger M F, Tshiteya R. Nonlinear Dynamics Text Mining Using Bibliometrics and Database Tomography[J]. International Journal of Bifurcation and Chaos,2004,14(1):61-92.[9] Kostoff R N, Eberhart H J, Toothman D R. Hypersonic and Supersonic Flow Roadmaps Using Bibliometrics and Database Tomography[J].Journal of the American Society for Information Science,1999,50(5):427-447.[10] Kostoff R N, Block J A. Context-dependent Conflation, Text Filtering and Clustering[R]. Arlington: Office of Naval Research, 2004:32-40.[11] 冯璐, 冷伏海. 共词分析方法理论进展[J]. 中国图书馆学报 , 2006,32(2):88-92.[12] 赵凡,马胜利.数据库内容结构分析法的理论与实践进展研究[J]. 情报理论与实践 ,2008,31(2): 279-282.[13] 王立学, 冷伏海. 基于文本结构解析的动态共词方法研究[J]. 图书情报工作 , 2010, 54(24):37-40.[14] Callon M, Law J, Rip A.Mapping the Dynamics of Science and Technology:Sociology of Science in the Real World[M].London: The Macmillan Press LTD, 1986:103-141.[15] Callon M, Courtial J P, Laville F. Co-word Analysis as a Tool for Describing the Network of Interactions Between Basic and Technological Research: The Case of Polymer Chemistry[J]. Scientometrics, 1991,22(1):155-205.[16] TerMine [EB/OL]. [2011-12-03]. http://www.nactem.ac.uk/software/termine/.[17] TerMine Plugin for Protégé 4[EB/OL]. [2011-12-03]. http://www.co-ode.org/downloads/protege-x/plugins/termine-docs.pdf.[18] Frantzi K T, Ananiadou S, Tsujii J. The C-value/NC-value Method of Automatic Recognition for Multi-word Terms[C]. In: Proceedings of the 2nd European Conference on Research and Advanced Technology for Digital Libraries. 1998: 585-604.[19] National Aeronautics and Space Administration. 2011 NASA Strategic Plan [EB/OL]. [2011-08-05].http://www.nasa.gov/pdf/516579main_NASA2011StrategicPlan.pdf. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|