Please wait a minute...
New Technology of Library and Information Service  2012, Vol. 28 Issue (1): 53-57    DOI: 10.11925/infotech.1003-3513.2012.01.09
Current Issue | Archive | Adv Search |
Theme Identification Empirical Study on Technical Documentation in Full-text
Ye Chunlei1,2,3, Leng Fuhai1
1. National Science Library, Chinese Academy of Sciences, Beijing 100190, China;
2. Graduate University of Chinese Academy of Sciences, Beijing 100049, China;
3. Information Department, Beijing City University, Beijing 100094, China
Export: BibTeX | EndNote (RIS)      
Abstract  This paper proposes an improved DT method to identify the theme of the NASA 2011-2020 strategic plan based on multi-word phrases frequency analysis and phrases proximity analysis, adding the term identification of subject. Experiment proves that the improved DT method can identify the theme of documentation in full-text effectively and simply the work of intelligences to some extent.
Key wordsCo-word analysis      Theme identification      DT      Term identification     
Received: 03 November 2011      Published: 26 February 2012



Cite this article:

Ye Chunlei, Leng Fuhai. Theme Identification Empirical Study on Technical Documentation in Full-text. New Technology of Library and Information Service, 2012, 28(1): 53-57.

URL:     OR

[1] 靖继鹏, 马费成, 张向先. 情报科学理论[M]. 北京: 科学出版社, 2009: 231-236.

[2] 冷伏海, 吴霞. 基于文献的知识挖掘:概念、关键技术与应用[M].北京:国防工业出版社, 2008:271-306.

[3] 张晗, 崔雷. 生物信息学的共词分析研究[J]. 情报学报 , 2003, 22(5): 613-617.

[4] Kostoff R N, Miles D L, Eberhart H J. System and Method for Database Tomography: U.S., 5440481[P].1995-08-08.

[5] Kostoff R N,Eberhart H J,Toothman D R.Database Tomography for Information Retrieval[J].Journal of Information Science,1997,23(4):301-311.

[6] Kostoff R N, DeMarco R A. Science and Technology Text Mining: Analytical Chemistry[R]. Arlington: Office of Naval Research, 2003:10,20-22.

[7] Kostoff R N, Eberhart H J,Toothman D R,et al.Database Tomography for Technical Intelligence: Comparative Roadmaps of the Research Impact Assessment Literature and the Journal of the American Chemical Society[J].Scientometrics,1997,40(1):103-138.

[8] Kostoff R N, Shlesinger M F, Tshiteya R. Nonlinear Dynamics Text Mining Using Bibliometrics and Database Tomography[J]. International Journal of Bifurcation and Chaos,2004,14(1):61-92.

[9] Kostoff R N, Eberhart H J, Toothman D R. Hypersonic and Supersonic Flow Roadmaps Using Bibliometrics and Database Tomography[J].Journal of the American Society for Information Science,1999,50(5):427-447.

[10] Kostoff R N, Block J A. Context-dependent Conflation, Text Filtering and Clustering[R]. Arlington: Office of Naval Research, 2004:32-40.

[11] 冯璐, 冷伏海. 共词分析方法理论进展[J]. 中国图书馆学报 , 2006,32(2):88-92.

[12] 赵凡,马胜利.数据库内容结构分析法的理论与实践进展研究[J]. 情报理论与实践 ,2008,31(2): 279-282.

[13] 王立学, 冷伏海. 基于文本结构解析的动态共词方法研究[J]. 图书情报工作 , 2010, 54(24):37-40.

[14] Callon M, Law J, Rip A.Mapping the Dynamics of Science and Technology:Sociology of Science in the Real World[M].London: The Macmillan Press LTD, 1986:103-141.

[15] Callon M, Courtial J P, Laville F. Co-word Analysis as a Tool for Describing the Network of Interactions Between Basic and Technological Research: The Case of Polymer Chemistry[J]. Scientometrics, 1991,22(1):155-205.

[16] TerMine [EB/OL]. [2011-12-03].

[17] TerMine Plugin for Protégé 4[EB/OL]. [2011-12-03].

[18] Frantzi K T, Ananiadou S, Tsujii J. The C-value/NC-value Method of Automatic Recognition for Multi-word Terms[C]. In: Proceedings of the 2nd European Conference on Research and Advanced Technology for Digital Libraries. 1998: 585-604.

[19] National Aeronautics and Space Administration. 2011 NASA Strategic Plan [EB/OL]. [2011-08-05].
[1] Wu Jinming,Hou Yuefang,Cui Lei. Automatic Expression of Co-occurrence Clustering Based on Indexing Rules of Medical Subject Headings[J]. 数据分析与知识发现, 2020, 4(9): 133-144.
[2] Qikai Cheng,Jiamin Wang,Wei Lu. Discovering Domain Vocabularies Based on Citation Co-word Network[J]. 数据分析与知识发现, 2019, 3(6): 57-65.
[3] Xu Lulu,Wang Xiaoyue,Bai Rujiang,Zhou Yanting. Detecting Emerging Trends of Funds Based on DTM Model and Text Analytics: Case Study of NSF Graphene Field[J]. 数据分析与知识发现, 2018, 2(3): 87-97.
[4] Hong Ma, Yongming Cai. A CA-LDA Model for Chinese Topic Analysis: Case Study of Transportation Law Literature[J]. 数据分析与知识发现, 2016, 32(12): 17-26.
[5] Zhao Yuxiang,Peng Xixian. Media as a Community? Literature Based Topic Evaluation in Information Systems Discipline[J]. 现代图书情报技术, 2014, 30(1): 56-65.
[6] Hu Changping, Chen Guo. A New Feature Selection Method Based on Term Contribution in Co-word Analysis[J]. 现代图书情报技术, 2013, 29(7/8): 89-93.
[7] Tang Xiaobo, Xiao Lu. Research of Co-word Analysis Method of Combining Keywords Extension and Domain Ontology[J]. 现代图书情报技术, 2013, 29(11): 60-67.
[8] Zhang Biqiao, Han Shenglong. Audio to Score Alignment Based on Chroma Features and Dynamic Time Warping Algorithm[J]. 现代图书情报技术, 2012, 28(1): 40-45.
[9] Lu Wei, Peng Yu, Chen Wu. Hot Research Topics Detection Based on SOM[J]. 现代图书情报技术, 2011, 27(1): 63-68.
[10] Yang Ying, Cui Lei. Evolution of Topics About Medical Informatics by Improved Co-word Cluster Analysis[J]. 现代图书情报技术, 2011, 27(1): 83-87.
[11] Wang Lixue,Leng Fuhai,Wang Haixia. Research on Technology Readiness Level and Identified Methods[J]. 现代图书情报技术, 2010, 26(3): 58-63.
[12] Chu Jiuliang. The Construction of Campus Network Equipment Monitoring Platform Using Open Source Software[J]. 现代图书情报技术, 2010, 26(2): 85-90.
[13] Li Shuqing,Cheng Guoda,Wang Weimin. The Schema Matching of XML and DTD Based on Weighted XML Data Model[J]. 现代图书情报技术, 2010, 26(1): 57-65.
[14] Chen Shiji. Survey of Approaches to Research Front Detection[J]. 现代图书情报技术, 2009, (9): 28-33.
[15] Wang Jiandong. Domestic Information Services Research Concept Network Analysis Based on Complex Network Method[J]. 现代图书情报技术, 2009, (10): 56-61.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938