Please wait a minute...
New Technology of Library and Information Service  2012, Vol. 28 Issue (1): 53-57    DOI: 10.11925/infotech.1003-3513.2012.01.09
Current Issue | Archive | Adv Search |
Theme Identification Empirical Study on Technical Documentation in Full-text
Ye Chunlei1,2,3, Leng Fuhai1
1. National Science Library, Chinese Academy of Sciences, Beijing 100190, China;
2. Graduate University of Chinese Academy of Sciences, Beijing 100049, China;
3. Information Department, Beijing City University, Beijing 100094, China
Download: PDF(462 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  This paper proposes an improved DT method to identify the theme of the NASA 2011-2020 strategic plan based on multi-word phrases frequency analysis and phrases proximity analysis, adding the term identification of subject. Experiment proves that the improved DT method can identify the theme of documentation in full-text effectively and simply the work of intelligences to some extent.
Key wordsCo-word analysis      Theme identification      DT      Term identification     
Received: 03 November 2011      Published: 26 February 2012
: 

G350

 

Cite this article:

Ye Chunlei, Leng Fuhai. Theme Identification Empirical Study on Technical Documentation in Full-text. New Technology of Library and Information Service, 2012, 28(1): 53-57.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2012.01.09     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2012/V28/I1/53

[1] 靖继鹏, 马费成, 张向先. 情报科学理论[M]. 北京: 科学出版社, 2009: 231-236.

[2] 冷伏海, 吴霞. 基于文献的知识挖掘:概念、关键技术与应用[M].北京:国防工业出版社, 2008:271-306.

[3] 张晗, 崔雷. 生物信息学的共词分析研究[J]. 情报学报 , 2003, 22(5): 613-617.

[4] Kostoff R N, Miles D L, Eberhart H J. System and Method for Database Tomography: U.S., 5440481[P].1995-08-08.

[5] Kostoff R N,Eberhart H J,Toothman D R.Database Tomography for Information Retrieval[J].Journal of Information Science,1997,23(4):301-311.

[6] Kostoff R N, DeMarco R A. Science and Technology Text Mining: Analytical Chemistry[R]. Arlington: Office of Naval Research, 2003:10,20-22.

[7] Kostoff R N, Eberhart H J,Toothman D R,et al.Database Tomography for Technical Intelligence: Comparative Roadmaps of the Research Impact Assessment Literature and the Journal of the American Chemical Society[J].Scientometrics,1997,40(1):103-138.

[8] Kostoff R N, Shlesinger M F, Tshiteya R. Nonlinear Dynamics Text Mining Using Bibliometrics and Database Tomography[J]. International Journal of Bifurcation and Chaos,2004,14(1):61-92.

[9] Kostoff R N, Eberhart H J, Toothman D R. Hypersonic and Supersonic Flow Roadmaps Using Bibliometrics and Database Tomography[J].Journal of the American Society for Information Science,1999,50(5):427-447.

[10] Kostoff R N, Block J A. Context-dependent Conflation, Text Filtering and Clustering[R]. Arlington: Office of Naval Research, 2004:32-40.

[11] 冯璐, 冷伏海. 共词分析方法理论进展[J]. 中国图书馆学报 , 2006,32(2):88-92.

[12] 赵凡,马胜利.数据库内容结构分析法的理论与实践进展研究[J]. 情报理论与实践 ,2008,31(2): 279-282.

[13] 王立学, 冷伏海. 基于文本结构解析的动态共词方法研究[J]. 图书情报工作 , 2010, 54(24):37-40.

[14] Callon M, Law J, Rip A.Mapping the Dynamics of Science and Technology:Sociology of Science in the Real World[M].London: The Macmillan Press LTD, 1986:103-141.

[15] Callon M, Courtial J P, Laville F. Co-word Analysis as a Tool for Describing the Network of Interactions Between Basic and Technological Research: The Case of Polymer Chemistry[J]. Scientometrics, 1991,22(1):155-205.

[16] TerMine [EB/OL]. [2011-12-03]. http://www.nactem.ac.uk/software/termine/.

[17] TerMine Plugin for Protégé 4[EB/OL]. [2011-12-03]. http://www.co-ode.org/downloads/protege-x/plugins/termine-docs.pdf.

[18] Frantzi K T, Ananiadou S, Tsujii J. The C-value/NC-value Method of Automatic Recognition for Multi-word Terms[C]. In: Proceedings of the 2nd European Conference on Research and Advanced Technology for Digital Libraries. 1998: 585-604.

[19] National Aeronautics and Space Administration. 2011 NASA Strategic Plan [EB/OL]. [2011-08-05].http://www.nasa.gov/pdf/516579main_NASA2011StrategicPlan.pdf.
[1] Qikai Cheng,Jiamin Wang,Wei Lu. Discovering Domain Vocabularies Based on Citation Co-word Network[J]. 数据分析与知识发现, 2019, 3(6): 57-65.
[2] Lulu Xu,Xiaoyue Wang,Rujiang Bai,Yanting Zhou. Detecting Emerging Trends of Funds Based on DTM Model and Text Analytics: Case Study of NSF Graphene Field[J]. 数据分析与知识发现, 2018, 2(3): 87-97.
[3] Hong Ma, Yongming Cai. A CA-LDA Model for Chinese Topic Analysis: Case Study of Transportation Law Literature[J]. 数据分析与知识发现, 2016, 32(12): 17-26.
[4] Zhao Yuxiang,Peng Xixian. Media as a Community? Literature Based Topic Evaluation in Information Systems Discipline[J]. 现代图书情报技术, 2014, 30(1): 56-65.
[5] Hu Changping, Chen Guo. A New Feature Selection Method Based on Term Contribution in Co-word Analysis[J]. 现代图书情报技术, 2013, 29(7/8): 89-93.
[6] Tang Xiaobo, Xiao Lu. Research of Co-word Analysis Method of Combining Keywords Extension and Domain Ontology[J]. 现代图书情报技术, 2013, 29(11): 60-67.
[7] Zhang Biqiao, Han Shenglong. Audio to Score Alignment Based on Chroma Features and Dynamic Time Warping Algorithm[J]. 现代图书情报技术, 2012, 28(1): 40-45.
[8] Lu Wei, Peng Yu, Chen Wu. Hot Research Topics Detection Based on SOM[J]. 现代图书情报技术, 2011, 27(1): 63-68.
[9] Yang Ying, Cui Lei. Evolution of Topics About Medical Informatics by Improved Co-word Cluster Analysis[J]. 现代图书情报技术, 2011, 27(1): 83-87.
[10] Wang Lixue,Leng Fuhai,Wang Haixia. Research on Technology Readiness Level and Identified Methods[J]. 现代图书情报技术, 2010, 26(3): 58-63.
[11] Chu Jiuliang. The Construction of Campus Network Equipment Monitoring Platform Using Open Source Software[J]. 现代图书情报技术, 2010, 26(2): 85-90.
[12] Li Shuqing,Cheng Guoda,Wang Weimin. The Schema Matching of XML and DTD Based on Weighted XML Data Model[J]. 现代图书情报技术, 2010, 26(1): 57-65.
[13] Chen Shiji. Survey of Approaches to Research Front Detection[J]. 现代图书情报技术, 2009, (9): 28-33.
[14] Wang Jiandong. Domestic Information Services Research Concept Network Analysis Based on Complex Network Method[J]. 现代图书情报技术, 2009, (10): 56-61.
[15] Chen Tun,Chen Xin . A Study on Perl Programming Language Aided Informetrics[J]. 现代图书情报技术, 2006, 1(7): 41-46.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn