New Technology of Library and Information Service  2006, Vol. 1 Issue (2): 5-9    DOI: 10.11925/infotech.1003-3513.2006.02.02
Research on a New Text Automatic Indexing Technology Based on Digital Library
Wang Lancheng¹, Wang Lishuang²
¹ Department of Information Management, Nanjing Political College PLA, Shanghai 200433, China
² Wanfang Data Co., Ltd, Beijing 100044, China
Abstract  

This paper studies and establishes a semantic environment controlled by the location information of special stop-words. The technique is applied to the automatic indexing of Chinese CXMARC metadata text and to the mining of subject information. Used as a preprocessing step in the automatic indexing of specialized Chinese text, the SWF algorithm efficiently reduces word-segmentation ambiguity within a field and shortens indexing time. It thereby improves both the quality and the efficiency of the traditional maximum matching algorithm.
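The abstract gives no details of the SWF algorithm, so the following is only a minimal sketch of the general idea it describes: cut the text at stop-words first, then run forward maximum matching on each shorter fragment. The function names, the sample dictionary, and the stop-word list are all hypothetical, and the sketch does not model the stop-word location information that the paper relies on.

```python
def split_on_stop_words(text, stop_words):
    """Cut text into fragments at every stop-word occurrence.

    Assumption: stop-words are removed outright. Longer stop-words are
    tried first so that a short one does not shadow a longer one.
    """
    ordered = sorted((w for w in stop_words if w), key=len, reverse=True)
    fragments, current, i = [], [], 0
    while i < len(text):
        hit = next((w for w in ordered if text.startswith(w, i)), None)
        if hit:
            if current:
                fragments.append("".join(current))
                current = []
            i += len(hit)
        else:
            current.append(text[i])
            i += 1
    if current:
        fragments.append("".join(current))
    return fragments


def forward_max_match(fragment, dictionary, max_len):
    """Greedy forward maximum matching: at each position take the longest
    dictionary word, falling back to a single character."""
    words, i = [], 0
    while i < len(fragment):
        for size in range(min(max_len, len(fragment) - i), 0, -1):
            candidate = fragment[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


def index_terms(text, dictionary, stop_words):
    """Pre-split on stop-words, then segment each fragment."""
    max_len = max((len(w) for w in dictionary), default=1)
    terms = []
    for fragment in split_on_stop_words(text, stop_words):
        terms.extend(forward_max_match(fragment, dictionary, max_len))
    return terms


if __name__ == "__main__":
    # Hypothetical dictionary and stop-word list, for illustration only.
    dictionary = {"数字", "图书馆", "数字图书馆", "自动", "标引", "自动标引"}
    stop_words = {"的"}
    print(index_terms("数字图书馆的自动标引", dictionary, stop_words))
```

Because each fragment is shorter than the full field, the matcher has fewer candidate boundaries to consider, which is one plausible reading of how pre-splitting could reduce ambiguity and indexing time. A naive split like this one would wrongly break dictionary words that happen to contain a stop-word character, which is presumably where location information becomes necessary.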

Key words: Automatic indexing; Digital library; Chinese information processing; MARC
Received: 13 September 2005      Published: 25 February 2005
CLC Number: G254.36

Corresponding Authors: Wang Lancheng     E-mail: wanglancheng@163.com
About the authors: Wang Lancheng, Wang Lishuang

Cite this article:

Wang Lancheng, Wang Lishuang. Research on a New Text Automatic Indexing Technology Based on Digital Library. New Technology of Library and Information Service, 2006, 1(2): 5-9.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2006.02.02     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2006/V1/I2/5

