Please wait a minute...
New Technology of Library and Information Service  2006, Vol. 1 Issue (9): 23-28    DOI: 10.11925/infotech.1003-3513.2006.09.06
Current Issue | Archive | Adv Search |
Design and Implementation of Automatic Extraction Relevance Terms System Based on Multi-context
Zhang Chengzhi    Su Lanfang    Su Xinning
 (Department of Information Management, Nanjing University, Nanjing 210093, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper chooses corpus, definitions dictionaries and users’ query logs as contexts to extract the relevance terms. The experiment results show that the overlap ratio of results in different contexts is very low. So, it is necessary to integrate the different results. All of the relevance terms are integrated to a relevance table through direct integration.

Key wordsRelevance term      Multi-context      Corpus      Definitions dictionary      Query log     
Received: 05 June 2006      Published: 25 September 2006
: 

TP391 

 
     
  G252

 
Corresponding Authors: Zhang Chengzhi     E-mail: zcz51@citiz.net
About author:: Zhang Chengzhi,Su Lanfang,Su Xinning

Cite this article:

Zhang Chengzhi,Su Lanfang,Su Xinning . Design and Implementation of Automatic Extraction Relevance Terms System Based on Multi-context. New Technology of Library and Information Service, 2006, 1(9): 23-28.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2006.09.06     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2006/V1/I9/23

1贺宏朝等.一种基于上下文的中文信息检索查询扩展.中文信息学报,16(6):32-37
2Voorhees,E. M..Query expansion using lexical semantic relations.Proceedings of  the 17th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval,Dublin, Ireland.1994.61-69
3Yufeng Jing,W. B. Croft.An Association Thesaurus for Information Retrieval.Technical Report: UM-CS-1994-017.University of Massachusetts.1994
4Crouch C.A Cluster-based approach to thesaurus construction.Proceedings of the 11th Annual International ACM-SIGIR Conference on Research & Development in Information Retrieval,Grenoble,ACM Press,1988.309-320
5Crouch,C, Yang,B.Experiments in Automatic Statistical Thesaurus Construction. Proceedings of the 15th Annual International ACM-SIGIR Conference on Research & Development in Information Retrieval,Copenhagen, Denmark, ACM Press, 1992.77-88
6Hsinchun Chen,Kevin J ,Lynch.Automatic construction of networks of concepts characterizing document databases.IEEE Transactions on Systems,1992,22(5):885-902
7Peter D. Tumey.Mining the Web for synonyms: PMI-IR versus LSA on TOEFL.Proceedings of the 12th European Conference on Machine Learning.Freiburg, Germany.2001.491-502
8Pierre P. Senellart, Vincent D. Blondel, Automatic discovery of similar words, chapter in: Survey of Text Mining, Springer-Verlag, 2003
9Masaki Murata, Toshiyuki Kanamaru, Hitoshi Isahara.Automatic synonym acquisition based on matching of definition sentences in multiple dictionaries.CICLing 2005,LNCS 3406.2005:293-304
10崔航,文继荣,李敏强.基于用户日志的查询扩展统计模型.软件学报,2003,14(9):1593-1599
11Hsiao-Tieh Pu,Lee-Feng Chien.Integrating Log-Based and Text-Based Methods Towards Automatic Web Thesaurus Construction.Proceedings of the American Society for Information Science and Technology.2004,41(1):463-471
12周全明.全文检索系统后控制技术研究:[学位论文].北京:空间政治学院,1995
13朱毅华,侯汉清,沙印亭.计算机识别汉语同义词的两种算法比较和测评.中国图书馆学报,2002,28(4):82-85
14http://ccl.pku.edu.cn/doubtfire/Course/Chinese%20Information%20Processing/Source_Code/Chapter_8/Lexicon_full_2000.zip(Accessed Apr.20,2006)
15罗盛芬,孙茂松.基于字串内部结合紧密度的汉语自动抽词实验研究.中文信息学报,2003,17(3):9-14
16G.Salon, M.J.Mcgill. Introduction to Modern Information Retrieval. NewYork:McGraw-Hill Book Co.,  1983
17王益,白钦先.当代金融辞典.北京:中国经济出版社,2000
18Baidu.http://www.baidu.com (Accessed Apr.20,2006)
19天网.http://e.pku.edu.cn (Accessed Apr.20,2006)

[1] Liu Wenbin, He Yanqing, Wu Zhenfeng, Dong Cheng. Sentence Alignment Method Based on BERT and Multi-similarity Fusion[J]. 数据分析与知识发现, 2021, 5(7): 48-58.
[2] Liang Jiwen,Jiang Chuan,Wang Dongbo. Chinese-English Sentence Alignment of Ancient Literature Based on Multi-feature Fusion[J]. 数据分析与知识发现, 2020, 4(9): 123-132.
[3] Liu Tong,Ni Weijian,Liu Mei. Identifying Terminology from Search Engine Query Logs[J]. 现代图书情报技术, 2016, 32(2): 25-33.
[4] Tong Guoping, Sun Jianjun. User Behavior Analysis Based on Search Engine Log[J]. 现代图书情报技术, 2015, 31(7-8): 80-88.
[5] Gu Wei, Li Chaofan, Wang Hongjun, Xiao Shibin, Shi Shuicai. Acquisition of Synonym from Patent Query Logs[J]. 现代图书情报技术, 2015, 31(2): 24-30.
[6] Zhang Xiaojuan, Tang Xiangbin. Query Recommendation Based on User Task[J]. 现代图书情报技术, 2014, 30(4): 34-40.
[7] Zeng Zhen, Lv Xueqiang, Li Zhuo. The Automatic Identification of Chinese Names in Query Logs[J]. 现代图书情报技术, 2014, 30(12): 71-77.
[8] Guan Xiaoda,Lv Xueqiang,Li Zhuo,Zheng Luexing,. Chinese Organization Name Recognition in User Query Log[J]. 现代图书情报技术, 2014, 30(1): 72-78.
[9] Tang Yafen. Research of Automatically Recognizing Name in Pre-Qin Ancient Chinese Classics[J]. 现代图书情报技术, 2013, 29(7/8): 63-68.
[10] Xiong Wenxin. Sentence Alignment and Re-Alignment for Environmental Protection Texts in English-Chinese Parallel Corpus[J]. 现代图书情报技术, 2013, (6): 36-41.
[11] Wang Hao, Li Sishu, Deng Sanhong. Study on Text Language Recognition Based on N-Gram[J]. 现代图书情报技术, 2013, (4): 54-61.
[12] Yuan Dong, Xiong Jing, Liu Yongge. Research on Example-based Machine Translation for Oracle Bone Inscriptions[J]. 现代图书情报技术, 2012, 28(5): 48-54.
[13] Kang Xiaoli, Zhang Chengzhi. Chinese-English Comparable Corpus Construction for Bilingual Terminology Extraction[J]. 现代图书情报技术, 2012, 28(2): 28-33.
[14] Xie Jing, Su Xinning, Shen Si. Chinese Phrase Tagging and Automated Annotation Based on CSSCI Corpus[J]. 现代图书情报技术, 2012, (12): 32-38.
[15] Wang Dongbo, Han Pu, Shen Si, Wei Xiangqing. Research of Mining the Category Knowledge Based on English-Chinese Humanities and Social Sciences Parallel Corpus in Phrase Level[J]. 现代图书情报技术, 2012, (11): 40-46.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn