|
|
Query Expansion of Pseudo Relevance Feedback Based on Feature Terms Extraction and Correlation Fusion |
Feng Ping1, Huang Mingxuan2 |
1. Electronic Information and Control Engineering Department, Guangxi University of Technology, Liuzhou 545006, China;
2. Department of Math and Computer Science, Guangxi College of Education, Nanning 530023, China |
|
|
Abstract Aiming at the term mismatch issues of existing information retrieval systems, a novel query expansion algorithm of pseudo relevance feedback is proposed based on feature terms extraction and correlation fusion. At the same time, a new computing method for weights of expansion terms is also given. The algorithm can extract feature terms related to original query from the n chapter top-ranked retrieved local documents, and then identify those feature terms as final expansion terms according to the frequency of each feature term appeared in the local documents and the correlation between each feature term and the entire original query for query expansion. The results of the experiment show that the method is effective,and it can enhance and improve the performance of information retrieval.
|
Received: 25 November 2010
Published: 12 February 2011
|
|
[1] 黄名选,严小卫,张师超. 查询扩展技术进展与展望
[J]. 计算机应用与软件 ,2007, 24(11): 1-4,8.
[2] Yu S, Cai D, Wen J, et al. Improving Pseudo-Relevance Feedback in Web Information Retrieval Using Web Page Segmentation. In: Proceedings of the 12th World Wide Web Conference (WWW2003), Budapest, Hungary. 2003:11-18.
[3] Huang X, Huang Y R, Wen M, et al. Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval. In: Proceedings of the 6th IEEE International Conference on Data Mining (ICDM’06), Hong Kong. 2006: 295-306.
[4] 黄名选,严小卫,张师超.基于矩阵加权关联规则挖掘的伪相关反馈查询扩展
[J]. 软件学报 , 2009,20(7):1854-1865.
[5] Cao G H, Nie J Y, Gao J F, et al. Selecting Good Expansion Terms for Pseudo-Relevance Feedback. In: Proceedings of SIGIR’08 Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2008), Singapore.2008:243-250.
[6] Salton G, Buckley C. Improving Retrieval Performance by Relevance Feedback
[J]. Journal of the American Society for Information Science, 1990, 41(4):288-297.
[7] Xu J, Croft W B. Query Expansion Using Local and Global Document Analysis. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland. 1996: 4-11.
[8] Hang C, Wen J R, Nie J Y, et al. Query Expansion by Mining User Logs
[J]. IEEE Transactions on Knowledge and Data Engineering, 2003, 15(4): 829-839.
[9] Fonseca B M, Golgher P B, Moura E S, et al. Discovering Search Engine Related Query Using Association Rules
[J]. Journal of Web Engineering, 2004, 2(4): 215-227.
[10] Zhang C, Qin Z, Yan X. Association-based Segmentation for Chinese-Crossed Query Expansion
[J]. IEEE Intelligent Informatics Bulletin, 2005, 5(1): 18-25.
[11] Manmatha R, Rath T. Using Models of Score Distributions in Information Retrieval. In: Proceedings of the 24th ACM Conference on Research and Development in Information Retrieval, New York, USA. 2001.
[12] Han J, Kamber M. Data Mining: Concepts and Techniques
[M]. 1st Edition. Morgan: Kaufmann Publishers,2000.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|