Please wait a minute...
New Technology of Library and Information Service  2014, Vol. 30 Issue (3): 65-72    DOI: 10.11925/infotech.1003-3513.2014.03.10
Current Issue | Archive | Adv Search |
Research on the Framework of a User Intent-oriented Intelligent Search Engine
Zheng Wei1, Liang Zhanping1,2, Liang Jian3
1 Department of Information Management, Peking University, Beijing 100871, China;
2 Institute of Scientific & Technical Information of China, Beijing 100038, China;
3 Information Center of Ministry of Science and Technology, Beijing 100038, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper proposes a framework of the intent-oriented intelligent search engine system, and studies the key content ranking algorithm in detail. [Methods] This paper reinvents the search engine algorithms based on the user search intent in three aspects, i.e., content storage, content retrieval and content ranking, and considers multiple factors in the content ranking algorithm, including relevance, reliability, variety and hotness of the content. [Results] Experiments indicate that the relavence of the search results from the intent-based intelligent search algorithm has stably better performance which dominates the traditional keywords-based algorithm. [Limitations] Building intelligent search engine is so complicated that there are still many technical and engineering problems to resolve. Much more experiments need to be conducted to futher verify and improve the content ranking algorithm. [Conclusions] This research lays a foundation of building the next generation intent-oriented intelligent search engine.

Key wordsIntelligent search      User modeling      Retrieval      Ranking     
Received: 29 September 2013      Published: 15 April 2014
:  TP393  

Cite this article:

Zheng Wei, Liang Zhanping, Liang Jian. Research on the Framework of a User Intent-oriented Intelligent Search Engine. New Technology of Library and Information Service, 2014, 30(3): 65-72.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2014.03.10     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2014/V30/I3/65

[1] 李子臣. 搜索技术的现状及发展前景[J]. 情报科学, 2007, 25(7): 1114-1120.(Li Zichen. The Present Situation and the Development Foreground of Seeking Technique [J]. Information Science, 2007, 25(7): 1114-1120.)

[2] Vise D A, Malseed M. The Google Story [M]. New York: Delacorte Press, 2005.

[3] Brin S, Page L. The Anatomy of a Large-scale Hypertextual Web Search Engine [J]. Computer Networks and ISDN Systems, 1998, 30 (1-7): 107-117.

[4] Page L, Brin S, Motwani R, et al. The PageRank Citation Ranking: Bringing Order to the Web [EB/OL]. [2013-08-08]. http://ilpubs.stanford.edu:8090/422.

[5] 张立彬, 杨军花, 杨琴茹. 第三代搜索引擎的研究现状及其发展趋向探析[J]. 情报理论与实践, 2008, 31(5): 785-789.(Zhang Libin, Yang Junhua, Yang Qinru. Probe into the Research Status and Developing Trend of the Third Generation Search Engines [J]. Information Studies: Theory and Application, 2008, 31(5): 785-789.)

[6] 傅欣. 第三代搜索引擎的智能化趋势研究[J]. 现代图书情报技术, 2002(6): 28-30. (Fu Xin. Studies on Intelligent Trends in Third Generation Search Engines [J]. New Technology of Library and Information Service, 2002(6): 28-30.)

[7] 陈林, 杨丹, 赵俊芹. 基于语义理解的智能搜索引擎研究[J]. 计算机科学, 2008, 35(6): 152-154. (Chen Lin, Yang Dan, Zhao Junqin. Research on Intelligent Search Engine Based on Semantic Comprehension [J]. Computer Science, 2008, 35(6): 152-154.)

[8] 杨艺, 周元. 基于用户查询意图识别的Web搜索优化模型[J]. 计算机科学, 2012, 39(1): 264-267. (Yang Yi, Zhou Yuan. Web Retrieval Optimization Model Based on User's Query Intention Identification [J]. Computer Science, 2012, 39(1): 264-267.)

[9] Jansen B J, Booth D L, Spink A. Determining the User Intent of Web Search Engine Queries [C]. In: Proceedings of the 16th International Conference on World Wide Web. New York: ACM, 2007: 1149-1150.

[10] 林国, 李伟超. 个性化搜索引擎中用户兴趣模型研究[J]. 软件导刊, 2012, 11(8): 26-28. (Lin Guo, Li Weichao. Research on User Profile in Personalized Search Engine [J]. Software Guide, 2012, 11(8): 26-28.)

[11] MacKay D. Information Theory, Inference, and Learning Algorithms [M]. UK: Cambridge University Press, 2003: 284-292.

[12] Rice J A. Mathematical Statistics and Data Analysis [M]. The 3rd Edition.Belmont: Thomson Brooks/Cole, 2006.

[13] Goldwater S, Griffiths T L, Johnson M. A Bayesian Framework for Word Segmentation: Exploring the Effects of Context [J]. Cognition, 2009, 112(1): 21-54.

[14] Zhang T, Ramakrishnan R, Livny M. BIRCH: An Efficient Data Clustering Method for Very Large Databases [J]. ACM SIGMOD Record, 1996, 25(2): 103-114.

[15] 陈宝林. 最优化理论与算法 [M].第2版.北京: 清华大学出版社, 2005. (Chen Baolin. Optimization Theory and Algo- rithms [M]. The 2nd Edition.Beijing: Tsinghua University Press, 2005.)

[16] 黄名选, 陈燕红. 关联规则挖掘技术研究 [J]. 情报杂志, 2008,27(4): 119-121,115. (Huang Mingxuan, Chen Yanhong. Studies on Association Rules Mining Techniques[J].Journal of Intelligence, 2008,27(4):119-121,115.)

[17] Wu H, Luk R W P, Wong K F, et al. Interpreting TF-IDF Term Weights as Making Relevance Decisions [J]. ACM Transactions on Information Systems (TOIS), 2008, 26(3): Article No.13.

[18] Tan P, Steinbach M, Kumar V. Introduction to Data Mining [M]. Boston: Pearson Addison-Wesley, 2005.

[19] Herlocker J L, Konstan J A, Terveen L G, et al. Evaluating Collaborative Filtering Recommender Systems[J].ACM Transactions on Information Systems (TOIS), 2004, 22(1): 5-53.

[20] Blei D M, Ng A Y, Jordan M I. Latent Dirichlet Allocation [J]. The Journal of Machine Learning Research, 2003,3: 993-1022.

[21] Wikipedia. Jaccard Index [EB/OL]. [2013-10-08]. http://en. wikipedia.org/wiki/Jaccard_index.

[1] Huang Mingxuan,Jiang Caoqing,Lu Shoudong. Expanding Queries Based on Word Embedding and Expansion Terms[J]. 数据分析与知识发现, 2021, 5(6): 115-125.
[2] Meng Zhen,Wang Hao,Yu Wei,Deng Sanhong,Zhang Baolong. Vocal Music Classification Based on Multi-category Feature Fusion[J]. 数据分析与知识发现, 2021, 5(5): 59-70.
[3] Lu Linong,Zhu Zhongming,Zhang Wangqiang,Wang Xiaochun. Cross-database Knowledge Integration and Fingerprint of Institutional Repositories with Lingo3G Clustering Algorithm[J]. 数据分析与知识发现, 2021, 5(5): 127-132.
[4] Li Yueyan,Wang Hao,Deng Sanhong,Wang Wei. Research Trends of Information Retrieval——Case Study of SIGIR Conference Papers[J]. 数据分析与知识发现, 2021, 5(4): 13-24.
[5] Xu Yicong,Tian Xuedong,Li Xinfu,Yang Fang,Shi Qingxuan. Retrieving Mathematical Expressions Based on Hesitant Fuzzy Weight[J]. 数据分析与知识发现, 2020, 4(7): 118-126.
[6] Deng Siyi,Le Xiaoqiu. Coreference Resolution Based on Dynamic Semantic Attention[J]. 数据分析与知识发现, 2020, 4(5): 46-53.
[7] Zhu Lu,Tian Xiaomeng,Cao Sainan,Liu Yuanyuan. Subspace Cross-modal Retrieval Based on High-Order Semantic Correlation[J]. 数据分析与知识发现, 2020, 4(5): 84-91.
[8] Xiong Xin,Wang Hao,Zhang Haichao,Zhang Baolong. Impacts of Chinese Term Granularity on Measuring Term Discriminative Capacity[J]. 数据分析与知识发现, 2020, 4(2/3): 143-152.
[9] Xinyu Zai,Xuedong Tian. Retrieving Scientific Documents with Formula Description Structure and Word Embedding[J]. 数据分析与知识发现, 2020, 4(1): 131-138.
[10] Mingxuan Huang,Shoudong Lu,Hui Xu. Cross-Language Information Retrieval Based on Weighted Association Patterns and Rule Consequent Expansion[J]. 数据分析与知识发现, 2019, 3(9): 77-87.
[11] Ming Yi,Tingting Zhang. Ranking Answer Quality of Popular Q&A Community[J]. 数据分析与知识发现, 2019, 3(6): 12-20.
[12] Junliang Yao,Xiaoqiu Le. Semantic Matching for Sci-Tech Novelty Retrieval[J]. 数据分析与知识发现, 2019, 3(6): 50-56.
[13] Hui Nie. Modeling Users with Word Vector and Term-Graph Algorithm[J]. 数据分析与知识发现, 2019, 3(12): 30-40.
[14] Ye Guanghui,Yang Jinqing. Route Recommendation Based on Two-way Link Analysis of Urban Name Entities[J]. 数据分析与知识发现, 2019, 3(11): 79-88.
[15] Sun Haixia,Wang Lei,Wu Yingjie,Hua Weina,Li Junlian. Matching Strategies for Institution Names in Literature Database[J]. 数据分析与知识发现, 2018, 2(8): 88-97.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn