Please wait a minute...
New Technology of Library and Information Service  2014, Vol. 30 Issue (3): 65-72    DOI: 10.11925/infotech.1003-3513.2014.03.10
Current Issue | Archive | Adv Search |
Research on the Framework of a User Intent-oriented Intelligent Search Engine
Zheng Wei1, Liang Zhanping1,2, Liang Jian3
1 Department of Information Management, Peking University, Beijing 100871, China;
2 Institute of Scientific & Technical Information of China, Beijing 100038, China;
3 Information Center of Ministry of Science and Technology, Beijing 100038, China
Download: PDF(670 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper proposes a framework of the intent-oriented intelligent search engine system, and studies the key content ranking algorithm in detail. [Methods] This paper reinvents the search engine algorithms based on the user search intent in three aspects, i.e., content storage, content retrieval and content ranking, and considers multiple factors in the content ranking algorithm, including relevance, reliability, variety and hotness of the content. [Results] Experiments indicate that the relavence of the search results from the intent-based intelligent search algorithm has stably better performance which dominates the traditional keywords-based algorithm. [Limitations] Building intelligent search engine is so complicated that there are still many technical and engineering problems to resolve. Much more experiments need to be conducted to futher verify and improve the content ranking algorithm. [Conclusions] This research lays a foundation of building the next generation intent-oriented intelligent search engine.

Key wordsIntelligent search      User modeling      Retrieval      Ranking     
Received: 29 September 2013      Published: 15 April 2014
:  TP393  

Cite this article:

Zheng Wei, Liang Zhanping, Liang Jian. Research on the Framework of a User Intent-oriented Intelligent Search Engine. New Technology of Library and Information Service, 2014, 30(3): 65-72.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2014.03.10     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2014/V30/I3/65

[1] 李子臣. 搜索技术的现状及发展前景[J]. 情报科学, 2007, 25(7): 1114-1120.(Li Zichen. The Present Situation and the Development Foreground of Seeking Technique [J]. Information Science, 2007, 25(7): 1114-1120.)

[2] Vise D A, Malseed M. The Google Story [M]. New York: Delacorte Press, 2005.

[3] Brin S, Page L. The Anatomy of a Large-scale Hypertextual Web Search Engine [J]. Computer Networks and ISDN Systems, 1998, 30 (1-7): 107-117.

[4] Page L, Brin S, Motwani R, et al. The PageRank Citation Ranking: Bringing Order to the Web [EB/OL]. [2013-08-08]. http://ilpubs.stanford.edu:8090/422.

[5] 张立彬, 杨军花, 杨琴茹. 第三代搜索引擎的研究现状及其发展趋向探析[J]. 情报理论与实践, 2008, 31(5): 785-789.(Zhang Libin, Yang Junhua, Yang Qinru. Probe into the Research Status and Developing Trend of the Third Generation Search Engines [J]. Information Studies: Theory and Application, 2008, 31(5): 785-789.)

[6] 傅欣. 第三代搜索引擎的智能化趋势研究[J]. 现代图书情报技术, 2002(6): 28-30. (Fu Xin. Studies on Intelligent Trends in Third Generation Search Engines [J]. New Technology of Library and Information Service, 2002(6): 28-30.)

[7] 陈林, 杨丹, 赵俊芹. 基于语义理解的智能搜索引擎研究[J]. 计算机科学, 2008, 35(6): 152-154. (Chen Lin, Yang Dan, Zhao Junqin. Research on Intelligent Search Engine Based on Semantic Comprehension [J]. Computer Science, 2008, 35(6): 152-154.)

[8] 杨艺, 周元. 基于用户查询意图识别的Web搜索优化模型[J]. 计算机科学, 2012, 39(1): 264-267. (Yang Yi, Zhou Yuan. Web Retrieval Optimization Model Based on User's Query Intention Identification [J]. Computer Science, 2012, 39(1): 264-267.)

[9] Jansen B J, Booth D L, Spink A. Determining the User Intent of Web Search Engine Queries [C]. In: Proceedings of the 16th International Conference on World Wide Web. New York: ACM, 2007: 1149-1150.

[10] 林国, 李伟超. 个性化搜索引擎中用户兴趣模型研究[J]. 软件导刊, 2012, 11(8): 26-28. (Lin Guo, Li Weichao. Research on User Profile in Personalized Search Engine [J]. Software Guide, 2012, 11(8): 26-28.)

[11] MacKay D. Information Theory, Inference, and Learning Algorithms [M]. UK: Cambridge University Press, 2003: 284-292.

[12] Rice J A. Mathematical Statistics and Data Analysis [M]. The 3rd Edition.Belmont: Thomson Brooks/Cole, 2006.

[13] Goldwater S, Griffiths T L, Johnson M. A Bayesian Framework for Word Segmentation: Exploring the Effects of Context [J]. Cognition, 2009, 112(1): 21-54.

[14] Zhang T, Ramakrishnan R, Livny M. BIRCH: An Efficient Data Clustering Method for Very Large Databases [J]. ACM SIGMOD Record, 1996, 25(2): 103-114.

[15] 陈宝林. 最优化理论与算法 [M].第2版.北京: 清华大学出版社, 2005. (Chen Baolin. Optimization Theory and Algo- rithms [M]. The 2nd Edition.Beijing: Tsinghua University Press, 2005.)

[16] 黄名选, 陈燕红. 关联规则挖掘技术研究 [J]. 情报杂志, 2008,27(4): 119-121,115. (Huang Mingxuan, Chen Yanhong. Studies on Association Rules Mining Techniques[J].Journal of Intelligence, 2008,27(4):119-121,115.)

[17] Wu H, Luk R W P, Wong K F, et al. Interpreting TF-IDF Term Weights as Making Relevance Decisions [J]. ACM Transactions on Information Systems (TOIS), 2008, 26(3): Article No.13.

[18] Tan P, Steinbach M, Kumar V. Introduction to Data Mining [M]. Boston: Pearson Addison-Wesley, 2005.

[19] Herlocker J L, Konstan J A, Terveen L G, et al. Evaluating Collaborative Filtering Recommender Systems[J].ACM Transactions on Information Systems (TOIS), 2004, 22(1): 5-53.

[20] Blei D M, Ng A Y, Jordan M I. Latent Dirichlet Allocation [J]. The Journal of Machine Learning Research, 2003,3: 993-1022.

[21] Wikipedia. Jaccard Index [EB/OL]. [2013-10-08]. http://en. wikipedia.org/wiki/Jaccard_index.

[1] Ming Yi,Tingting Zhang. Ranking Answer Quality of Popular Q&A Community[J]. 数据分析与知识发现, 2019, 3(6): 12-20.
[2] Junliang Yao,Xiaoqiu Le. Semantic Matching for Sci-Tech Novelty Retrieval[J]. 数据分析与知识发现, 2019, 3(6): 50-56.
[3] Haixia Sun,Lei Wang,Yingjie Wu,Weina Hua,Junlian Li. Matching Strategies for Institution Names in Literature Database[J]. 数据分析与知识发现, 2018, 2(8): 88-97.
[4] Lei Li,Daqing He,Chengzhi Zhang. Survey on Social Question and Answer[J]. 数据分析与知识发现, 2018, 2(7): 1-12.
[5] Junwan Liu,Bo Yang,Feifei Wang. Ranking Scholarly Impacts Based on Citations and Academic Similarity[J]. 数据分析与知识发现, 2018, 2(4): 59-70.
[6] Lixin Zhou,Jie Lin. Extracting Product Features with NodeRank Algorithm[J]. 数据分析与知识发现, 2018, 2(4): 90-98.
[7] Jun Hou,Kui Liu,Qianmu Li. Classification Recommendation Based on ESSVM[J]. 数据分析与知识发现, 2018, 2(3): 9-21.
[8] Chaofan Yang,Zhonghua Deng,Xin Peng,Bin Liu. Review of Information Retrieval Research: Case Study of Conference Papers[J]. 数据分析与知识发现, 2017, 1(7): 35-43.
[9] Qiangbing Wang,Chengzhi Zhang. Constructing Users Profiles with Content and Gesture Behaviors[J]. 数据分析与知识发现, 2017, 1(2): 80-86.
[10] Guanghui Ye, Lixin Xia. Review of Expert Retrieval and Expert Ranking Studies[J]. 数据分析与知识发现, 2017, 1(2): 1-10.
[11] Wanying He,Jianlin Yang. Ranking Learning Method Based on Random Walk Model[J]. 数据分析与知识发现, 2017, 1(12): 41-48.
[12] Zhiqiang Wu,Zhongming Zhu,Wei Liu,Wangqiang Zhang,Xiaona Yao. Retrieving 3D Models from Institutional Repository[J]. 数据分析与知识发现, 2017, 1(1): 73-80.
[13] Xiaojuan Zhang, Yi Han. Reviews on Temporal Information Retrieval[J]. 数据分析与知识发现, 2017, 1(1): 3-15.
[14] Mingxuan Huang. Cross Language Information Retrieval Model Based on Matrix-weighted Association Patterns Mining[J]. 数据分析与知识发现, 2017, 1(1): 26-36.
[15] Ding Heng,Lu Wei. Building Standard Literature Knowledge Service System[J]. 现代图书情报技术, 2016, 32(7-8): 120-128.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn