Please wait a minute...
New Technology of Library and Information Service  2007, Vol. 2 Issue (8): 63-66    DOI: 10.11925/infotech.1003-3513.2007.08.15
Current Issue | Archive | Adv Search |
Design and Implementation of Enterprise Search Engine Based on Lucene
Chen Yanchun1   Li Shuangping2
1(Economic & Management Institute,Shijiazhuang Railway Institute,Shijiazhuang  050043,China )
2(Ewayboke Corporation Limited,Beijing 100010,China)
Download: PDF (629 KB)  
Export: BibTeX | EndNote (RIS)      
Abstract  

The enterprise-level search engine is proposed to solve the problem that enterprise have abundant document information resources but lack of effective search tools.The function and the overall framework of the enterprise-level search engine are analyzed firstly.Lucene indexer is studied in depth during implementation secondly. Then the plug-in unit is used to carry out the analysis and extraction of different types of documentsi in design. A set of parallel multi-task scheduling mechanism is established in the task scheduling. When the user interface is designed,Yui-ext components and DWR remote object invocation framework is applied to implement asynchronous communication by the Web,which can promote the users’ experience.

Key wordsSearch engine      Lucene      Plug-in      Crawler     
Received: 06 July 2007      Published: 25 August 2007
ZTFLH: 

TP393

 
Corresponding Authors: Chen Yanchun     E-mail: chenyanchunsjz@163.com
About author:: Chen Yanchun,Li Shuangping

Cite this article:

Chen Yanchun,Li Shuangping. Design and Implementation of Enterprise Search Engine Based on Lucene. New Technology of Library and Information Service, 2007, 2(8): 63-66.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2007.08.15     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2007/V2/I8/63

[1] 李刚,宋伟,邱哲.征服Ajax+Lucene-构建搜索引擎[M]. 北京:人民邮电出版社,2006.
[2] 邱哲,符滔滔.Lucene+Heritrix开发自己的搜索引擎[M]. 北京:人民邮电出版社,2007.
[3] 金恩年.智能商品查询服务系统的研究与设计[D].上海:华东师范大学,2006
[4] 丛磊.桌面搜索引擎的研究与实现[D].北京:北京化工大学,2006
[5] Gospodnetic O, Hatcher E.Lucene in Action[M].USA:Manning Publications Co.,2006.
[6] 孙卫琴.精通Hibernate Java对象持久化技术详解[M].北京:电子工业出版社,2005.

[1] Liu Tong,Ni Weijian,Liu Mei. Identifying Terminology from Search Engine Query Logs[J]. 现代图书情报技术, 2016, 32(2): 25-33.
[2] Wang Peixia,Yu Hai,Chen Li,Wang Yongji. Using Intelligent System to Extract Search Terms for Sci-Tech Novelty Retrieval[J]. 现代图书情报技术, 2016, 32(11): 82-93.
[3] Tong Guoping, Sun Jianjun. User Behavior Analysis Based on Search Engine Log[J]. 现代图书情报技术, 2015, 31(7-8): 80-88.
[4] Wang Xiwei, Zhao Dan, Yang Mengqing, Wei Junwei. Indices and Empirical Research on Search Engine Optimization of the Industry Websites: An Analysis from the Perspective of Information Ecology[J]. 现代图书情报技术, 2015, 31(3): 75-83.
[5] Fang An, Wu Sizhu, Hong Na, Qian Li, Wang Ying, Hu Jiahui. Design and Implementation of Integrated Service System for STKOS Related Tools[J]. 现代图书情报技术, 2015, 31(3): 92-100.
[6] Chen Yong, Li Honglian, Lv Xueqiang. Analysis for the Search Behavior of Web Users[J]. 现代图书情报技术, 2014, 30(12): 10-17.
[7] Qian Li, Zhang Xiaolin, Li Chunwang, Wang Xiaomei, Yang Liying, Chen Ting, Zhang Zhixiong. Research and Application of Science Intelligence Analysis Integrated Services Architecture Using OSGi[J]. 现代图书情报技术, 2014, 30(12): 62-70.
[8] Qiao Jianzhong. An Improved Best-First Search Algorithm Based Focused Crawling Research[J]. 现代图书情报技术, 2013, 29(7/8): 28-35.
[9] Li Wenjiang, Chen Shiqin. Application of AIMLBot Intelligent Robot in Real-time Virtual Reference Service[J]. 现代图书情报技术, 2012, 28(7): 127-132.
[10] Qiao Jianzhong. Statistical Characteristics Based Web Page Relevance Judgment Strategy for the “Type” Topics Crawled[J]. 现代图书情报技术, 2012, 28(6): 9-16.
[11] Huang Wei, Jin Yabo, Hu Changlong. Focused Crawling for Network Public Opinion’s Topic Information[J]. 现代图书情报技术, 2012, (11): 65-71.
[12] Xian Guojian, Zhao Ruixue, Zhu Liang, Kou Yuantao. Conversion and Consumption of Chinese Agricultural Thesaurus as SKOS[J]. 现代图书情报技术, 2012, (10): 16-20.
[13] Zhang Liyi, Chen Mingying. Research on the Sensitivity and Specificity of Search Engines[J]. 现代图书情报技术, 2011, 27(7/8): 41-46.
[14] Xian Guojian, Zhao Ruixue. Research and Implementation of Chinese Agricultural Journals’ Abstracts Retrieval System Based on Solr[J]. 现代图书情报技术, 2011, 27(6): 51-58.
[15] Wang Jimin, Lilei Mingzi, Zhang Peng. Co-authorship Network Analysis in the Research Field of Search Engine’s Log Mining[J]. 现代图书情报技术, 2011, 27(4): 58-63.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn