Please wait a minute...
New Technology of Library and Information Service  2005, Vol. 21 Issue (6): 26-29    DOI: 10.11925/infotech.1003-3513.2005.06.08
Current Issue | Archive | Adv Search |
on the Specific Topic on Web
Ding Yi
(Department of Computer Science and Technology, Hubei Normal University, Huangshi 435000, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

Information Retrieval (IR) on the Web is the automatic retrieval of all relevant documents, the same as resource finding of intended Web documents, while the same time retrieves as few of the non-relevant as possible. Web IR has become very popular and favorite at present. It concentrates on the using traditional text IR methods in the Internet, as well as the properties of Web graph. This research focuses on how to effectively and broadly get relevant Web pages and contents, filter Web pages and assign proper labels for them. Accurate finding user-specific information in the Web is very difficult. And traditional Web search engines take a query as input and produce a set of (hopefully) relevant pages that match the query terms. While useful in many circumstances, search engines have the disadvantage that users have to formulate queries that specify their information need, which is prone to errors. Based on the discussion of Page Rank, HITS and similarity between Web texts, some new algorithms called RG-HITS (Resemblance Graph-HITS) for finding relevant documents on the Web are introduced.

Key wordsWeb mining      Web search      Similarity scoring      Information retrieval     
Received: 08 February 2005      Published: 25 June 2005
: 

TP311

 
Corresponding Authors: Ding Yi     E-mail: a_carrie@sina.com
About author:: Ding Yi

Cite this article:

Ding Yi. on the Specific Topic on Web. New Technology of Library and Information Service, 2005, 21(6): 26-29.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2005.06.08     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2005/V21/I6/26

1Filippo Menczer, Gautam Pant, Padmini Srinivasan, et al. Evaluating Topic-Driven Web Crawlers. 21st ACM International Conference on Research and Development in Information Retrieval .New Orleans,Lonisiana,USA.2002:241-249
2T. Hofmann. The cluster-abstraction model: Unsupervised learning of topic hierarchies from text data. Proceedings of 16th International Joint Conference on Artificial Intelligence (IJCAI'99). Stockholm, Sweden. 1999:682-687
3Kleinberg M.Authoritative Sources in a Hyperlinked Eveironment.Journal of the ACM,1999,46(5):604-632
4U. Y. Nahm and R. J. Mooney. Ua mutually beneficial integration of data mining and information extraction. Proceedings of the 17th National Conference on Artificial Intelligence (AAAI '00). AAAI Press, 2000:627-632
5叶允明,马范援,于水等. Igloo分布式爬虫系统的性能优化. 李晓明,李星主编. 搜索引擎与Web挖掘进展. 北京:高等教育出版社,2003:1-8

[1] Huang Mingxuan,Jiang Caoqing,Lu Shoudong. Expanding Queries Based on Word Embedding and Expansion Terms[J]. 数据分析与知识发现, 2021, 5(6): 115-125.
[2] Meng Zhen,Wang Hao,Yu Wei,Deng Sanhong,Zhang Baolong. Vocal Music Classification Based on Multi-category Feature Fusion[J]. 数据分析与知识发现, 2021, 5(5): 59-70.
[3] Li Yueyan,Wang Hao,Deng Sanhong,Wang Wei. Research Trends of Information Retrieval——Case Study of SIGIR Conference Papers[J]. 数据分析与知识发现, 2021, 5(4): 13-24.
[4] Mingxuan Huang,Shoudong Lu,Hui Xu. Cross-Language Information Retrieval Based on Weighted Association Patterns and Rule Consequent Expansion[J]. 数据分析与知识发现, 2019, 3(9): 77-87.
[5] Sun Haixia,Wang Lei,Wu Yingjie,Hua Weina,Li Junlian. Matching Strategies for Institution Names in Literature Database[J]. 数据分析与知识发现, 2018, 2(8): 88-97.
[6] Yang Chaofan,Deng Zhonghua,Peng Xin,Liu Bin. Review of Information Retrieval Research: Case Study of Conference Papers[J]. 数据分析与知识发现, 2017, 1(7): 35-43.
[7] Zhang Xiaojuan,Han Yi. Reviews on Temporal Information Retrieval[J]. 数据分析与知识发现, 2017, 1(1): 3-15.
[8] Huang Mingxuan. Cross Language Information Retrieval Model Based on Matrix-weighted Association Patterns Mining[J]. 数据分析与知识发现, 2017, 1(1): 26-36.
[9] Ding Heng,Lu Wei. Building Standard Literature Knowledge Service System[J]. 现代图书情报技术, 2016, 32(7-8): 120-128.
[10] Heng Ding, Wei Lu. A Study on Correlation-based Cross-Modal Information Retrieval[J]. 现代图书情报技术, 2016, 32(1): 17-23.
[11] Mao Jin, Li Gang, Cao Yujie. Re-rank Retrieval Results Through Subject Indexing[J]. 现代图书情报技术, 2014, 30(7): 48-55.
[12] Qiu Junping, Fang Guoping. The Comparative Analysis of Natural Language Processing Research at Home and Abroad Based on Knowledge Mapping[J]. 现代图书情报技术, 2014, 30(12): 51-61.
[13] Tang Jingxiao,Lv Xueqiang,Liu Chengyang,Li Han. A Hierarchical Framework for User Intention Recognition[J]. 现代图书情报技术, 2014, 30(1): 36-42.
[14] Zhang Mei, Duan Jianyong, Xu Jichao. Person Name Attribute Knowledge Mining and Its Application for Query Classification[J]. 现代图书情报技术, 2013, 29(9): 82-87.
[15] Zhou Shanshan, Bi Qiang, Gao Junfeng. A Method of Information Retrieval Results Visualization Based on Social Network Analysis[J]. 现代图书情报技术, 2013, 29(11): 81-85.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn