Please wait a minute...
New Technology of Library and Information Service  2007, Vol. 2 Issue (6): 70-73    DOI: 10.11925/infotech.1003-3513.2007.06.16
Current Issue | Archive | Adv Search |
Design and Implementation of a Search Engine for K12-related Websites
Chen Quan   Cao Zhuowen  Yang Xiaojiang
(Department of Education Technology, Nanjing Normal University, Nanjing 210097, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

On the basis of some research work on metadata for Website,this paper introduces a search engine system for Websites related to K12 education. Combined with the characters of K12-related Websites,some key technologies are analyzed, such as topic-spider search, Website classification,information extraction for site etc. Both of the whole architecture of the system and the function modules are described in detail in the paper.

Key wordsTopic spider      Website classification      Information extraction      Search engine     
Received: 20 April 2007      Published: 25 June 2007
: 

TP393

 
Corresponding Authors: Yang Xiaojiang     E-mail: xjyang@njnu.edu.cn
About author:: Chen Quan,Cao Zhuowen,Yang Xiaojiang

Cite this article:

Chen Quan,Cao Zhuowen,Yang Xiaojiang. Design and Implementation of a Search Engine for K12-related Websites. New Technology of Library and Information Service, 2007, 2(6): 70-73.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2007.06.16     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2007/V2/I6/70

1教育部基础教育课程教材发展中心. CELTS-42基础教育资源元数据应用规范. http://www.celtsc.edu.cn/680751c665875e93/folder.2006-04-03.8417036039/celts-42/celts-42-1-cd1-6.pdf(Accessed Mar.19,2007)
2Hersovici M, Heydon A, Mitzenmacher M, et al. The Shark-Search Algorithm-An Application: Tailored Web Site Mapping. World Wide Web Conference,1998
3Cho J, Garcia-Molina H, Page L. Efficient Crawling through URL Ordering . Computer Networks, 1998, 30(1-7):161-172
4Kleinberg J M. Authoritative Sources in a Hyperlinked Environment. Association for Computing Machinery,1999,46(5):604-632
5Hans-Peter Kriegel, Matthias Schubert, Classification of Websites as Sets of Feature Vectors . The IASTED International Conference, Austria, 2004
6余智华. WWW站点的分析与分类:[学位论文]. 北京:中国科学院,1999
7田俊华. 基于Web的中文文本自动分类研究与实现:[学位论文].南京:南京师范大学,2004
8杨晓江. .Net环境下Web应用的通用设计. 计算机工程与设计, 2003(10):46-49

[1] Tan Ying, Tang Yifei. Extracting Citation Contents with Coreference Resolution[J]. 数据分析与知识发现, 2021, 5(8): 25-33.
[2] Wang Yi,Shen Zhe,Yao Yifan,Cheng Ying. Domain-Specific Event Graph Construction Methods:A Review[J]. 数据分析与知识发现, 2020, 4(10): 1-13.
[3] Tao Yue,Yu Li,Zhang Runjie. Active Learning Strategies for Extracting Phrase-Level Topics from Scientific Literature[J]. 数据分析与知识发现, 2020, 4(10): 134-143.
[4] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[5] Chengzhi Zhang,Zheng Li. Extracting Sentences of Research Originality from Full Text Academic Articles[J]. 数据分析与知识发现, 2019, 3(10): 12-18.
[6] Mu Dongmei,Jin Shan,Ju Yuanhong. Finding Association Between Diseases and Genes from Literature Abstracts[J]. 数据分析与知识发现, 2018, 2(8): 98-106.
[7] Liu Tong,Ni Weijian,Liu Mei. Identifying Terminology from Search Engine Query Logs[J]. 现代图书情报技术, 2016, 32(2): 25-33.
[8] Yufeng Duan,Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[9] Tong Guoping, Sun Jianjun. User Behavior Analysis Based on Search Engine Log[J]. 现代图书情报技术, 2015, 31(7-8): 80-88.
[10] Liu Wei, Wang Xing, Song Peiyan. A Noise Cleaning Method for Synonym Extraction Results[J]. 现代图书情报技术, 2015, 31(6): 64-70.
[11] Wang Xiwei, Zhao Dan, Yang Mengqing, Wei Junwei. Indices and Empirical Research on Search Engine Optimization of the Industry Websites: An Analysis from the Perspective of Information Ecology[J]. 现代图书情报技术, 2015, 31(3): 75-83.
[12] Jiang Chuntao. Automatic Annotation of Bibliographical References in Chinese Patent Documents[J]. 现代图书情报技术, 2015, 31(10): 81-87.
[13] Li Xiangdong, Huo Yayong, Huang Li. Study of Book Pages Automatic Identification and Bibliographic Information Extraction[J]. 现代图书情报技术, 2014, 30(4): 71-77.
[14] Liu Yajing, Wang Yanxi, Hao Dan, Zhou Jinhui. Study on the Methods of Institutional Repository Supporting Research Services[J]. 现代图书情报技术, 2014, 30(3): 1-7.
[15] Chen Yong, Li Honglian, Lv Xueqiang. Analysis for the Search Behavior of Web Users[J]. 现代图书情报技术, 2014, 30(12): 10-17.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn