This paper reviews the development history of Web Archive, analyses the progress and characteristics of the three different stages of the initial experiment, deployment and application of Web Archive. By summing up researches of Web Archive and international practice in recent years, the authors give an initial view for the future development trend of Web Archive, hope to be a valuable reference for Chinese Web archive researches.
This paper summarizes three commonly used harvest strategies in Web Archive:the integrity harvest, selective harvest and hybrid harvest. Then comparatively analyzes characteristics of various harvest strategies, key issues and representation projects. Finally, some key factors need to consider in choosing the harvest strategy are analyzed and general recommendations are made.
Archiving strategy as the important research area of Web Archive, has been concerned by many projects, this paper selects several typical strategies: compressed archiving with external index, archiving with multi-master, format migration archiving, and characteristic described archiving , and analyzes their preservation context, characteristics ,as well as the application of the strategy to achieve, and provides valuable reference for Web Archive research in China.
Based on existing Web Archive projects,this paper finishes a preliminary analysis of retrieval system architecturethat are applied by these projects and how they cope with the challenge which is to search infomation in massive data collection. From the view of system architecture, it discusses archive retrieval system performance and efficiency and wish to provide some references to the relevantinstitutes and researchers.
This article introduced current applications of web archive resources, and then from the perspective of data mining, analyzes and sums up the in-depth applications of web archive resources.
This paper induces and discusses the Mashup technology, including system framework, resource acquiring technology, representation component technology, server technology and merging technology. The resource acquiring technology includes Web Feed, API, REST protocol and screen scraping .The representation component is classified into Portlet and Widget. The server technology is illustrated by Kapow Mashup Server. The merging technology puts an emphasis on merging schema, programming language and Mashup tools.Finally, it points out the existing problem and the research direction in future.
Based on the survey of researchers, librarians and decision makers in the chinese academy of sciences and some domestic universities, the authors compare the difference of cognition and requirement of IR between different roles. The paper elaborates the problems of planning and construction of IR in China to provide reference for the implementation of IR in domestic scientific research institutions and universities.
The role of Ontologies in knowledge retrieval is presented. The following subsections detailedly describe the usage of SWRL(Semantic Web Rule Language) and Protégé-OWL API , as well as give a presentation about implementation of reasoning based on Ontologies. The paper concludes with a discussion on test results and outlines of future work.
According to structured degree of data on Web, this paper discusses all kinds of techniques, methods and gettable ontology elements, existing problems in ontology learning, which always used to achieve semantic web. It also introduces and compares existing ontology learning systems which integrate with multi-techniques, and analyzes their adopted key techniques, objects in point and result description.
This paper presents the advantage of information foraging theory compared with traditional information retrieval theory and user behavior analysis theory, then gives a research content framework for information foraging theory, and extends the framework based on a through review of the two research branches, the basic concept of information foraging theory, the elementary models of information foraging theory. Several problems for future research are also identified through.
In view of the current E-commerce Recommender System can not be good for unregistered users, the paper set two different sets of data collection program according to the characteristics of unregistered users and registered users, in order to enhance the friendly of website and the accuracy of the data. Because the decision tree algorithm and bayesian network algorithms both have advantages and disadvantages, the paper uses a combination of two algorithms, and introduces the content-based algorithm to research the attribute of goods to improve the accuracy of the recommendation. The experiments prove that these methods can provide good service for unregistered users and the recommendation based on the hybrid algorithm is superior to single algorithm.
This paper briefly cards the development of a commercial Electronic Resource Management System (ERMS),introduces general application of the systems and summarizes the core functions of the ERMS products,and focuses on analysis and evaluation of the practical application of the three ERMS products. Finally,paper points out some suggestions that need pay attention to in the process that Chinese libraries introduce and implement the commercial ERMS products.
This paper introduces the application of SMS in circulation field of Tsinghua University Library. The design and implementation of short message content extraction subsystem and short message transmission subsystem are expatiated.
On the basis of Struts design patterns and Web Services technology, a solution of system structure about universal mobile phone library system is proposed, which is independent of the system, platform and terminal. The design thoughts,development framework,design and realization of main function modules and key technologies of the mobile library system are expatiated in this paper.
The paper introduces the design and implementation of middle data synchronous system based on entrance guard system and integrated management system for library of Ludong university. This system realizes reader data synchronous tracking，and resolves the problems about delaying reader information and difficulty of adding and deleting reader existing in entrance guard system, which provide convenients for readers and improves efficiency.