This paper comprehensively reviews the iPRES 2007 international conference on digital preservation. It introduces developments in digital preservation policies, strategies, planning, and infrastructure, along with related management issues, technical research and practice, and certification and assessment. It also analyzes and summarizes the experience and lessons gained in practice, discusses outstanding problems, and identifies the key concerns for the next phase of digital preservation.
The paper management and service system based on Struts+Ajax integrates paper submission, paper management, and OAI service, and meets most paper-management needs in universities. The system is built mainly on Struts and Prototype. This paper describes the implementation of the system, showing how Prototype and Struts are applied together.
This paper introduces OAI and the open-source software (OSS) ARC. On this basis, the authors describe the harvest and service system in detail, including the design approach, system design framework, implementation, and test results. Finally, the authors discuss OSS and the internationalization of Java. The paper mainly analyzes the system architecture, its functions, and the technology and theory behind its implementation.
This paper proposes an architecture and solution for a union search and extended service system based on the Struts-Hibernate design pattern and Ajax technology. It focuses on the lightweight architecture, the program flow, and the design and implementation of the main modules.
This paper introduces improvements to digital library portal integration technologies based on Web 2.0, covering data, management, and service integration. Data integration covers local data, commercial data, and Internet data. Management integration covers privilege management, spider management, and OAI/METS service management. Service integration covers SSO, united search, resource navigation, an OAI/METS data provider, and RSS feeds. The paper emphasizes the application of Web 2.0 OSS and Ajax, along with some details of practical applications.
This paper describes the various types of ontology heterogeneity problems and introduces the concept of ontology matching and its relationship to ontology mapping. Based on a summary of the major theories of ontology matching, the paper sketches a theoretical framework for ontology matching, including matching granularity, matching parameters, matching operations, matching strategy, and the matching process.
This paper reviews research on automatic indexing. First, the object of indexing in automatic indexing is identified. Then, the three phases of automatic indexing over the past 50 years and their representative methods are described, and the road map of automatic indexing research is explained in detail. Classifications of keyword extraction methods and of keyword assignment methods are put forward. Finally, open issues in automatic indexing are summarized, and future research topics and applications are discussed.
This paper constructs an Article Novelty Evaluation System based on Sentence Matching (ANES-SM), which aims to overcome the difficulty of manually recognizing content that an article shares with other articles. The architecture of ANES-SM is presented, and the workflow and algorithms of its key modules are analyzed and designed, including the sentence analyzer, the sentence matcher, and the article novelty evaluator. Experiments show that the system is feasible.
This paper focuses on how to crawl Weblogs effectively in selected sections of the Web, and puts forward an RSS-based algorithm for gathering Weblogs. The authors design two crawlers: one gathers RSS feeds by performing a breadth-first traversal of the Web, and the other automatically tracks updated Weblogs by performing a vertical search of every RSS feed. A prototype system is also implemented.
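As a rough illustration of the second crawler's job, tracking updated Weblogs through their feeds, the sketch below parses an RSS 2.0 document and returns only the entries not seen on earlier visits. It is not the authors' algorithm: the function name `new_entries` and the use of item links as the identity of an entry are assumptions made for this example, and the breadth-first feed discovery crawler is not shown.

```python
import xml.etree.ElementTree as ET


def new_entries(rss_xml, seen_links):
    """Return (title, link) pairs for feed items not seen before.

    Hypothetical helper: an entry is identified by its <link> text,
    and seen_links is updated in place so repeat visits skip old items.
    """
    root = ET.fromstring(rss_xml)
    fresh = []
    for item in root.iter("item"):
        link = (item.findtext("link") or "").strip()
        title = (item.findtext("title") or "").strip()
        if link and link not in seen_links:
            fresh.append((title, link))
            seen_links.add(link)
    return fresh
```

Calling `new_entries` again with the same feed text and the same `seen_links` set returns an empty list, which is the behavior a periodic tracker would rely on.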
This paper proposes a new solution for MIS called the user-oriented interface design approach. A user-oriented visual interface design environment is designed, and the basic principle of the new method is introduced in detail. Finally, the authors apply this method to a document management information system for international cargo transportation. Tests show that it meets users’ varied business requirements well and is easy to use.
On the basis of the intelligent multi-agent system and the blackboard cooperation mechanism, this paper mainly discusses improvements to the user agent. The authors then develop a User Agent based Personalized Information Retrieval System (UAPIRS). They put forward the framework of the whole system, analyze the division and data format of the user agent blackboard, and describe task decomposition, the task information area, the organization of the communication information area, and how the monitoring mechanism is implemented. Finally, the paper discusses coordination mechanisms for multi-agent systems.
After discussing the origin, basic principles, and architecture of focused crawlers, the authors analyze the features of WebSPHINX and then design a focused crawler based on it.
This paper proposes a new document copy detection algorithm based on sentence similarity. To improve detection accuracy, the authors consider not only the document as a whole but also its structure. Finally, experiments compare the new algorithm with other typical algorithms, and the results show that it is feasible.
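A minimal sketch of sentence-level copy detection follows. It is an illustration only, not the algorithm proposed in the paper: the choice of Jaccard similarity over word sets, the 0.8 threshold, and the names `sentence_similarity` and `copy_ratio` are all assumptions, and the structural features mentioned in the abstract are not modeled.

```python
import re


def split_sentences(text):
    """Split text on ., ! and ? and drop empty pieces."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def word_set(sentence):
    """Lowercased set of word tokens in a sentence."""
    return set(re.findall(r"\w+", sentence.lower()))


def sentence_similarity(a, b):
    """Jaccard similarity between the word sets of two sentences."""
    wa, wb = word_set(a), word_set(b)
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def copy_ratio(suspect, source, threshold=0.8):
    """Fraction of suspect sentences with a close match in the source.

    The threshold is an assumed value chosen for illustration.
    """
    suspect_sents = split_sentences(suspect)
    source_sents = split_sentences(source)
    if not suspect_sents:
        return 0.0
    matched = sum(
        1 for s in suspect_sents
        if any(sentence_similarity(s, t) >= threshold for t in source_sents)
    )
    return matched / len(suspect_sents)
```

A suspect document in which half of the sentences reappear in the source yields a ratio of 0.5 under this measure; a real detector would also need to handle paraphrase and word order, which word-set overlap ignores.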
The paper introduces the background of affective information processing and discusses its two main research branches and the current state of research. Then, from the perspective of the processing procedure, it summarizes three major technologies in affective information processing. Finally, the author points out the significance of applying affective information processing in library and information science.
This article briefly introduces the history, functionality, system architecture, and usage of the open-source software Solr. It defines faceted browsing and analyzes how it differs from the traditional search approach. It then presents a fast and efficient way to build faceted browsing: background services parse MARC data and pass it to Solr, which generates the index files. Solr performs well even with millions of records and merits wider adoption.
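To make the idea of faceted browsing concrete, the sketch below counts facet values over a small set of records and narrows the set when a value is chosen. It is plain Python and does not call Solr; the record layout, the field names, and the helpers `facet_counts` and `drill_down` are hypothetical and only illustrate the concept.

```python
from collections import Counter


def facet_counts(records, fields):
    """Count how many records carry each value of each facet field."""
    counts = {field: Counter() for field in fields}
    for record in records:
        for field in fields:
            value = record.get(field)
            if value is None:
                continue
            # A field may hold one value or a list of values.
            values = value if isinstance(value, list) else [value]
            for v in set(values):
                counts[field][v] += 1
    return counts


def drill_down(records, field, value):
    """Keep only the records whose field contains the chosen facet value."""
    kept = []
    for record in records:
        current = record.get(field)
        values = current if isinstance(current, list) else [current]
        if value in values:
            kept.append(record)
    return kept
```

Unlike a traditional search, where the user must guess a query, the counts tell the user in advance how many records each choice would leave, and each `drill_down` step can be followed by a fresh `facet_counts` call on the narrowed set.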
This paper designs an automatic classification model that can analyze the experience captured in manual indexing. The Military Information Resources Classification plays an important role in the design.
This paper presents the design of a Z39.50 client based on a half-B/S (browser/server) mode. The key technical problems in implementing the client are discussed and solved. A system developed in practice shows that the design is advanced.
By investigating the current state of university library office networks, the author finds that interoperability between CERNET and other networks is the main factor affecting the efficiency of university library networks, and proposes building a secure and seamless library network using RouterOS. The results show that the scheme has many advantages, such as low investment, high efficiency, high security, and flexible configuration.
The author studies several prominent problems in constructing virtual communities in academic libraries. Based on the positioning of the community's service, the author proposes a plan and an algorithm for community division. The author also discusses the establishment of a community ecosphere and an information resource sharing mechanism.
This article discusses the system structure, methods, and technology of commercial data mining. Visualization is the best way to make the results of commercial data mining directly and conveniently usable. The article covers feature value extraction from commercial data, the processing flow, algorithms, construction of a feature library, dimension reduction mapping, and generation of the visualized result. It examines the theory and application of commercial data mining in depth by mining case data from Taobao and implementing the visualization approach.