This paper reviews the state-of-the-art of the development of digital library, analyzes the trend of application of grid computing, and discusses the usage of grid technique in modern digital library, with conclusion of the future of digital library and the issues needed to be solved of using grid computing together with digital library.
Information grid is an important application type of grid computing. This paper discusses the background, definition and key technique of information grid. A typical example of information grid - Shanghai Grid is described in details to show the key concept and application of information grid.
The article is structured as follows. Firstly, we try to design a DTD of articles of science and technology. Secondly, we analyze the structure of PDF documents. Based on that, we dwell on the design of a PDF information extraction system, which use the above-mentioned DTD as a template， transfer a PDF-formatted scientific and technological article to a valid XML document.
iSCSI is a new ip-based storage protocol. The experiments reveal that iSCSI can be well used to construct storge system for digital library and efficiently reduce costs because it can make full use of existing network and hardware resources.
The paper illustrates the basic composing of panel system; explains the main technical mathematical abstraction of panel system; advances LB language and thought of panel-oriented programming; introduces design principle about panel language; describes instruction code set of LB language; points out tactics on limits of authority management for panel system; optimizes the panel system by analyzing instruction efficiency, thus the author make the panel system into a large software system of exploiting and using by combining display model with a lot of inner functions through LB Language.
In the evaluation procedure of library, a lot of conceptions are not only fuzzy but also dynamic. Based on the dynamic fuzzy dependent relation, the means of dynamic fuzzy dependent relation analysis evaluation is put forward in this text, and the effectiveness and operation nature of this evaluation means are incarnated by the way of the concrete application example .
According to the investigation in the development state of library WebPAC in our country, the article has analyzed and compared the system functions, user interface, standardized design of several common WebPAC systems, then put forward some basal assessment criterions of WebPAC. Finally some suggestions on the developing of WebPAC have been brought forward.
There are several methods about extracting, storing, conversing and displaying eigenvalues of image objects. SIMIIRS adopts two methods: database and XML. The article pays more attention to following questions: description method of image resources using XML; establishment XML indexing files of image information; retrieving XML files and finally making the searching and providing image information come true.
This article discusses Web-Based Open Domain Question Answering System and the relevant technologies, such as information retrieval, information extraction, natural language process and so on. It gives a general system architecture of Open Domain Question Answering System and analyzes the component of the it such as question analysis, information retrieval, answer generation.
This paper presents disadvantages of search engines in existence and offers a P2P-based adaptive information retrieval system model. It also discusses the model’s functions, methods, and advantages.
A lot of work has been done on personalized retrieval systems. In this paper, some typical ones which are widely used in the current Web environment are analyzed. Then, we introduce the common model of personalized retrieval, classify the systems through four different aspects and list those chief ones. After that, we describe the core modules of personalized retrieval. In the end, it is a brief introduction of a method to enhance the retrieval effectiveness.
That properly and completely extracting the content of search Web pages is the basic precondition for handling the information retrieved.This paper analyses the structure characteristic of Google Web pages,presents a group of regular expressions for matching the content of these pages,and realizes a content extractor with Visual C#.The results from practical application to many Google Web pages shows that the matching method with regular expressions can extract the whole main content of Google Web pages.
The goal of Information Resources Integration (IRI) is gathering and ordering the dispersive or disordered information resources to facilitate users’ entrance into information systems. The key problem of IRI include: the theoretical foundation of IRI, the method and corresponding technology of IRI, the innovation of information service because of IRI, etc. A detailed survey of IRI is provided in this paper.
This paper analyzes the application status of digital resources in domestic colleges, discusses the issue of digital resources integration, and studies the resource integration platform in college with example of DIPS and DSpace. DIPS puts emphasis on processing digital resources, integrating different-structure resources, and building special topic databases. DSpace emphasizes the management of digital resources, especially focusing on information storage. The advanced structure and customization interface of DSpace set up the foundation for the function extension in the future.
With the question that whether Web resources are appropriate for citation, we do the sampling inquiry on Web citations from journal articles these years. In order to find a solution for the question, we brought forward the half-life of availability, which reflects the stability of Web pages. Based on this, we analyze the dynamic and instability of Web resources and draw the conclusion that Web citations are not proper to use nowadays.
This paper discusses the digitization standards in the scope of library traditional collection. It introduces the principles of the formulation of the standards, makes analysis on the types and features of library collection and describes the digital resource file format and relative techniques. The authors then propose the detailed digitization level as well as the distribution standards for several typical types of library collection.
This paper researches into an information technology, which could real-timely extract the interested information from data-type Web pages. The technology we employ could intelligently identify table structures, and automatically separate different kinds of data. In the process of analyzing and classifying data, it adopts the combination of sorting by words and dividing by table structure, which depends on the idea of ontology and aggregates a series of mature models, such as SVM and HMM. The technology, which has passed the test, is applied into a dynamic information gathering system of a TBT early-warning system and does a good work.
The article expresses the rights，duties and legal responsibilities of libraries in copyright protection．
With the double-quick development of network, the tradition mode of reference in library of university has been strongly impact. In order to use all kinds of resource of library expediently, digital reference service comes up. This paper begins with the requirement of quick answer, we uses SQL Server 2000 and ASP developing tool to develop digital reference services system in campus network. The system is running in library for half a year, which can well meet our requirements.
Commonly XML is used for data description, data storage and data interchange, and it’s application will be limited if a valid means for searching information is absent. This paper introduces briefly the XQuery language, discusses the functions and realization of a Website based on ASP.NET, describes the technical essences and realized codes for XQuery processor based on .NET, and analyses several examples in books management with XQuery.
Take the example of the application of Oracle database in management system of library, the article introduces three kinds of database backup, and make compare among the three modes respectively.