This paper presents the research background and related work on Document Clustering Description (DCD). It explains the relationship between DCD and automatic indexing, automatic summarization, and conceptual clustering, and defines the research content of DCD. The tasks of DCD are formalized according to its requirements, and the evaluation methods for DCD are also described.
This paper focuses on collaborative tagging systems and reviews the literature in this field at three levels: theoretical research, empirical studies, and experimental and applied research. Finally, the work done in this article is summarized, and the future development of research on collaborative tagging systems is discussed.
This paper introduces WCONS+, an Ontology integration approach that consists of preparation, mapping, integration, and checking phases. An experiment integrating a military aircraft Ontology with an electronic warfare equipment Ontology is presented, and the results show the effectiveness of WCONS+.
This paper analyzes Chinese patent abstracts on alternative energy vehicles using knowledge engineering methods and puts forward an Ontology-based knowledge extraction model for Chinese patent abstracts. The main stages in building the model are constructing a corresponding Ontology, collecting a related word list, and writing corresponding rules. These rules are used to extract the underlying knowledge in patent abstracts, and the result aids the automatic construction of a patent knowledge base. The paper is an attempt to organize unstructured information and to construct a knowledge base automatically, and it verifies the feasibility of Ontology-based knowledge extraction from patent abstracts.
A library temperature monitoring system is designed based on a micro-power data transmission model. The distributed network consists of several temperature data collection subsystems and a monitoring center, and uses the model FC-210S to realize real-time data processing. This paper describes the components and the design method in detail. The error of the monitored temperature is within ±0.1℃. The system also provides critical-temperature alarms and access to historical data for present and future needs.
e-Science is a new research environment based on Grid technology. This paper analyzes the new changes in data resources and summarizes developments in compound digital objects for data resources, such as encapsulated scientific data, computational resources, workflows, and provenance metadata.
To let domain Ontologies use the general knowledge of top Ontologies for effective reasoning, this paper studies the translation of top Ontologies, such as SUMO and OpenCyc, from OWL Full to OWL DL. Problems that arise when translating the definitions of classes, properties, and instances are handled so that the translated top Ontology conforms to the OWL DL language.
A new set of usability evaluation indicators is put forward on the basis of traditional usability evaluation theory, including intelligibility, operability, information accessing, identifying information, errors, efficiency, and user satisfaction. The authors test the usability of the Chinese Science Citation Database (CSCD) using summative testing and verify the applicability of the new set of indicators. Finally, after analyzing the test results and the sources of the problems, they put forward suggestions for further improving the usability of database retrieval systems.
This paper detects Geographical Political Entities (GPE) and their subtypes in the English corpus of the Automatic Content Extraction (ACE) evaluation, based on Conditional Random Fields (CRFs). A feature set is extracted from the ACE corpus, and the contributions of different feature sets to the detection of GPE entities are evaluated in experiments. The results show that the feature set extracted in this paper achieves higher recall and accuracy.
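As a rough illustration of what a token-level feature set for CRF-based entity detection can look like, the sketch below builds one feature dictionary per token. The abstract does not list the paper's actual features, so every feature name here (`word.lower`, `suffix3`, and so on) and both function names are hypothetical choices for illustration only.

```python
def token_features(tokens, i):
    """Build a feature dict for tokens[i]. All features are illustrative assumptions."""
    word = tokens[i]
    feats = {
        "bias": 1.0,
        "word.lower": word.lower(),
        "word.istitle": word.istitle(),   # capitalized words often start entity names
        "word.isupper": word.isupper(),   # acronyms such as "US"
        "suffix3": word[-3:].lower(),
    }
    # Context features from the neighbouring tokens, with sentence-boundary markers.
    if i > 0:
        feats["prev.lower"] = tokens[i - 1].lower()
    else:
        feats["BOS"] = True
    if i < len(tokens) - 1:
        feats["next.lower"] = tokens[i + 1].lower()
    else:
        feats["EOS"] = True
    return feats


def sentence_features(tokens):
    """Return one feature dict per token, in sentence order."""
    return [token_features(tokens, i) for i in range(len(tokens))]
```

A list of such dictionaries per sentence is the kind of input that typical CRF toolkits accept; comparing models trained with different subsets of the features is one way to measure each subset's contribution.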
This paper presents search interface design principles based on cognitive style models. Taking Google as the test object, it analyzes the style of the search interface, the organization of main categories and subcategories, the listing of hit Web pages, the presentation of search results, and the display of relevant categories. Some improvements to the Google interface are also suggested.
This paper introduces research on vertical search engines and analyzes their features and applications in comparison with general search engines. Practical problems such as improving the precision ratio, search speed, and information acquisition efficiency, and controlling information acquisition quality, are also discussed. Taking the catering vertical search prototype in the 12580 information acquisition system as a reference, it proposes application strategies for information acquisition, information updating, and information extraction.
This paper proposes a conceptual model of user acceptance behavior, using an ERP system as the empirical case. Structural equation analysis is used to validate the hypothesized relationships among the structural variables in the conceptual model. The results show that a majority of the variables proposed in the model have a direct or indirect impact on the user's intention to continue use. The model will help enterprises understand user behavior in Enterprise Information Systems.
This paper gives a brief introduction to link analysis and visualization technology. It studies the status, methods, and steps of applying visualization technology in link analysis, and then analyzes a few representative systems and tools in this field. Finally, it identifies the shortcomings of visualization technology in link analysis and possible directions and methods for addressing them, providing support for the development of powerful visual link analysis tools.
Taking advantage of regular expressions in string manipulation, this paper implements the extraction of oil price information from noisy and irregular Web pages. The key points and difficulties of the implementation are pointed out, and the structural description ability of regular expressions in string manipulation is demonstrated.
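A minimal sketch of regular-expression extraction from noisy markup follows. The sample page fragment, the benchmark names, and the function name `extract_prices` are all invented for illustration and are not taken from the paper; real pages would need patterns tuned to their own layout.

```python
import re

# Hypothetical noisy fragment: mixed tag case, stray whitespace, HTML entities.
SAMPLE_HTML = """
<td class=x>WTI Crude</td><td><b> $71.35 </b>/bbl</td>
<TD>Brent&nbsp;Crude</TD><td>USD 75.02 per barrel</td>
<td>Updated 2008-06-01</td>
"""

PRICE_PATTERN = re.compile(
    r"(?P<name>WTI|Brent)"         # benchmark name (assumed labels)
    r"(?:\s|&nbsp;)*Crude"         # tolerate spaces or &nbsp; before "Crude"
    r".*?"                         # lazily skip intervening tags and noise
    r"(?:\$|USD)\s*"               # currency marker
    r"(?P<price>\d+(?:\.\d+)?)",   # integer or decimal price
    re.IGNORECASE | re.DOTALL,
)


def extract_prices(html):
    """Map each benchmark name found in html to its price as a float."""
    return {
        m.group("name"): float(m.group("price"))
        for m in PRICE_PATTERN.finditer(html)
    }
```

The lazy `.*?` with `re.DOTALL` is what lets the pattern step over irregular markup, but it can also pair a name with a later item's price if a price is missing, which is one of the difficulties such an approach has to guard against.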
This paper introduces interactive system development methods to resolve the ambiguity problem of query translation in Cross-Language Information Retrieval (CLIR). After studying and analyzing CLIR techniques based on relevance feedback, it puts forward an English-Chinese interactive CLIR system. The system implements interactive functions such as user-assisted query translation, multi-level user relevance judgment, translation enhancement, and query expansion. Experiments show that retrieval efficiency is clearly improved.
To address the information security problems faced in digital library construction, this paper analyzes the architecture of a visual network behavior security auditing system based on the B/S structure. It focuses on the design and implementation of the system's Web application, built on the Spring+Struts framework, and of its visual network behavior auditing function. The security auditing system can control users’ behavior, audit it after the event, and help maintain the network security of the digital library.
This paper analyzes the content streams of PDF files based on their structure and automatically extracts semantic metadata from research papers by means of rule-based matching and format-based locating. Experimental results show that this method can effectively extract important semantic metadata such as title and author.
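The combination of format-based locating and rule-based matching can be sketched as follows, assuming the text lines of the first page have already been pulled from the PDF content stream together with their font sizes. The input shape, the "largest font is the title" heuristic, the author-line pattern, and the function name are all assumptions made for illustration, not the paper's actual rules.

```python
import re

# Loose rule for an author line: capitalized names joined by ", " or " and ".
NAME = r"[A-Z][a-z]+(?: [A-Z][a-z]+)+"
AUTHOR_LINE = re.compile(rf"^{NAME}(?:(?:, | and ){NAME})*$")


def extract_title_and_authors(lines):
    """lines: list of (text, font_size) pairs from the first page, in reading order."""
    if not lines:
        return None, []
    # Format-based locating: treat the lines set in the largest font as the title.
    max_size = max(size for _, size in lines)
    title_idx = [i for i, (_, size) in enumerate(lines) if size == max_size]
    title = " ".join(lines[i][0].strip() for i in title_idx)
    # Rule-based matching: the first line after the title that looks like names.
    authors = []
    for text, _ in lines[title_idx[-1] + 1:]:
        text = text.strip()
        if AUTHOR_LINE.match(text):
            authors = re.split(r", | and ", text)
            break
    return title, authors
```

For example, given a running header in 9 pt, two title lines in 18 pt, and a following line "Alice Smith, Bob Jones and Carol White" in 11 pt, the function joins the two 18 pt lines into the title and splits the name line into three authors.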
This paper analyzes the necessity of Mashup applications in the professional information services of libraries and, based on the double value-added principle, points out the shortcomings of currently available Mashup applications. Following the principle of increasing service value, the author then develops a plan for a professional application for book recommendation and book review.