The authors build a semantic model based on metadata, a domain ontology, a bridge ontology, and an ontology analysis system, and then put forward a semantic interconnection application model for digital library knowledge organization. As a prototype, the study examines semantically and architecturally heterogeneous electronic medical records, and designs and partially implements a medicine-oriented semantic interconnection simulation system. Finally, it discusses semantic interconnection in digital library knowledge organization at the application level.
This paper builds a bridge ontology by combining static and dynamic methods. It uses Protégé 3.4 to build the static classes, subclasses, and properties of the bridge ontology, then builds the dynamic instances and property values through semantic similarity calculation on the Eclipse SDK 3.4.2 platform. The result is a shared bridge ontology described in OWL.
The semantic annotation subsystem is the key subsystem of the Medical Oriented Semantic Interconnection Simulation System (MOSISS), an example system from research on the semantic interconnection of digital resources. Guided by ontology-based semantic annotation, the paper expounds the design idea, system structure, and functions of the semantic annotation subsystem in order to explore the application of multi-domain ontologies in semantic annotation, which provides users with a way of sharing data.
This research originates from bibliography organization in the face of the multiplicity of information forms, the variability of the information lifecycle, and the complexity of hybrid objects. The paper proposes and puts into practice the basic idea of semantic bibliography organization: building semantic links based on ontologies and describing semantic information consistently using RDFS/OWL and SPARQL.
In this paper, an E-government information classification system with a B/S (browser/server) structure is designed and implemented based on the MVC pattern. The author describes the system architecture and business logic, with emphasis on four key technologies: data abstraction, business association, access control, and visualization. The system is shown to meet the functional requirements of the classification mechanism and business association, and it visualizes the whole classification structure and the correlations between business categories.
Identifying semantic relationships between terms is a key step in Chinese text information processing. After reviewing existing methods at home and abroad, the paper proposes a process for inducing a semantic hierarchy of terms, which uses a multiple clustering method to obtain the whole hierarchy and combines it with a comprehensive similarity calculation to obtain the class labels. Finally, experiments are conducted to verify its rationality.
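The abstract does not name the clustering algorithm or the similarity measure, so the following is only a minimal sketch of the general idea, under stated assumptions: average-linkage agglomerative clustering stands in for the clustering step, cosine similarity over hypothetical term feature vectors stands in for the comprehensive similarity calculation, and each class is labelled with its most central member. The function names and the sample vectors are illustrative and do not come from the paper.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def average_link(terms_a, terms_b, vectors):
    """Mean pairwise similarity between the members of two clusters."""
    sims = [cosine(vectors[a], vectors[b]) for a in terms_a for b in terms_b]
    return sum(sims) / len(sims)


def pick_label(terms, vectors):
    """Assumed labelling rule: the member most similar to the other members."""
    if len(terms) == 1:
        return terms[0]

    def mean_sim(term):
        others = [o for o in terms if o != term]
        return sum(cosine(vectors[term], vectors[o]) for o in others) / len(others)

    return max(sorted(terms), key=mean_sim)


def induce_hierarchy(vectors):
    """Merge the two most similar clusters until a single root remains."""
    nodes = [{"label": t, "terms": (t,), "children": []} for t in sorted(vectors)]
    while len(nodes) > 1:
        i, j = max(
            combinations(range(len(nodes)), 2),
            key=lambda p: average_link(nodes[p[0]]["terms"], nodes[p[1]]["terms"], vectors),
        )
        left, right = nodes[i], nodes[j]
        terms = left["terms"] + right["terms"]
        merged = {"label": pick_label(terms, vectors), "terms": terms, "children": [left, right]}
        nodes = [n for k, n in enumerate(nodes) if k not in (i, j)] + [merged]
    return nodes[0]


# Hypothetical feature vectors for four terms (illustrative values only).
sample_vectors = {
    "apple": [1.0, 0.9, 0.0],
    "pear": [0.9, 1.0, 0.0],
    "car": [0.0, 0.1, 1.0],
    "truck": [0.0, 0.0, 1.0],
}
root = induce_hierarchy(sample_vectors)
```

In this toy input the two children of the root group the fruit terms and the vehicle terms separately; a real system would derive the vectors from corpus statistics rather than hand-written values.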
In view of users' urgent demand for quick-response information services in current E-government, and from the perspective of information resource organization, this article provides solutions for E-government quick-response information service at the micro-level of data elements. It also constructs a quick-response-oriented E-government information service model and then validates the model.
This paper uses the Self-Organizing Map (SOM) to analyze the salient subjects of 60 foreign journals in the field of Library and Information Science (LIS) and the development trends of the Journal of Information Science (JIS) from 1981 to 2007. An enhanced SOM display method, the Attribute Accumulative Matrix, is employed to identify 7 groups of salient subjects among the 60 investigated journals. A novel SOM display method, Prevalent Attribute Projection, is constructed and combined with the U-matrix to analyze the development process and patterns of JIS's salient subjects over those 27 years. The findings reflect, to some extent, the development patterns of foreign LIS journals, and the methods provide a systematic tool and procedure for analyzing salient subjects and their development trends among journals.
This paper introduces the steps, frameworks, and metrics of approximately duplicate data cleaning. It then surveys the detection algorithms and elimination algorithms by type, together with their improved variants, and gives each algorithm's scope of use, advantages, and disadvantages. Several data cleaning tools, such as Merge/Purge, are presented. Finally, it discusses future research topics in data cleaning and points out that incorporating knowledge and semantics into the data cleaning framework will be an important trend.
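As an illustration of one common family of detection algorithms, the sketch below follows the sorted-neighborhood idea associated with Merge/Purge: sort the records by a key, then compare only records that fall within a small sliding window. It is a minimal sketch, not the survey's own formulation. The similarity measure (`difflib.SequenceMatcher`), the window size, the threshold, and the sample records are all assumptions chosen for the example.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Character-level similarity in [0, 1]; an assumed stand-in metric."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def sorted_neighborhood(records, key, window=3, threshold=0.85):
    """Detection step: return index pairs of likely approximate duplicates.

    Records are sorted by `key`, and each record is compared only with the
    next `window - 1` records in sorted order.
    """
    order = sorted(range(len(records)), key=lambda i: key(records[i]))
    pairs = []
    for pos, i in enumerate(order):
        for j in order[pos + 1 : pos + window]:
            if similarity(records[i], records[j]) >= threshold:
                pairs.append((min(i, j), max(i, j)))
    return sorted(pairs)


def eliminate(records, pairs):
    """Elimination step (simplified): keep the lower-indexed record of each pair."""
    dropped = {later for _, later in pairs}
    return [r for i, r in enumerate(records) if i not in dropped]


# Hypothetical records, not taken from the paper.
records = [
    "John Smith, 12 Main St",
    "Jon Smith, 12 Main St",
    "Alice Wong, 5 Oak Ave",
    "Alice Wong, 5 Oak Avenue",
    "Bob Lee, 9 Pine Rd",
]
duplicate_pairs = sorted_neighborhood(records, key=lambda r: r.lower())
cleaned = eliminate(records, duplicate_pairs)
```

The window keeps the number of comparisons roughly linear in the number of records, at the cost of missing duplicates whose sort keys differ near the start of the string; that trade-off is one reason multi-pass variants with different keys exist.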
To address problems with the data used in patent analysis, such as reliance on a single data source, rough preprocessing, and low-level data mining, this paper designs and implements data integration over heterogeneous patent sources. Specifically, a local patent database, whose data are acquired from heterogeneous sources covering two organizations and seven countries, serves as the basic data source. After data cleaning and transformation with the SSIS tool, the data from the local database are loaded into a data warehouse built according to the key performance indicators, which provides data support for more advanced analysis.
Starting from taxonomy and related technical theory in the field of library and information science, this paper studies the construction of a classification and navigation platform for Internet scientific data resources. It employs dynamic faceted classification to organize the scientific data resource catalogue, proposes a viable multi-facet classification and keyword connection indexing method, and designs a ranking scheme based on the weights of classification and keyword connection. The experimental system based on this scheme can effectively classify distributed scientific data resources on the Internet and provide a navigation service.
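The abstract gives no formula for the ranking scheme, so the following is a hypothetical sketch of how a score might combine a facet (classification) weight with a keyword connection weight. The linear combination, the `alpha` parameter, the facet weights, and the sample resources are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    title: str
    facets: dict    # facet name -> assigned value
    keywords: dict  # keyword -> connection weight in [0, 1]


def facet_score(resource, selected, facet_weights):
    """Weighted share of the user's selected facet values that the resource matches."""
    total = sum(facet_weights.get(f, 1.0) for f in selected)
    if total == 0:
        return 0.0
    hit = sum(
        facet_weights.get(f, 1.0)
        for f, value in selected.items()
        if resource.facets.get(f) == value
    )
    return hit / total


def keyword_score(resource, query_terms):
    """Mean connection weight of the query terms for this resource."""
    if not query_terms:
        return 0.0
    return sum(resource.keywords.get(t, 0.0) for t in query_terms) / len(query_terms)


def rank(resources, selected, query_terms, facet_weights, alpha=0.6):
    """Order titles by alpha * facet score + (1 - alpha) * keyword score."""
    scored = [
        (
            alpha * facet_score(r, selected, facet_weights)
            + (1 - alpha) * keyword_score(r, query_terms),
            r.title,
        )
        for r in resources
    ]
    return [title for score, title in sorted(scored, key=lambda s: (-s[0], s[1]))]


# Hypothetical catalogue entries and weights.
catalogue = [
    Resource("C", {"discipline": "biology", "type": "dataset"}, {}),
    Resource("B", {"discipline": "geology", "type": "map"}, {"seismic": 0.4}),
    Resource("A", {"discipline": "geology", "type": "dataset"}, {"seismic": 0.9}),
]
ranking = rank(
    catalogue,
    selected={"discipline": "geology", "type": "dataset"},
    query_terms=["seismic"],
    facet_weights={"discipline": 2.0, "type": 1.0},
)
```

Here resource A matches both selected facets and has the strongest keyword connection, so it ranks first, followed by B and then C.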
The paper designs a scheme that applies binary image watermarking technology to copyright protection, and then analyzes the security and invisibility of the method. The experimental results show that the method can effectively address the problem of copyright protection in the digitization of ancient books.
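The embedding algorithm is not described in the abstract. As a stand-in, the sketch below embeds a binary watermark by least-significant-bit substitution at positions selected by a secret key, which illustrates the two properties the abstract mentions: invisibility (each pixel value changes by at most 1) and security (the positions cannot be reproduced without the key). The function names, the key value, and the toy image are assumptions, and LSB substitution is not claimed to be the paper's method.

```python
import random


def _positions(n_pixels, n_bits, key):
    """Key-dependent, repeatable choice of distinct pixel positions."""
    rng = random.Random(key)
    return rng.sample(range(n_pixels), n_bits)


def embed(pixels, watermark_bits, key):
    """Write each watermark bit into the lowest bit of a key-selected pixel."""
    if len(watermark_bits) > len(pixels):
        raise ValueError("watermark is longer than the host image")
    marked = list(pixels)
    positions = _positions(len(pixels), len(watermark_bits), key)
    for pos, bit in zip(positions, watermark_bits):
        marked[pos] = (marked[pos] & ~1) | bit
    return marked


def extract(pixels, n_bits, key):
    """Read the watermark bits back from the same key-selected positions."""
    return [pixels[pos] & 1 for pos in _positions(len(pixels), n_bits, key)]


# Toy 8x8 grayscale host image, flattened row by row (illustrative values).
host = [(37 * i) % 256 for i in range(64)]
watermark = [1, 0, 1, 1, 0, 0, 1, 0]
secret_key = 2024  # hypothetical key

marked = embed(host, watermark, secret_key)
recovered = extract(marked, len(watermark), secret_key)
```

LSB substitution is fragile under lossy compression, so a production scheme for digitized books would likely embed in a transform domain or add redundancy; the sketch only shows the embed and extract round trip.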
To address the technical bottlenecks of current book management systems in information sharing, the paper introduces the concept of REST and analyzes the architecture of RESTful Web Services. Taking storage, updating, retrieval, and borrowing/returning in the book management business as examples, it designs and implements a book management system based on RESTful Web Services, providing a reference implementation for building book management systems suited to lightweight information sharing. Finally, it demonstrates the advantages and feasibility of the system in implementing lightweight sharing of book information.
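To make the resource-oriented style concrete, the sketch below maps the four business operations named in the abstract onto HTTP methods and URI paths. It is an in-memory dispatcher rather than a real web server, and the paths (`/books`, `/books/{id}`, `/books/{id}/loan`), the status codes chosen, and the class name are assumptions, not the paper's actual interface.

```python
class BookService:
    """In-memory stand-in for a RESTful book management service."""

    def __init__(self):
        self.books = {}
        self.next_id = 1

    def handle(self, method, path, body=None):
        """Dispatch on (HTTP method, path) and return (status, payload)."""
        parts = [p for p in path.split("/") if p]
        if not parts or parts[0] != "books":
            return 404, {"error": "unknown resource"}

        if len(parts) == 1:
            if method == "POST":  # storage: create a new book resource
                book_id = str(self.next_id)
                self.next_id += 1
                self.books[book_id] = {"id": book_id, "on_loan": False, **(body or {})}
                return 201, self.books[book_id]
            if method == "GET":  # retrieval: list the collection
                return 200, list(self.books.values())
            return 405, {"error": "method not allowed"}

        book = self.books.get(parts[1])
        if book is None:
            return 404, {"error": "no such book"}

        if len(parts) == 2:
            if method == "GET":  # retrieval: a single book
                return 200, book
            if method == "PUT":  # updating: replace the given fields
                book.update(body or {})
                return 200, book
            return 405, {"error": "method not allowed"}

        if len(parts) == 3 and parts[2] == "loan":
            if method == "PUT":  # borrowing
                if book["on_loan"]:
                    return 409, {"error": "already on loan"}
                book["on_loan"] = True
                return 200, book
            if method == "DELETE":  # returning
                book["on_loan"] = False
                return 200, book
            return 405, {"error": "method not allowed"}

        return 404, {"error": "unknown resource"}


service = BookService()
created_status, created = service.handle("POST", "/books", {"title": "Sample Title"})
```

Because every operation is addressed through a URI and a standard method, any HTTP client can share the data without a dedicated SOAP toolkit, which is the lightweight-sharing property the abstract emphasizes.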
This paper implements a book preview service in a Web OPAC based on the Google Book Search API. The author describes the design strategy and detailed steps, through which Web OPAC users can experience a better information service.