The paper introduces the background, the definition and basic principle of linked data, analyzes linked data-driven Web application components, and presents the types of linked data-driven applications on library such as resource discovery service, data fusion and semantic search service, research and scholarship, open data and reuse etc. Finally, it proposes some suggestions for building linked data applications in library.
This paper presents a method for organization and clustering of the network retrieved results based on formal concept analysis. It classifies the key words of result documents,then creates a concept lattice for retrieved result in order to help the users approach the final results.
This paper proposes a method of domain name recognition based on heuristic rules, to overcome the shortage of traditional solution in specific domain. It firstly studies chemical name in Chinese to obtain its domain features and statistical language features, and then on the basis of such features,it puts forward several heuristic rules, which is applicable to domain name recognition of chemical literature. Comparison experiment shows this method can improve the efficiency of domain name recognition obviously.
The article analyzes the necessity of error checking for relationships in thesaurus and Ontology establishment,and concisely introduces the research status of error checking in thesauri software, checking methods and other projects. According to the research objective, it focuses on the loop errors in hierarchical relationships, including the types of errors, the design and implementation of checking algorithm. Experiments on a test data and a thesaurus respectively show that the algorithm works well.
On the foundation of the Unicorn system in Nankai University Library, this paper introduces the design and implementation of bibliographic recommendation system based on maximal frequent patterns mining algorithm. It describes the process of analyzing the readers’ behavior patterns by fully utilizing the accumulation data collected in the Unicorn system in details, so as to offer personalized bibliographic recommendation service. By using this system, the academic library can effectively expand different service patterns to readers on available sources, and improve the efficiency of the existing automated circulation system.
In this paper,the concept of edit distance is introduced, and the issues about how to construct a tag tree and calculate the similarity of two Web pages by using the tree-matching algorithm are discussed. Firstly, the pages are roughly clustered according to their URL similarities and further classified by tree-matching algorithm. Based on the model page obtained by clustering, Web information can be extracted automatically by using Web structure similarity algorithm jointed with extraction rules. The test is able to verify the feasibility and efficiency of the algorithm in system.
This paper introduces the new variants and applications of Lexicalized Tree Adjoining Grammar(LTAG) over the past decade, and summarizes the main trends of the theory’s evolution. Then, it points out the significance and implementation value of LTAG in Chinese, describes the current situation and difficulties of LTAG in Chinese studies. Finally, the authors give some perspectives and prospects about the LTAG application in Chinese.
In order to improve the convenience of farmers’ access to information, this paper is focused on developing the farmer-oriented question answering system and proposes that the system is composed of four modules: knowledge base construction, question processing, information retrieval, answer extraction, in which question processing is the research priority. On the basis of concluding the characteristics of farmers’ questions, it proposes the question classification method based on interrogatives and phrases. During the question processing, methods are adopted to classify the questions and extract the keywords effectively, such as removing polite words, establishing special rule table for informal interrogatives and no interrogative and so on. While, the authors take advantage of the synonym extended table to expand keywords, and set different weight benchmarks. The research can lay the foundation for the processing of the information retrieval module.
The paper tries to apply the XML text retrieval methods to long text enviroment,and uses Chinese thesis as a dataset. It designs and implements XML tagging, indexing, keyword retrieval and structural retrieval on Chinese thesis, and finally constructs an XML-based Chinese thesis retrieval system.
This paper studies the Laplacian spectrum of scientific papers’ word co-occurrence network from three aspects: the Laplacian spectrum rank, the Laplace spectral density, and the Laplacian extremal eigenvalues. Through comparative analysis, the Laplacian spectral features of scientific papers’ word co-occurrence network are obviously different, and these differences can be used for discriminating the authenticity of scientific paper.
Based on the concept and calculation method of extensity centrality, which are newly proposed in 2009, the paper builds co-authorship networks by the data of the world’s top three journals in Management Information Systems(MIS)field, analyzes the components of networks and calculates the extensity centrality of authors in five components. Then, it surveys and analyzes the backgrounds and cooperators’ research fields of those authors whose extensity centrality scores are high. The results indicate that the whole cooperative behavior of researchers in MIS is more active.Many of the authors with high extensity centrality scores are well-known experts or scholars in MIS, and they cooperate with scholars coming from various fields. Therefore,extensity centrality can indeed be used to evaluate the importance of experts and scholars.
This paper researches the ResCarta Tools, and constructs ResCarta Data Repository by ResCarta Tools which catalogs documents based on METS and MODS standard. The repository can provide services such as local digital object retrieval and browsing, metadata harvests based on OAI-PMH protocol, object data access by identifier link for remote OAI service provider.
This paper introduces the topology architecture of the network of Tsinghua University Library. Then it analyzes the characteristics of the network system, and gives out the measures to set the network devices of IPv6/IPv4 dual stack and wireless network.
To support the library lecture service, a library lecture subscription system is designed and implemented. With this system,the librarians can manage the information related with library lectures, and the readers can subscribe lectures or cancel their subscriptions. At the same time,a friendly system interface is developed by using Ajax technology.
This article introduces how mobile library system is integrated with ILASII by Web Online Library when the self-developed database in ILASII can’t be accessible and API for system integration isn’t provided. Meanwhile, it gives a detailed analysis and elaborates the process of programming.
This paper summarizes the design, development and testing work of Study Room Management System in Library of Wenzhou Medical College. Based on B/S architecture, using the existing campus smart card system and the technologies of Flash, ASP and database, the system realizes the functions such as distributing seats automatically, choosing and exchanging seats, seats reserving time over management and displaying the position and status of the seat in graphic animation. Further more, it resolves the low seat utilization problem resulting from the occupation of the seats in contemporary college library study rooms.