Data Analysis and Knowledge Discovery

Open Source Software of Digital Library System and Its Development

Guo Wenli,Li Shuning,Zhang Xiaolin

New Technology of Library and Information Service. 2007, 2(3): 1-6. https://doi.org/10.11925/infotech.1003-3513.2007.03.01

This paper gives a comprehensive description of open source digital library software developed abroad, including significant improvements and extensions to open source software systems, the integration of multiple open source software systems, and the integration of open source software with other technologies.

Construction and Evolution of Discipline Domain Ontology

Du Xiaoyong,Ma Wenfeng,Wu Wenjuan

New Technology of Library and Information Service. 2007, 2(3): 7-12. https://doi.org/10.11925/infotech.1003-3513.2007.03.02

This paper briefly surveys the state of the art in the construction and evolution of domain Ontologies. It describes the process of constructing a primary version of an economics Ontology from an existing Chinese classified thesaurus, and the approach used to evolve that primary version. The key techniques of Ontology evolution include creating a dataset for Ontology learning, determining the candidate keywords, and discovering the concepts and relationships of the domain Ontology.

Localization Practice of Institutional Repository Based on DSpace

Chen He,Xiao Dehong,Lin Limin

New Technology of Library and Information Service. 2007, 2(3): 13-17. https://doi.org/10.11925/infotech.1003-3513.2007.03.03

After introducing the open source DSpace software, this paper describes several practical steps for adapting it to the actual application requirements of domestic institutions and to the usage habits of domestic users. The steps include localizing DSpace’s language-related system files into Chinese, adjusting the system interfaces, optimizing the system functions, and improving the mail server function. All of the steps were carried out on the Xiamen University Institutional Repository, which is built with DSpace.

On OAI-PMH Based Interoperation of Digital Archival Metadata

Wang Fang,Wang Xiaoli

New Technology of Library and Information Service. 2007, 2(3): 18-24. https://doi.org/10.11925/infotech.1003-3513.2007.03.04

This paper puts forward a framework and set of functions for OAI-PMH based interoperation of digital archives. It introduces EAD, one of the main digital archival metadata standards, and its mapping to DC, which is generally supported by OAI-PMH. It then discusses the technical principles of converting EAD into DC and, in particular, how the context information between subordinate EAD components can be preserved after conversion into OAI records. Finally, the problems that remain in the process are analyzed and some solutions are proposed.

Toward User-Document Matrix Based User Clustering for Collaborative Recommendation

Yan Duanwu,Luo Shengyang,Cheng Xiao

New Technology of Library and Information Service. 2007, 2(3): 25-28. https://doi.org/10.11925/infotech.1003-3513.2007.03.05

To meet the needs of personalized recommendation services and to address the high dimensionality and sparsity of user-document access data, this paper uses a dimension reduction method based on inter-user comparison, together with a K-hierarchical clustering algorithm, to analyze the procedure for clustering users from collected data on users’ resource evaluations. On this basis, an experimental user clustering system is designed and developed using Java open source technology.

Interoperability and Its Implementation Among Knowledge Organization Systems

Si Li

New Technology of Library and Information Service. 2007, 2(3): 29-34. https://doi.org/10.11925/infotech.1003-3513.2007.03.06

Interoperability among Knowledge Organization Systems (KOS) is one of the key technologies for cross-browsing and searching. The paper introduces several research projects on interoperability among KOS, summarizes the methods adopted, analyzes three examples, and puts forward suggestions for realizing interoperability among KOS in China.

Query Expansion & Standardization Based on Ontology

Nie Hui

New Technology of Library and Information Service. 2007, 2(3): 35-38. https://doi.org/10.11925/infotech.1003-3513.2007.03.07

This paper studies several issues related to intelligent information retrieval. First, it proposes a method for calculating semantic similarity and relatedness using the taxonomic and entailment relations of an Ontology, which makes query expansion possible. Second, using the relations in the Ontology, keyword queries are standardized and reconstructed in the form of RDF. Finally, tests and analysis show that the scheme is reasonable and valid.

A Rule-Based Classification Approach of Web Pages Using Ontology

Tan Jinbo

New Technology of Library and Information Service. 2007, 2(3): 39-42. https://doi.org/10.11925/infotech.1003-3513.2007.03.08

The paper puts forward an Ontology-based rule classification approach. First, the approach creates an Ontology for each subclass of the classification system. Then, Web text classification is performed using the rules and the Ontology. Experimental results indicate that, compared with the Rocchio method, the Ontology-based approach has slightly lower recall but higher precision.

A Text Categorization System with C#

Liu Hua

New Technology of Library and Information Service. 2007, 2(3): 43-45. https://doi.org/10.11925/infotech.1003-3513.2007.03.09

This paper presents a multilayer, multi-class text categorization system based on the Vector Space Model (VSM) and Naive Bayes (NB). Four modules are described in detail: word segmentation and frequency statistics; the calculation between categories and documents; improving the accuracy of the parent class by correcting the subclass; and determining whether a document belongs to multiple categories and carries multiple labels. Text representation based on the Vector Space Model achieves a MicroF1 of 89.7% for parent categories and 77.8% for subcategories; text representation based on Naive Bayes achieves a MicroF1 of 67.6% for parent categories and 66.5% for subcategories.

Chinese Time Words and Numerals Automatic Segmentation Method Based on Rules

Gao Xiaoyun,Yang Jianlin

New Technology of Library and Information Service. 2007, 2(3): 46-50. https://doi.org/10.11925/infotech.1003-3513.2007.03.10

This paper first summarizes the formats in which Chinese time words and numerals appear in text. Based on these formats, it sets up a rule set for recognition, proposes a rule-based method for segmenting Chinese time words and numerals, and finally discusses its application value in competitive intelligence analysis and machine translation.

Computation of the Concept Semantic Similarity in FCA

Zhang Xiaoluan,Wang Xifeng

New Technology of Library and Information Service. 2007, 2(3): 51-54. https://doi.org/10.11925/infotech.1003-3513.2007.03.11

Formal Concept Analysis (FCA) and domain Ontologies are two knowledge representation formalisms, both aimed at modeling concepts. This paper proposes a method for computing the similarity between concepts in FCA. The experimental results show that the method is effective for concept similarity computation.

Constructing Semantic Distribution Dictionary Based on WordNet

Zhang Huiping,Lv Xueqiang,Shi Shuicai,Li Yuqin

New Technology of Library and Information Service. 2007, 2(3): 55-59. https://doi.org/10.11925/infotech.1003-3513.2007.03.12

This paper presents a method for constructing a semantic distribution dictionary based on WordNet. After introducing the WordNet system and the SemCor corpus, it designs the structure of the semantic distribution dictionary. The contents of the sense.idx and taglist files are analyzed, and the procedure for constructing the dictionary from them is described in detail.

Study of Evaluation on Information Architecture Usability for Public Library Website

Zhao Yuxiang

New Technology of Library and Information Service. 2007, 2(3): 60-64. https://doi.org/10.11925/infotech.1003-3513.2007.03.13

After introducing information architecture (IA) and usability, this paper analyzes the relationship between the two concepts. It then investigates 16 domestic public library Websites and tentatively develops an evaluation system suited to public library Websites. Finally, the evaluation system is used to test and evaluate the IA usability of the Shanghai public library Website.

Application of Ajax and RSS in Personalized Portal Site of the Library

Zhang Bei,Zhang Chengyu,Jiang Airong

New Technology of Library and Information Service. 2007, 2(3): 65-68. https://doi.org/10.11925/infotech.1003-3513.2007.03.14

This paper summarizes the definitions and characteristics of Ajax and RSS, and explains their application in building the personalized portal site of Tsinghua University Library.

Implementation of Preprocess Technology in Bibliometric and Analytic Research via VBA

Hua Bolin

New Technology of Library and Information Service. 2007, 2(3): 69-72. https://doi.org/10.11925/infotech.1003-3513.2007.03.15

After analyzing Web data, a statistical process is designed in accordance with its characteristics. Several different algorithms are tried at each stage in order to reach an optimal solution. The experiments show that efficiency and effectiveness can be improved by reducing I/O operations, increasing processing granularity, and using a lexicon.

Research and Implementation of OPAC Search Machine Based on Open Source Software

Wang Zhengjun,Jin Yuling,Ren Yonggong

New Technology of Library and Information Service. 2007, 2(3): 73-76. https://doi.org/10.11925/infotech.1003-3513.2007.03.16

Because current OPAC (Online Public Access Catalogue) search machines lack a dedicated operating system environment, this paper puts forward an OPAC machine solution built on open source software. Unlike traditional Windows-based machines, the OPAC machine uses free and open source software to build an open, standard environment, which reduces investment and is more efficient, stable, secure, and easy to maintain.

A New Method of Music Melody Extraction and Its Application

Han Shenglong

New Technology of Library and Information Service. 2007, 2(3): 77-79. https://doi.org/10.11925/infotech.1003-3513.2007.03.17

This paper presents a new method of music melody extraction, which finds note boundaries and segments notes by sequentially scanning the pitch list and detecting pitch movement. Around 1,000 pieces of Chinese folk music played on an electronic keyboard have been processed, with a success rate of over 90%.

Merging Two Degree’s Dissertations Databases of Different Structure with Word

Long Haojian

New Technology of Library and Information Service. 2007, 2(3): 80-82. https://doi.org/10.11925/infotech.1003-3513.2007.03.18

Dissertation database construction often runs into problems such as differing data structures, inconsistent data formats, and nonstandard data. Drawing on the experience of merging two TRS databases of doctoral and master’s dissertations with different data structures in the author’s library, the paper explains how to resolve such problems with VBA in Word and presents the actual program code.

The Fingerprint-Technology and Its Uses in the Reader Credential System

Liu Fanxin

New Technology of Library and Information Service. 2007, 2(3): 83-86. https://doi.org/10.11925/infotech.1003-3513.2007.03.19

This paper introduces the principles and characteristics of fingerprint technology, designs a fingerprint-based reader credential system, and discusses the cost, privacy, and fingerprint collection and comparison issues that fingerprint technology raises in the library.

Application and Discussion of ALEPH 500 Acquisition Module

Mao Shirong

New Technology of Library and Information Service. 2007, 2(3): 87-89. https://doi.org/10.11925/infotech.1003-3513.2007.03.20

By describing and analyzing the application of the ALEPH 500, this paper introduces its main functions and characteristics as well as its localized development at Sichuan University Library. In addition, the paper examines and discusses in depth the problems that currently exist.

Managing University Students Archive with Content Management Technology

Dong Pingjun,Wang Dongming,Wang Ning

New Technology of Library and Information Service. 2007, 2(3): 90-93. https://doi.org/10.11925/infotech.1003-3513.2007.03.21

Using content management technology, this paper proposes a design for a university student archive management system that aims to make remote use more convenient. It also describes how to implement a prototype system with IBM Content Manager v8.3, a middleware product of IBM Corp.

25 March 2007, Volume 23 Issue 3
