Please wait a minute...
New Technology of Library and Information Service  2011, Vol. 27 Issue (1): 69-73    DOI: 10.11925/infotech.1003-3513.2011.01.11
article Current Issue | Archive | Adv Search |
Research on Content Characteristics About Complex Network of Text
Liu Honghong1,2, An Haizhong1,2, Gao Xiangyun1,2
1. Lab of Resources and Environmental Management, China University of Geosciences, Beijing 100083, China;
2. School of Humanities and Economic Management, China University of Geosciences, Beijing 100083, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

To solve the problem of irregular structure of some texts, this paper presents a method based on the complex network theory to evaluate the text structure. This method uses a node to represent a sentence and an edge between two nodes to represent a common word of two sentences, which construct the complex network of a text. Then the authors analyze characters of text structure by topological characteristics of text complex network. By building a text complex network based on a selected article, the degree, the degree of intensity, the shortest paths and the weighting clustering coefficients of this selected article are calculated. The results show that the structure of the text content can be effectively evaluated by this proposed method. Moreover, the results also provide important references to understand main ideas, to generate summaries and to filter text retrieval of a given text.

Key wordsComplex network of text      Content structure      Shortest path      Clustering coefficient     
Received: 28 October 2010      Published: 12 February 2011
: 

G203

 

Cite this article:

Liu Honghong, An Haizhong, Gao Xiangyun. Research on Content Characteristics About Complex Network of Text. New Technology of Library and Information Service, 2011, 27(1): 69-73.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2011.01.11     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2011/V27/I1/69


[1] 王孟国.“显—隐”的经纬:高行健长篇小说文本结构研究
[J]. 福建师范大学学报:哲学社会科学版 ,2010(3):89-96.

[2] 何维,王宇. 基于句子关系图的网页文本主题句抽取
[J]. 现代图书情报技术 ,2009(3):58-61.

[3] 梁文婷,何中市,龙华,等.改进传统文本结构关系图的文本结构分析
[J]. 微计算机信息 ,2009,25 (3):213-215.

[4] 刘军万,刘飞飞.基于潜在语义索引的文本结构分析方法的研究
[J]. 情报方法 ,2004(4):56-58.

[5] Jenkins S, Kirk S R. Software Architecture Graphs as Complex Networks: A Novel Partitioning Scheme to Measure Stability and Evolution
[J]. Information Sciences,2007,177(12):2587-2601.

[6] Amancio D R,Antiqueira L L, Pardo T A S, et al.Complex Networks Analysis of Manual and Machine Translations
[J]. International Journal of Modern Physics C,2008,19 (4):583-598.

[7] Antiqueira L, Nunes M G V, Oliveira Jr O N,et al. Strong Correlations Between Text Quality and Complex Networks Features
[J].Physica A,2007,373(4):811-820.

[8] Antiqueira L,Pardo T A S,Nunes M G V,et al.Some Issues on Complex Networks for Author Characterization. In: Proceedings of the 4th Workshop in Information and Human Language Technology.2006:59-68.

[9] Antiqueira L, Oliveira Jr O N, Luciano da Fontoura Costa,et al.A Complex Network Approach to Text Summarization
[J].Information Sciences,2009,179(5):584-599.

[10] Pardo T A S,Antiqueira L,Nunes M G V,et al.Modeling and Evaluating Summaries Using Complex Networks. In: Proceedings of the 7th Workshop on Computational Processing of Written and Spoken Portuguese (PROPOR).2006:1-10.

[11] 中国科学院计算技术研究所.汉语词法分析系统(ICTCLAS分词系统). 2007.

[12] BorgattiS P, Everett M T, Freeman L C.社会分析软件UCINET. 加州大学.2002.

[13] 邹声文.全面提高教书育人水平,推动教育事业科学发展. 人民日报,2010-09-10(1).

[14] 周磊,龚志强,支蓉,等.利用复杂网络研究中国温度序列的拓扑性质
[J]. 物理学报 ,2008,59(2):7380-7389.

[1] Sun Wei, Hao Aiyu, Lv Qiang. Application of Location Mapping Technology in Book Positioning and Navigation[J]. 现代图书情报技术, 2015, 31(2): 85-90.
[2] Xing Xiaoyun, Wei Jing. Study on the Dynamic Evolution of an OSN Structure and the Impacts on Word of Mouth[J]. 现代图书情报技术, 2011, 27(9): 60-65.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn