Please wait a minute...
Data Analysis and Knowledge Discovery
Current Issue | Archive | Adv Search |
Research on Policy Text Relevance Mining Method Integrating Syntactic Structure and Word Meaning Information
Wu Kaibiao,Lang Yuxiang,Dong Yu
(National Science Library, Chinese Academy of Sciences, Beijing 100190,China) (Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective]In order to further improve the depth of the semantic relevance mining of policy text, this paper explores the method of policy text relevance mining.


[Methods] This research integrates dependency parsing analysis and word embedding model to mine the deep semantic relevance of policy text content from the perspective of sentence information and word meaning information, and it fully considers the language characteristics of the policy text when setting the dependency syntax extraction rules.


[Results]In terms of algorithm effect, in the test data set with a relatively low degree of policy text association, the algorithm F1 value reached 0.857, which is an increase of 22.78% compared to the traditional conventional algorithm; in terms of algorithm function, the policy text relevance can be described from the subtle differences in words.


[Limitations]In semantic information mining, the algorithm currently uses an open source model, which can subsequently independently train word vector models in specific policy domains to further improve accuracy; In sentence information mining, the algorithm relies on the accuracy of existing dependency syntactic analysis tools.


[Conclusions]The algorithm proposed in this paper has good effects and strong functions. It can effectively reveal the degree of policy text association. So it can bring new research perspectives and tools for quantitative research on policy text.



Key words Policy Text Relevance      Dependency Parsing      Word Embedding      
Published: 25 November 2021
ZTFLH:  D630  
  TP391.1  

Cite this article:

Wu Kaibiao, Lang Yuxiang, Dong Yu. Research on Policy Text Relevance Mining Method Integrating Syntactic Structure and Word Meaning Information . Data Analysis and Knowledge Discovery, 0, (): 1-.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467. 2021.0606     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y0/V/I/1

[1] Wu Kaibiao, Lang Yuxiang, Dong Yu. Mining Policy Text Relevance with Syntactic Structure and Semantic Information[J]. 数据分析与知识发现, 2022, 6(5): 20-33.
[2] Bocheng Li,Yunqiu Zhang,Kaixi Yang. Extracting Emotion Tags from Comments of Microblog Commodities[J]. 数据分析与知识发现, 2019, 3(9): 115-123.
[3] Li Lin,Li Hui. Computing Text Similarity Based on Concept Vector Space[J]. 数据分析与知识发现, 2018, 2(5): 48-58.
[4] Zhang Fan, Le Xiaoqiu. Research on Recognition of Concept Attribute Instances in Innovation Sentences of Scientific Research Paper[J]. 现代图书情报技术, 2015, 31(5): 15-23.
[5] Nie Hui, Du Jiazhong. Using Dependency Parsing Pattern to Extract Product Feature Tags[J]. 现代图书情报技术, 2014, 30(12): 44-50.
[6] Tang Xiaobo, Xiao Lu. Research of Text Feature Extraction on Dependency Parsing Network[J]. 现代图书情报技术, 2014, 30(11): 31-37.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn