%A Zheng Xinman, Dong Yu %T Constructing Degree Lexicon for STI Policy Texts %0 Journal Article %D 2021 %J Data Analysis and Knowledge Discovery %R 10.11925/infotech.2096-3467.2021.0148 %P 81-93 %V 5 %N 10 %U {https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/abstract/article_5090.shtml} %8 2021-10-25 %X

[Objective] This paper constructs a sentiment lexicon for STI policy texts, aiming to identify and quantify the embedded attitudes of policy makers. It tries to address the issues of existing studies, which ignore the semantic intensity of words. [Methods] First, we summarized the characteristics of policy texts and proposed a method to construct degree lexicon. This lexicon chose seed words from expert knowledge, expanded domain degree words with the PMI algorithm, and screened these words with Tongyi Cilin. Finally, we combined the TextRank algorithm with the new lexicon and conducted an experimental validation. [Results] The constructed degree lexicon yielded better results in policy text analysis than the traditional single text mining algorithm. [Limitations] The weights of our lexicon needs to be refined. [Conclusions] The degree words in STI policy texts are abundant, standardized and stable. The new lexicon can effectively utilize degree words, and learn more semantic features of policy texts.