National Science Library, Chinese Academy of Sciences, Beijing 100190, China
Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China
[Objective] This paper constructs a sentiment lexicon for science, technology and innovation (STI) policy texts, aiming to identify and quantify the attitudes of policy makers embedded in them. It addresses a gap in existing studies, which ignore the semantic intensity of words. [Methods] First, we summarized the characteristics of policy texts and proposed a method to construct a degree lexicon. The method selects seed words based on expert knowledge, expands the set of domain degree words with the pointwise mutual information (PMI) algorithm, and screens the candidates with Tongyi Cilin. Finally, we combined the TextRank algorithm with the new lexicon and validated the approach experimentally. [Results] The constructed degree lexicon yielded better results in policy text analysis than the traditional text mining algorithm used alone. [Limitations] The weights in our lexicon need to be refined. [Conclusions] Degree words in STI policy texts are abundant, standardized and stable. The new lexicon makes effective use of degree words and captures more semantic features of policy texts.
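The PMI expansion step named in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `pmi_expand`, the sentence-level definition of co-occurrence, the use of the maximum PMI over seeds, the thresholds, and the English toy tokens are all assumptions made for illustration. The Tongyi Cilin screening and TextRank steps are not shown.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_expand(sentences, seeds, min_pmi=1.0, min_count=2):
    """Rank non-seed words by their highest PMI with any seed word.

    Co-occurrence is counted at sentence level (an assumption), and
    PMI(w, s) = log2( p(w, s) / (p(w) * p(s)) ) with probabilities
    estimated as sentence frequencies.
    """
    n = len(sentences)
    word_df = Counter()  # number of sentences containing each word
    pair_df = Counter()  # number of sentences containing both words of a pair
    for tokens in sentences:
        vocab = set(tokens)
        word_df.update(vocab)
        pair_df.update(frozenset(p) for p in combinations(sorted(vocab), 2))

    scores = {}
    for word, count in word_df.items():
        if word in seeds or count < min_count:
            continue  # skip seeds themselves and rare words
        pmis = []
        for seed in seeds:
            joint = pair_df[frozenset((word, seed))]
            if joint:  # PMI is undefined when the pair never co-occurs
                pmis.append(math.log2(joint * n / (count * word_df[seed])))
        if pmis and max(pmis) > min_pmi:
            scores[word] = max(pmis)
    # Highest-scoring candidates first
    return sorted(scores.items(), key=lambda kv: -kv[1])


# Hypothetical, pre-tokenized toy corpus (real policy texts would be
# segmented Chinese sentences).
sentences = [
    ["vigorously", "resolutely", "promote", "innovation"],
    ["vigorously", "resolutely", "advance", "reform"],
    ["promote", "innovation", "steadily"],
    ["advance", "reform", "steadily"],
    ["publish", "annual", "report"],
    ["publish", "annual", "report"],
]
candidates = pmi_expand(sentences, seeds={"vigorously"})
print(candidates)
```

On this toy corpus only "resolutely" passes the threshold, because it appears in every sentence that contains the seed. In practice the candidates would still need the screening step described above before entering the lexicon.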