|
|
Extracting Hypernym-Hyponym Relationship for Financial Market Applications |
Dai Zhihong1(),Hao Xiaoling1,2 |
1School of Information Management & Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China 2Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, China |
|
|
Abstract [Objective] This paper proposes a new method to extract superior-inferior relationship from knowledge graph, and then explores its effectiveness with practical application. [Methods] First, we constructed the mapping matrix for hypernym-hyponym words and their context semantics. Then, we combined word vector similarity with the matrix to extract the relation. [Results] We examined our method with datasets of listed companies and found its F1 value was more than 3% higher than those of the existing methods. The new model could help us study the association between company similarity and stock performance. [Limitations] More research is needed to improve relationship extraction with the help of clustering technique and pattern matching method. [Conclusions] The proposed method can effectively identify the relationship between entities, and study the related listed companies and stocks. It also helps us construct better knowledge graph in the financial field.
|
Received: 16 December 2020
Published: 27 August 2021
|
|
Fund:National Social Science Foundation of China(20BGL287);National Natural Science Fund of China(71401096) |
Corresponding Authors:
Dai Zhihong, ORCID:0000-0002-3890-115X
E-mail: daizhihong@189.cn
|
[1] |
陈金栋, 肖仰华. 一种基于语义的上下位关系抽取方法[J]. 计算机应用与软件, 2019, 36(2): 216-221.
|
[1] |
(Chen Jindong, Xiao Yanghua. Hypernymy Relation Extraction Based on Semantics[J]. Computer Applications and Software, 2019, 36(2): 216-221.)
|
[2] |
钟茂生, 刘慧, 刘磊. 词汇间语义相关关系量化计算方法[J]. 中文信息学报, 2009, 32(2): 115-122.
|
[2] |
(Zhong Maosheng, Liu Hui, Liu Lei. Method of Semantic Relevance Relation Measurement Between Words[J]. Journal of Chinese Information Processing, 2009, 32(2): 115-122.)
|
[3] |
邱科达, 马建玲. 基于文本语料的上下位关系识别研究综述[J]. 情报科学, 2020, 38(7): 162-172.
|
[3] |
(Qiu Keda, Ma Jianling. A Review of Hypernym Relation Recognition from Text Corpora[J]. Information Science, 2020, 38(7): 162-172.)
|
[4] |
Hearst M A. Automatic Acquisition of Hyponyms from Large Text Corpora [C]//Proceedings of the 14th Conference on Computational Linguistics - Volume 2. 1992: 539-545.
|
[5] |
汤青, 吕学强, 李卓. 本体概念间上下位关系抽取研究[J]. 微电子学与计算机, 2014, 31(6): 68-71.
|
[5] |
(Tang Qing, Lv Xueqiang, Li Zhuo. Research on Domain Ontology Concept Hyponymy Relation Extraction[J]. Microelectronics & Computer, 2014, 31(6): 68-71.)
|
[6] |
李军锋, 吕学强, 李卓. 专利领域本体概念语义层次获取[J]. 情报学报, 2014, 33(9): 986-993.
|
[6] |
(Li Junfeng, Lv Xueqiang, Li Zhuo. Deriving Concept Semantic Hierarchy of Ontology in Patents[J]. Journal of the China Society for Scientific and Technical Information, 2014, 33(9): 986-993.)
|
[7] |
张晨童, 张佳影, 张知行, 等. 融合常用语的大规模疾病术语图谱构建[J]. 计算机研究与发展, 2020, 57(11): 2467-2477.
|
[7] |
(Zhang Chentong, Zhang Jiaying, Zhang Zhixing, et al. Construction of Large-Scale Disease Terminology Graph with Common Terms[J]. Journal of Computer Research and Development, 2020, 57(11): 2467-2477.)
|
[8] |
Snow R. Learning Syntactic Patterns for Automatic Hypernym Discovery [C]//Proceedings of the 17th International Conference on Neural Information Processing Systems. 2005: 1297-1304.
|
[9] |
Suchanek F M, Kasneci G, Weikum G. YAGO: A Large Ontology from Wikipedia and WordNet[J]. Journal of Web Semantics, 2008, 6(3): 203-217.
doi: 10.1016/j.websem.2008.06.001
|
[10] |
陆凯华, 李正华, 张民. 汉语上下位关系分类数据集构建和基准方法比较[J]. 厦门大学学报(自然科学版), 2020, 59(6): 1004-1010.
|
[10] |
(Lu Kaihua, Li Zhenghua, Zhang Min. Data Construction and Benchmark Method Comparison for Chinese Hypernym-Hyponym Relation Classification[J]. Journal of Xiamen University (Natural Science Edition), 2020, 59(6): 1004-1010.)
|
[11] |
丁晟春, 侯琳琳, 王颖. 基于电商数据的产品知识图谱构建研究[J]. 数据分析与知识发现, 2019, 3(3): 45-56.
|
[11] |
(Ding Shengchun, Hou Linlin, Wang Ying. Product Knowledge Map Construction Based on the E-Commerce Data[J]. Data Analysis and Knowledge Discovery, 2019, 3(3): 45-56.)
|
[12] |
黄毅, 王庆林, 刘禹. 一种基于条件随机场的领域术语上下位关系获取方法[J]. 中南大学学报(自然科学版), 2013, 44(S2): 355-359.
|
[12] |
(Huang Yi, Wang Qinglin, Liu Yu. An Acquisition Method of Domain-Specific Terminological Hyponymy Based on CRF[J]. Journal of Central South University (Science and Technology), 2013, 44(S2): 355-359.)
|
[13] |
马晓军, 郭剑毅, 线岩团, 等. 结合词向量和Bootstrapping的领域实体上下位关系获取与组织[J]. 计算机科学, 2018, 45(1): 67-72.
|
[13] |
(Ma Xiaojun, Guo Jianyi, Xian Yantuan, et al. Entity Hyponymy Acquisition and Organization Combining Word Embedding and Bootstrapping in Special Domain[J]. Computer Science, 2018, 45(1): 67-72.)
|
[14] |
孙佳伟, 李正华, 陈文亮, 等. 基于词模式嵌入的词语上下位关系分类[J]. 北京大学学报(自然科学版), 2019, 55(1): 1-7.
|
[14] |
(Sun Jiawei, Li Zhenghua, Chen Wenliang, et al. Hypernym Relation Classification Based on Word Pattern[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2019, 55(1): 1-7.)
|
[15] |
吴婷, 李明扬, 孔芳. 基于同义推理的篇章级实体上下位关系语料库构建[J]. 中文信息学报, 2020, 34(4): 38-46.
|
[15] |
(Wu Ting, Li Mingyang, Kong Fang. Construction of Textual Entity Hypernymy Corpus Based on Synonymy Reasoning[J]. Journal of Chinese Information Processing, 2020, 34(4): 38-46.)
|
[16] |
汪诚愚, 何晓丰, 宫学庆, 等. 面向上下位关系预测的词嵌入投影模型[J]. 计算机学报, 2020, 43(5): 868-883.
|
[16] |
(Wang Chengyu, He Xiaofeng, Gong Xueqing, et al. Word Embedding Projection Models for Hypernymy Relation Prediction[J]. Chinese Journal of Computers, 2020, 43(5): 868-883.)
|
[17] |
Kotlerman L, Dagan I, Szpektor I, et al. Directional Distributional Similarity for Lexical Expansion [C]//Proceedings of ACL-IJCNLP 2009 Conference (Short Papers). 2009: 69-72.
|
[18] |
王思丽, 祝忠明, 杨恒, 等. 基于模式和投影学习的领域概念上下位关系自动识别研究[J]. 数据分析与知识发现, 2020, 4(11): 15-25.
|
[18] |
(Wang Sili, Zhu Zhongming, Yang Heng, et al. Automatically Identifying Hypernym-Hyponym Relations of Domain Concepts with Patterns and Projection Learning[J]. Data Analysis and Knowledge Discovery, 2020, 4(11): 15-25.)
|
[19] |
吴志祥, 王昊, 王雪颖, 等. 基于奇异值分解的专利术语层次关系解析研究[J]. 情报学报, 2017, 36(5): 473-483.
|
[19] |
(Wu Zhixiang, Wang Hao, Wang Xueying, et al. Study on Chinese Patent Terms Hierarchy Parse Based on Singular Value Decomposition[J]. Journal of the China Society for Scientific and Technical Information, 2017, 36(5): 473-483.)
|
[20] |
Kozareva Z, Hovy E. A Semi-Supervised Method to Learn and Construct Taxonomies Using the Web [C]//Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. 2010: 1110-1118.
|
[21] |
Fu R J, Qin B, Liu T. Exploiting Multiple Sources for Open-Domain Hypernym Discovery [C]//Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 2013: 1224-1234.
|
[22] |
刘琦, 肖仰华, 汪卫. 一种面向海量中文文本的典型类属关系识别方法[J]. 计算机工程, 2015, 41(2): 26-30.
|
[22] |
(Liu Qi, Xiao Yanghua, Wang Wei. A Recognition Approach of Typical Generic Relationship for Massive Chinese Text[J]. Computer Engineering [J]. Computer Engineering, 2015, 41(2): 26-30.)
|
[23] |
甘丽新, 万常选, 刘德喜, 等. 基于句法语义特征的中文实体关系抽取[J]. 计算机研究与发展, 2016, 53(2): 284-302.
|
[23] |
(Gan Lixin, Wan Changxuan, Liu Dexi, et al. Chinese Named Entity Relation Extraction Based on Syntactic and Semantic Features[J]. Computer Research and Development, 2016, 53(2): 284-302.)
|
[24] |
Bansal M, Burkett D, de Melo G, et al. Structured Learning for Taxonomy Induction with Belief Propagation [C]// Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2014: 1041-1051.
|
[25] |
段利国, 徐庆, 李爱萍, 等. 实体词语义信息对中文实体关系抽取的作用研究[J]. 计算机应用研究, 2017, 34(1): 141-146.
|
[25] |
(Duan Liguo, Xu Qing, Li Aiping, et al. Research on Effect of Entities Semantic Information on Chinese Entity Relation Extraction[J]. Computer Application Research, 2017, 34(1): 141-146.)
|
[26] |
Mikolov T, Yih S W T, Zweig G. Linguistic Regularities in Continuous Space Word Representations [C]//Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2013.
|
[27] |
付瑞吉. 开放域命名实体识别及其层次化类别获取[D]. 哈尔滨: 哈尔滨工业大学, 2014.
|
[27] |
(Fu Ruiji. Open-Domain Named Entity Recognition and Hierarchical Category Acquisition[D]. Harbin: Harbin Institute of Technology, 2014.)
|
[28] |
Lai S W, Liu K, He S Z, et al. How to Generate a Good Word Embedding[J]. IEEE Intelligent Systems, 2016: 31(6): 5-14.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|