|
|
Identifying Scenic Spot Entities Based on Improved Knowledge Transfer |
Zhao Ping1,Sun Lianying2(),Tu Shuai1,Bian Jianling3,Wan Ying1 |
1Smart City College, Beijing UnionUniversity, Beijing 100101, China 2College of Urban Rail Transit and Logistics, Beijing Union University, Beijing 100101, China 3Beijing China-Power Information Technology Co., LTD, Beijing 100192, China |
|
|
Abstract [Objective] This paper addresses the issues facing labeled data in the recognition of scenic spots.[Methods] We proposed an improved knowledge transfer algorithm for entity recognition and used datasets from the People’s Daily to evaluate our new model.[Results] Our method’s accuracy was 1.62% higher than the model using all labeled data.[Limitations] More research is needed to examine the expansion of samples.[Conclusions] The proposed method uses less labeled data in entity recognition and provides better technical support for tourism recommendation.
|
Received: 05 August 2019
Published: 15 June 2020
|
|
Corresponding Authors:
Sun Lianying
E-mail: sunlychina@163.com
|
[1] |
Grishman R, Sundheim B . Message Understanding Conference-6:A Brief History [C]// Proceedings of the 16th International Conference on Computational Linguistics, Copenhagen, Denmark. Stroudsburg, PA: ACL, 1996: 466-471.
|
[2] |
Hanisch D, Fundel K, Mevissen H T, et al. ProMiner: Rule-based Protein and Gene Entity Recognition[J]. BMC Bioinformatics, 2005,6(1):S14.
|
[3] |
Lample G, Ballesteros M, Subramanian S , et al. Neural Architectures for Named Entity Recognition [C]// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, California, USA. Stroudsburg, PA: ACL, 2016: 260-270.
|
[4] |
Dong C, Zhang J, Zong C , et al. Character-based LSTM-CRF with Radical-level Features for Chinese Named Entity Recognition [C]// Proceedings of the Natural Language Understanding and Intelligent Applications,Kunming, China. Berlin, German:Springer, 2016: 239-250.
|
[5] |
Patil N V, Patil A S, Pawar B V . HMM Based Named Entity Recognition for Inflectional Language [C]// Proceedings of the 2017 International Conference on Computer, Communications and Electronics,Jaipur, India. Piscataway, NJ: IEEE, 2017: 565-572.
|
[6] |
薛征山, 郭剑毅, 余正涛, 等. 基于HMM的中文旅游景点的识别[J]. 昆明理工大学学报:理工版, 2009,34(6):44-48.
|
[6] |
( Xue Zhengshan, Guo Jianyi, Yu Zhengtao, et al. Recognition of HMM-Based Chinese Tourist Attractions[J]. Journal of Kunming University of Science and Technology:Science and Technology, 2009,34(6):44-48.)
|
[7] |
郭剑毅, 薛征山, 余正涛, 等. 基于层叠条件随机场的旅游领域命名实体识别[J]. 中文信息学报, 2009,23(5):47-52.
|
[7] |
( Guo Jianyi, Xue Zhengshan, Yu Zhengtao, et al. Named Entity Recognition for the Tourism Domain Based on Cascaded Conditional Random Fields[J]. Journal of Chinese Information Processing, 2009,23(5):47-52.)
|
[8] |
Chiu J P C, Nichols E. Named Entity Recognition with Bidirectional LSTM-CNNs[J]. Transactions of the Association for Computational Linguistics, 2016,4:357-370.
|
[9] |
黄菡, 王宏宇, 王晓光. 结合主动学习的条件随机场模型用于法律术语的自动识别[J]. 数据分析与知识发现, 2019,3(6):66-74.
|
[9] |
( Huang Han, Wang Hongyu, Wang Xiaoguang. Automatic Recognizing Legal Terminologies with Active Learning and Conditional Random Field Model[J]. Data Analysis and Knowledge Discovery, 2019,3(6):66-74.)
|
[10] |
Greenberg N, Bansal T, Verga P , et al. Marginal Likelihood Training of BiLSTM-CRF for Biomedical Named Entity Recognition from Disjoint Label Sets [C]// Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium. Stroudsburg, PA: ACL, 2018: 2824-2829.
|
[11] |
刘小安, 彭涛.基于卷积神经网络的中文景点识别研究[J/OL].计算机工程与应用.[ 2019- 08- 01]. http://kns.cnki.net/kcms/detail/11.2127.TP.20190307.1807.007.html.
|
[11] |
( Liu Xiaoan, Peng Tao. Research on Chinese Scenic Spot Named Entity Recognition Based on Convolutional Neural Network[J/OL]. Computer Engineering and Applications.[ 2019- 08- 01]. http://kns.cnki.net/kcms/detail/11.2127.TP.20190307.1807.007.html.)
|
[12] |
Devlin J, Chang M W, Lee K , et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding [C]// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, USA. Stroudsburg, PA: ACL, 2019: 4171-4186.
|
[13] |
Hochreiter S, Schmidhuber J. Long Short-Term Memory[J]. Neural Computation, 1997,9(8):1735-1780.
|
[14] |
Sutton C, McCallum A. An Introduction to Conditional Random Fields[J]. Foundations and Trends® in Machine Learning, 2012,4(4):267-373.
|
[15] |
Peng D L, Wang Y R, Liu C, et al. TL-NER: A Transfer Learning Model for Chinese Named Entity Recognition[J]. Information Systems Frontiers, 2019. https://doi.org/10.1007/s10796-019-09932-y.
|
[16] |
Gomaa W H, Fahmy A A. A Survey of Text Similarity Approaches[J]. International Journal of Computer Applications, 2013,68(13):13-18.
|
[17] |
Zhang W, Yoshida T, Tang X. A Comparative Study of TF*IDF, LSI and Multi-Words for Text Classification[J]. Expert Systems with Applications, 2011,38(3):2758-2765.
|
[18] |
俞士汶, 段慧明, 吴云芳.现代汉语多级加工语料库[DS/OL].[ 2019- 01- 03]. http://dx.doi.org/10.18170/DVN/SEYRX5.
|
[18] |
( Yu Shiwen, Duan Huiming, Wu Yunfang. Corpus of Multi-Level Processing for Modern Chinese[DS/OL]. [ 2019- 01- 03]. http://dx.doi.org/10.18170/DVN/SEYRX5.)
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|