[Objective] Focus on the task of entity recognition of traditional music terms of intangible cultural heritage. [Methods] This research constructed a corpus of national intangible cultural heritage projects based on the China Intangible Cultural Heritage Network, and built an entity recognition framework on traditional music terms based on the CRF, LSTM, LSTM-CRF, and BERT. [Results] According to the performance comparison, the BERT model for recognition of traditional music terms had achieved a better result, with an average F1 value of 91.77%. [Limitations] This study only extract unique terms, and the training set is small. [Conclusions] The entity recognition model constructed by BERT is a valid model for automatically extracting traditional musical terms of intangible cultural heritage. It can provide a reliable reference for the related research of intangible cultural heritage.
刘浏,秦天允,王东波. 非物质文化遗产传统音乐术语自动抽取*[J]. 数据分析与知识发现, 2020, 4(12): 68-75.
Liu Liu,Qin Tianyun,Wang Dongbo. Automatic Extraction of Traditional Music Terms of Intangible Cultural Heritage. Data Analysis and Knowledge Discovery, 2020, 4(12): 68-75.
( Liu Liu, Wang Dongbo . A Review on Named Entity Recognition[J]. Journal of the China Society for Scientific and Technical Information, 2018,37(3):329-340.)
( Liu Zhiyuan, Sun Maosong, Lin Yankai , et al. Knowledge Representation Learning: A Review[J]. Journal of Computer Research and Development, 2016,53(2):247-261.)
( Xu Zenglin, Sheng Yongpan, He Lirong , et al. Review on Knowledge Graph Techniques[J]. Journal of University of Electronic Science and Technology of China, 2016,45(4):589-606.)
( Lai Yingxu, Li Yajuan, Liu Jing . Construction of Ontology-based Rice Breeding Method Knowledge Base[J]. Journal of Beijing University of Technology, 2019,45(12):1181-1191.)
( Wang Dongbo, Gao Ruiqing, Shen Si , et al. Research on Automatic Recognition of Basic Entity Component of Historic Events for Pre-Qin Classics[J]. Journal of the National Library of China, 2018,27(1):65-77.)
( Yin Zhangzhi, Li Xinzi, Huang Degen , et al. Chinese Named Entity Recognition Ensembled with Character[J]. Journal of Chinese Information Processing, 2019,33(11):95-100, 106.)
( Zhang Xiaohai, Cao Xinwen, Zhang Min . Military Named Entity Recognition Based on Self-Attention Mechanism[J]. Command Control & Simulation, 2019,41(6):29-33.)
( Cheng Zhonghui, Chen Ke, Chen Gang , et al. Named Entity Recognition Method Based on Co-training of Reinforcement Learning[J]. Software Engineering, 2020,23(1):7-11.)
( Cao Yiyi, Zhou Yinghua, Shen Fahai , et al. Research on Named Entity Recognition of Chinese Electronic Medical Record Based on CNN-CRF[J]. Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 2019,31(6):869-875.)
( Wang Yue, Wang Mengxuan, Zhang Sheng , et al. Alarm Text Named Entity Recognition Based on BERT[J]. Journal of Computer Applications, 2020,40(2):535-540.)
( Li Ni, Guan Huanmei, Yang Piao , et al. BERT-IDCNN-CRF for Named Entity Recognition in Chinese[J]. Journal of Shandong University (Natural Science), 2020,55(1):102-109.)
( Huang Yonglin, Tan Guoxin . Research on Digital Protection and Development of China’s Intangible Cultural Heritage[J]. Journal of Huazhong Normal University (Humanities and Social Sciences), 2012,51(2):49-55.)
( Huang Yonglin . The Protection and Utilization of Intangible Cultural Heritage Under the Digital Background[J]. Cultural Heritage, 2015(1):1-10, 157.)
( Hou Xilong, Tan Guoxin, Zhuang Wenjie , et al. Research on Knowledge Management of Intangible Cultural Heritage Based on Linked Data[J]. Journal of Library Science in China, 2019,45(2):88-108.)
( Song Junhua . Some Thoughts on Digital Protection of Intangible Cultural Heritage[J]. Cultural Heritage, 2015(2):1-8, 157.)
[17]
Lafferty J, Mc Calluma, Prreira F. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data [C]//Proceedings of the 18th International Conference on Machine Learning. San Francisco: Margan Kaufmann, 2001: 282-289.
Graves A, Mohamed A, Hinton G. Speech Recognition with Deep Recurrent Neural Networks [C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. 2013: 6645-6649.
[20]
Huang Z, Xu W, Yu K . Bidirectional LSTM-CRF Models for Sequence Tagging[OL]. arXiv Preprint, arXiv: 1508.01991.
[21]
Devlin J, Chang M W, Lee K . Bert: Pre-training of Deep Bidirectional Transformers for Language Understanding[OL]. arXiv Preprint, arXiv:1810.04805.