Constructing Knowledge Base for Chinese Geographical Name
Li Xiaomin, Wang Hao, Li Yueyan, Zhao Meng
School of Information Management, Nanjing University, Nanjing 210023, China
Jiangsu Key Laboratory of Data Engineering and Knowledge Service (Nanjing University), Nanjing 210093, China
Abstract [Objective] This paper uses linked data technology to study the evolution of geographical names in China, with the aim of supporting digital humanities research more effectively. [Methods] First, we constructed the knowledge base CGNE_Onto for the evolution of Chinese geographical names. Second, we defined strong and weak marker words to identify evolution-type sentences in the historical data. Third, we used the BERT-BiLSTM-CRF model to recognize time and place-name entities in those sentences. Fourth, we used the newly identified entities as classes to build the ontology knowledge base, which we visualized in terms of direct and indirect path relationships. Finally, we analyzed the number of, and reasons for, the different evolution types in each dynasty. [Results] The proposed model intuitively demonstrated the evolution of geographical names and provided some new directions for analyzing geographical name data. [Limitations] The experimental data set needs to be expanded to improve the quality of the evolution feature words. [Conclusions] The knowledge base for place names clearly shows their historical evolution, as well as the evolution types in different dynasties.
Received: 06 March 2022
Published: 13 January 2023
Fund: National Natural Science Foundation of China (72074108); Fundamental Research Funds for the Central Universities (010814370113)
Corresponding Author: Wang Hao, E-mail: ywhaowang@nju.edu.cn