%0 Journal Article
%A Li Yu
%A Li Qian
%A Changlei Fu
%A Huaming Zhao
%T Extracting Fine-grained Knowledge Units from Texts with Deep Learning
%D 2019
%R 10.11925/infotech.2096-3467.2018.1352
%J Data Analysis and Knowledge Discovery
%P 38-45
%V 3
%N 1
%X <p><b>[Objective]</b> This paper tries to extract fine-grained knowledge units from texts with a deep learning model based on the modified bootstrapping method. <b>[Methods]</b> First, we built the lexicon for each type of knowledge unit with the help of search engine and keywords from Elsevier. Second, we created a large annotated corpus based on the bootstrapping method. Third, we controlled the quality of annotation with the estimation models of patterns and knowledge units. Finally, we trained the proposed LSTM-CRF model with the annotated corpus, and extracted new knowledge units from texts. <b>[Results]</b> We retrieved four types of knowledge units (study scope, research method, experimental data, as well as evaluation criteria and their values) from 17,756 ACL papers. The average precision was 91%, which was calculated manually. <b>[Limitations]</b> The parameters of models were pre-defined and modified by human. More research is needed to evaluate the performance of this method with texts from other domains. <b>[Conclusions]</b> The proposed model effectively addresses the issue of semantic drifting. It could extract knowledge units precisely, which is an effective solution for the big data acquisition process of intelligence analysis.</p>
%U https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2018.1352