Please wait a minute...
Data Analysis and Knowledge Discovery
Current Issue | Archive | Adv Search |
Hierarchical Multi-label Classification of Children's books for Graded Reading
Cheng Quan, Dong Jia
(School of Economics and Management, Fuzhou University, Fuzhou 350116, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] In order to realize the automatic classification of children's books,a hierarchical multi-label classification model of children's books is constructed to guide children readers to choose books that are more suitable for their own development. [Methods]The concept of graded reading is embodied into a hierarchical classification system, a hierarchical multi-label text classification model based on ERNIE-HAM is constructed by deep learning technology. [Results]By comparing the four pre-trained models, the ERNIE-HAM model has better performance in the second and third layers of the hierarchical classification of children's books; comparing the single layer algorithm, the hierarchical algorithm improves the  by about 11% in both the second and third layers; comparing the two hierarchical multi-label classification models, HFT-CNN and HMCN, the ERNIE-HAM model has the third layer improved by 12.79% and 6.48% in the classification results, respectively. [Limitations]The overall classification results of the model have not yet reached expectations, and more refinement and exploration can be done in the future in terms of the expansion of the data set and algorithm design. [Conclusions]The effectiveness of the ERNIE-HAM model proposed in this study on the hierarchical multi-label classification task of children's reading materials was verified through three sets of comparison experiments.

Key words Graded reading      Classification of children's books      Classification system      Hierarchical multi-label text classification.      
Published: 11 November 2022

Cite this article:

Cheng Quan, Dong Jia. Hierarchical Multi-label Classification of Children's books for Graded Reading . Data Analysis and Knowledge Discovery, 0, (): 1-.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022-0649     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y0/V/I/1

[1] Hu Zhengyin, Fang Shu, Wen Yi, Zhang Xian, Liang Tian. Study on Automatic Classification of Patents Oriented to TRIZ[J]. 现代图书情报技术, 2015, 31(1): 66-74.
[2] Li Jia, Zhang Pengzhu, Li Xinmiao. A Design Framework for Speech Act Taxonomy of Online Group Discussion[J]. 现代图书情报技术, 2012, 28(2): 1-9.
[3] Liu Hua. Construction of a Super Classed and Denoted Corpus[J]. 现代图书情报技术, 2006, 22(1): 71-73.
[4] Wang Huaming. RESEARCH ON THE THEORY AND POLICY OF CHINESE STANDARDIZATION FOR INFORMATION AND DOCAMENTATION[J]. 现代图书情报技术, 1995, 11(4): 16-18.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn