Abstract:Based on the analysis of text categorization and the theory of Formal Concept Analysis(FCA),this paper elaborates the text categorization method of using FCA according to the medical field text characteristics.It uses Classification of Chinese to construct the formal context, and generates concept lattices, which are used to classify the medical field text and make classification effect close to artificial classification.This paper explores a new approach based on FCA for medical text categorization.
徐坤, 曹锦丹, 毕强. FCA在医学领域文本分类中的研究和应用[J]. 现代图书情报技术, 2012, 28(3): 23-26.
Xu Kun, Cao Jindan, Bi Qiang. A Study and Application on Medical Text Categorization Based on FCA. New Technology of Library and Information Service, 2012, 28(3): 23-26.
[1] 张铭,宋炜. 语义网简明教程[M].北京:高等教育出版社,2004:36-45.(Zhang Ming,Song Wei. A First Step Towards the Semantic Web[M].Beijing: Higher Education Press,2004:36-45.)[2] 孙霞,郑庆华,刘均. Web 知识挖掘:理论、方法与应用[M]. 北京:科学出版社, 2010:65-77.(Sun Xia,Zheng Qinghua,Liu Jun.Web Knowledge Mining: Theory, Methods and Applications[M].Beijing:Science Press,2010:65-77.)[3] Wille R.Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts[A].//Rival I.Ordered Sets[M].Dordrecht: Reidel,1982:445-470.[4] 樊旭琴.形式概念分析在突发事件新闻文本聚类中的应用[D].太原:山西大学,2010.(Fan Xuqin.The Application of Emergency News Text Clustering Based on Formal Concept Analysis [D].Taiyuan: Shanxi University, 2010.)[5] Hu X G, Chen H,Ma F. The Mining of Classification Rules Based on Multiple Extended Concept Lattice[C].In:Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Guangzhou, China.2005:18-21.[6] Wang H, Yang J, Hu X G. A New Classification Algorithm Based on Entropy and Relative Reduced Exended Concept Lattice[C]. In:Proceedings of 2004 International Conference on Machine Learning and Cybernetics.2004:26-29.[7] 周顽,周才学. 基于扩展概念格模型的文本分类规则提取的研究[J]. 计算机工程与科学,2010,32(8):98-101.(Zhou Wan,Zhou Caixue.Research on the Extracting Rules of Text Categorization Based on the Extended Concept Lattice Model[J]. Computer Engineering & Science, 2010, 32(8):98-101.)[8] Yang Y. An Evaluation of Statistical Approaches to Text Categorization [J]. Journal of Information Retrieval,1999,l(1-2):69-90.[9] Joachims T. Text Categorization with Support Vector Machine: Learning with Many Relevant Features[C].In:Proceedings of the 10th European Conference on Machine Learning.1998:137-142.[10] Luo S, Tapas K. Thresholding Strategies for Text Classifiers: TREC 2005 Biomedical Triage Task Experiments[C]. In:Proceedings of the 14th Text Retrieval Conference.2005.[11] Niu J, Sun L. WIM at TREC 2005[C]. In:Proceedings of the 14th Text Retrieval Conference.2005.[12] Yang Z, Lin H.TREC 2005 Genomics Track Experiments at DUTAI[C]. In:Proceedings of the 14th Text Retrieval Conference.2005.[13] 倪茂树, 赵晶, 林鸿飞. 生物医学文本分类方法比较研究[J]. 计算机工程与应用,2007,43(12):147-150.(Ni Maoshu,Zhao Jing,Lin Hongfei. Comparison Study on Categorization Algorithms for Biomedical Literatures[J]. Computer Engineering and Applications, 2007,43(12):147-150.)[14] 马张华. 数字环境下文献分类法的检索应用及其发展[J]. 大学图书馆学报,2011,29(4):64-68.(Ma Zhanghua. A Study on Searching Applications and Development of Document Classification Under the Digital-environment [J]. Journal of Academic Libraries, 2011,29(4):64-68.)[15] Godin R,Missaoui R,Alaoui H.Incremental Concept Formation Algorithms Based on Galois(Concept) Lattices[J]. Computational Intelligence, 1995,11(2):246-267.