Please wait a minute...
New Technology of Library and Information Service  2012, Vol. 28 Issue (3): 23-26    DOI: 10.11925/infotech.1003-3513.2012.03.04
Current Issue | Archive | Adv Search |
A Study and Application on Medical Text Categorization Based on FCA
Xu Kun1, Cao Jindan1, Bi Qiang2
1. School of Public Health, Jilin University, Changchun 130021, China;
2. School of Management, Jilin University, Changchun 130022, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  Based on the analysis of text categorization and the theory of Formal Concept Analysis(FCA),this paper elaborates the text categorization method of using FCA according to the medical field text characteristics.It uses Classification of Chinese to construct the formal context, and generates concept lattices, which are used to classify the medical field text and make classification effect close to artificial classification.This paper explores a new approach based on FCA for medical text categorization.
Key wordsText categorization      Medical field text      Formal concept analysis      Concept lattices     
Received: 10 February 2012      Published: 19 April 2012
: 

G202

 

Cite this article:

Xu Kun, Cao Jindan, Bi Qiang. A Study and Application on Medical Text Categorization Based on FCA. New Technology of Library and Information Service, 2012, 28(3): 23-26.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2012.03.04     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2012/V28/I3/23

[1] 张铭,宋炜. 语义网简明教程[M].北京:高等教育出版社,2004:36-45.(Zhang Ming,Song Wei. A First Step Towards the Semantic Web[M].Beijing: Higher Education Press,2004:36-45.)

[2] 孙霞,郑庆华,刘均. Web 知识挖掘:理论、方法与应用[M]. 北京:科学出版社, 2010:65-77.(Sun Xia,Zheng Qinghua,Liu Jun.Web Knowledge Mining: Theory, Methods and Applications[M].Beijing:Science Press,2010:65-77.)

[3] Wille R.Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts[A].//Rival I.Ordered Sets[M].Dordrecht: Reidel,1982:445-470.

[4] 樊旭琴.形式概念分析在突发事件新闻文本聚类中的应用[D].太原:山西大学,2010.(Fan Xuqin.The Application of Emergency News Text Clustering Based on Formal Concept Analysis [D].Taiyuan: Shanxi University, 2010.)

[5] Hu X G, Chen H,Ma F. The Mining of Classification Rules Based on Multiple Extended Concept Lattice[C].In:Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Guangzhou, China.2005:18-21.

[6] Wang H, Yang J, Hu X G. A New Classification Algorithm Based on Entropy and Relative Reduced Exended Concept Lattice[C]. In:Proceedings of 2004 International Conference on Machine Learning and Cybernetics.2004:26-29.

[7] 周顽,周才学. 基于扩展概念格模型的文本分类规则提取的研究[J]. 计算机工程与科学,2010,32(8):98-101.(Zhou Wan,Zhou Caixue.Research on the Extracting Rules of Text Categorization Based on the Extended Concept Lattice Model[J]. Computer Engineering & Science, 2010, 32(8):98-101.)

[8] Yang Y. An Evaluation of Statistical Approaches to Text Categorization [J]. Journal of Information Retrieval,1999,l(1-2):69-90.

[9] Joachims T. Text Categorization with Support Vector Machine: Learning with Many Relevant Features[C].In:Proceedings of the 10th European Conference on Machine Learning.1998:137-142.

[10] Luo S, Tapas K. Thresholding Strategies for Text Classifiers: TREC 2005 Biomedical Triage Task Experiments[C]. In:Proceedings of the 14th Text Retrieval Conference.2005.

[11] Niu J, Sun L. WIM at TREC 2005[C]. In:Proceedings of the 14th Text Retrieval Conference.2005.

[12] Yang Z, Lin H.TREC 2005 Genomics Track Experiments at DUTAI[C]. In:Proceedings of the 14th Text Retrieval Conference.2005.

[13] 倪茂树, 赵晶, 林鸿飞. 生物医学文本分类方法比较研究[J]. 计算机工程与应用,2007,43(12):147-150.(Ni Maoshu,Zhao Jing,Lin Hongfei. Comparison Study on Categorization Algorithms for Biomedical Literatures[J]. Computer Engineering and Applications, 2007,43(12):147-150.)

[14] 马张华. 数字环境下文献分类法的检索应用及其发展[J]. 大学图书馆学报,2011,29(4):64-68.(Ma Zhanghua. A Study on Searching Applications and Development of Document Classification Under the Digital-environment [J]. Journal of Academic Libraries, 2011,29(4):64-68.)

[15] Godin R,Missaoui R,Alaoui H.Incremental Concept Formation Algorithms Based on Galois(Concept) Lattices[J]. Computational Intelligence, 1995,11(2):246-267.
[1] Liu Ping,Peng Xiaofang. Calculating Word Similarities Based on Formal Concept Analysis[J]. 数据分析与知识发现, 2020, 4(5): 66-74.
[2] Jie Ma,Yan Ge,Hongyu Pu. Survey of Attribute Reduction Methods[J]. 数据分析与知识发现, 2020, 4(1): 40-50.
[3] Zhanglu Tan,Zhaogang Wang,Han Hu. Study on a Method of Feature Classification Selection Based on χ2 Statistics[J]. 数据分析与知识发现, 2019, 3(2): 72-78.
[4] Li Xiangdong,Gao Fan,Li Youhai. Categorizing Documents Automatically within Common Semantic Space[J]. 数据分析与知识发现, 2018, 2(9): 66-73.
[5] Liu Ping,Li Yanan,Yu Cong. Building Interactive Knowledge Map for Academic Search[J]. 数据分析与知识发现, 2018, 2(12): 43-51.
[6] Feng Guoming,Zhang Xiaodong,Liu Suhui. Classifying Chinese Texts with CapsNet[J]. 数据分析与知识发现, 2018, 2(12): 68-76.
[7] Xu Dongdong, Wu Shaobo. An Improved TF-IDF Feature Selection Based on Categorical Description[J]. 现代图书情报技术, 2015, 31(3): 39-48.
[8] Tan Xueqing, Zhou Tong, Luo Lin. A Text Classification Algorithm Based on the Average Category Similarity[J]. 现代图书情报技术, 2014, 30(9): 66-73.
[9] Li Xiangdong, He Haihong, Cao Huan, Huang Li. An Algorithm of Digital Resources Text Categorization for Training Sets Skewed Distribution[J]. 现代图书情报技术, 2014, 30(7): 24-33.
[10] Li Xiangdong, Liao Xiangpeng, Huang Li. Research and Implementation of Bibliographic Information Classification System in LDA Model[J]. 现代图书情报技术, 2014, 30(5): 18-25.
[11] Lu Yonghe, Liang Minghui. Improvement of Text Feature Extraction with Genetic Algorithm[J]. 现代图书情报技术, 2014, 30(4): 48-57.
[12] Yan Shiyan, Wang Shengqing, Luo Yunchuan, Huang Haojun. An Ontology Collaborative Construction Model Based on FCA in Cloud Computing Environment[J]. 现代图书情报技术, 2014, 30(3): 49-56.
[13] Wang Hao, Ye Peng, Deng Sanhong. The Application of Machine-Learning in the Research on Automatic Categorization of Chinese Periodical Articles[J]. 现代图书情报技术, 2014, 30(3): 80-87.
[14] Lu Yonghe, Li Yanfeng. A Feature Selection Based on Consideration of Multiple Factors[J]. 现代图书情报技术, 2013, (5): 34-39.
[15] Qu Peng, Wang Huilin. Fundamental Research Questions in Patent Text Categorization[J]. 现代图书情报技术, 2013, 29(3): 38-44.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn