Extracting Text Features with Improved Fruit Fly Optimization Algorithm
Wen Tingxin1, Li Yangzi1(), Sun Jingshuang2
1Institute of Systems Engineering, Liaoning Technical University, Huludao 125105, China 2 College of Business Administration, Liaoning Technical University, Huludao 125105, China
[Objective] This paper tries to reduce the dimension of text feature vector space and then improves the accuracy of text classification. [Methods] We proposed a text feature selection model IFOATFSO based on the improved fruit fly optimization algorithm. It introduced the classification accuracy variance to monitor the convergence degree of the model. We also used the crossover operator, roulette wheel selection method based on simulated annealing mechanism and genetic algorithm to deepen global search and improve population diversity. [Results] The IFOATFSO model, which optimized the feature selection based on CHI method, not only reduced the feature dimension, but also improved the accuracy of text classification by up to 10.5%. [Limitations] The performance of IFOATFSO model for extracting English text features needs to be improved. [Conclusions] The IFOATFSO model improves the text classification.
温廷新, 李洋子, 孙静霜. 基于改进的果蝇优化算法的文本特征选择优化模型[J]. 数据分析与知识发现, 2018, 2(5): 59-69.
Wen Tingxin,Li Yangzi,Sun Jingshuang. Extracting Text Features with Improved Fruit Fly Optimization Algorithm. Data Analysis and Knowledge Discovery, 2018, 2(5): 59-69.
(Lu Yonghe, Liang Minghui.Improvement of Text Feature Extraction with Genetic Algorithm[J]. New Technology of Library and Information Service, 2014(4): 48-57.)
[4]
张彪. 文本分类中特征选择算法的分析与研究[D]. 合肥: 中国科学技术大学, 2010.
[4]
(Zhang Biao.Analysis and Research on Feature Selection Algorithm for Text Classification [D]. Hefei: University of Science and Technology of China, 2010.)
(Shi Hui, Jia Daiping, Miao Pei.Improved Information Gain Text Feature Selection Algorithm Based on Word Frequency Information[J]. Journal of Computer Applications, 2014, 34(11): 3279-3282.)
doi: 10.11772/j.issn.1001-9081.2014.11.3279
(Liu Song, Zhang Dexian.Mutual Information Feature Selection Method Based on Weight Difference and Categories Association[J]. Application Research of Computers, 2014, 31(7): 1998-2000.)
doi: 10.3969/j.issn.1001-3695.2014.07.017
[8]
Uğuz H.A Two-stage Feature Selection Method for Text Categorization by Using Information Gain, Principal Component Analysis and Genetic Algorithm[J]. Knowledge- Based Systems, 2011, 24(7): 1024-1032.
doi: 10.1016/j.knosys.2011.04.014
(Wu Kaijun, Lu Huaiwei.PCGA Used to Solve Text Feature Selection[J]. Systems Engineering — Theory & Practice, 2012, 32(10): 2215-2220.)
doi: 10.3969/j.issn.1000-6788.2012.10.012
[10]
Lu Y, Liang M, Ye Z, et al.Improved Particle Swarm Optimization Algorithm and Its Application in Text Feature Selection[J]. Applied Soft Computing, 2015, 35(C): 629-636.
doi: 10.1016/j.asoc.2015.07.005
[11]
Dadaneh B Z, Markid H Y, Zakerolhosseini A.Unsupervised Probabilistic Feature Selection Using Ant Colony Optimization[J]. Expert Systems with Applications, 2016, 53: 27-42.
doi: 10.1016/j.eswa.2016.01.021
(Xiao Zhenjiu, Sun Jian, Wang Yongbin, et al.Wavelet Domain Digital Watermarking Method Based on Fruit Fly Optimization Algorithm[J]. Journal of Computer Applications, 2015, 35(9): 2527-2530.)
doi: 10.11772/j.issn.1001-9081.2015.09.2527
[15]
Li M W, Geng J, Han D F, et al.Ship Motion Prediction Using Dynamic Seasonal RvSVR with Phase Space Reconstruction and the Chaos Adaptive Efficient FOA[J]. Neurocomputing, 2016, 174: 661-680.
doi: 10.1016/j.neucom.2015.09.089
(Geng Liyan, Chen Lihua.Forecast on Railway Traffic Volume Using Mixed-kernel LSSVM Optimized by FOA[J]. Application Research of Computers, 2017, 34(2): 409-412.)
doi: 10.3969/j.issn.1001-3695.2017.02.020
(Tian Xu, Li Jie.An Improved Fruit Fly Optimization Algorithm and Its Application in Aerodynamic Optimization Design[J]. Acta Aeronautica et Astronautica Sinica, 2017, 38(4): 120370.)
doi: 10.7527/S1000-6893.2016.0198
(Xu Tongwei, He Qing, Wu Yile, et al.Spectrum Allocation Based on Quantum Fruit Fly Optimization Algorithm in Cognitive Radio Network[J]. Application Research of Computers, 2017, 34(10): 3116-3120.)
doi: 10.3969/j.issn.1001-3695.2017.10.052
(Wang Yan, Zhang Bo, Xue Bo.Research on Chinese Classification Based on FOA-SVM[J]. Journal of Sichuan University: Natural Science Edition, 2016, 53(4): 759-763.)