[Objective]This paper tries to improve the effectiveness and efficiency of acquiring decision-making knowledge from Financial Institutions. [Methods]First, we built a framework of an acquisition system for financial decision-making knowledge, which used neighborhood rough set to remove redundant attributes. Then, we adopted the SMOTE method to balance the data. We also applied grid search method to optimize parameters of the ensemble classifiers. Third, we trained and used the new model to identify the optimal reduction group. Finally, we acquired the needed knowledge through the optimal reduction, and stored them in the database.[Results]We examined the proposed method with 4,521 pieces of financial record, which yielded sensitivity of 83.55%, specificity of 80.74% and AUC of 0.8214. [Limitations]We did not run the proposed model with data of insurance or consumer loans. [Conclusions] The proposed method could improve the classification performance of financial decision-making system, which could identify and acquire knowledge of key customers effectively.
李静,刘潇,王效俐. 邻域粗糙集融合网格搜索组合分类器的理财决策知识获取研究*[J]. 数据分析与知识发现, 2019, 3(1): 85-94.
Jing Li,Xiao Liu,Xiaoli Wang. Financial Decision Knowledge Acquisition Based on Neighborhood Rough Set and Ensemble Classifiers with Grid Search. Data Analysis and Knowledge Discovery, DOI：10.11925/infotech.2096-3467.2018.0323.
McKenney J L, Scott M M. Management Decision Systems: Computer-Based Support for Decision Making[M]. Harvard Business School Press, 1984.
Kim N S, Park B, Lee K D.A Knowledge Based Freight Management Decision Support System Incorporating Economies of Scale: Multimodal Minimum Cost Flow Optimization Approach[J]. Information Technology and Management, 2016, 17(1): 81-94.
(Wang Junhua, Gao Qisheng, Ma Jinyan.Analysis on the Behavior of Consumers Purchasing Bank’s Financial Products—Based on Multi-level Logistic Model[J]. Journal of Qingdao University: Natural Science Edition, 2017, 30(4): 109-116.)
Pawlak Z.Rough Sets[J]. International Journal of Computer and Information Sciences, 1982, 11(5): 341-356.
(Zhao Guanhua.Study on Financial Distress Prediction Model of Least Squares Support Vector Machine of Dual Constraint Type Based on Attribute Reduction of Neighborhood Rough Set[J]. Operations Research and Management Science, 2011, 20(3): 132-139.)
Hu Q, Yu D, Xie Z.Neighborhood Classifiers[J]. Expert Systems with Applications, 2008, 34(2): 866-876.
Chawla N V, Bowyer K W, Hall L O, et al.SMOTE: Synthetic Minority Over-Sampling Technique[J]. Journal of Artificial Intelligence Research, 2002, 16(1): 321-357.
Luengo J, Fernández A, García S, et al.Addressing Data Complexity for Imbalanced Data Sets: Analysis of SMOTE-Based Oversampling and Evolutionary Undersampling[J]. Soft Computing, 2011, 15(10): 1909-1936.
Bunkhumpornpat C, Sinapiromsaran K, Lursinsap C.Safe-Level-Smote: Safe-Level-Synthetic Minority Over-Sampling Technique for Handling the Class Imbalanced Problem[C]// Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining. Springer, 2009: 475-482.
Ramentol E, Caballero Y, Bello R, et al.SMOTE-RSB*: A Hybrid Preprocessing Approach Based on Oversampling and Undersampling for High Imbalanced Data-Sets Using SMOTE and Rough Sets Theory[J]. Knowledge and Information Systems, 2012, 33(2): 245-265.
Wang X Z, Xing H J, Li Y, et al.A Study on Relationship Between Generalization Abilities and Fuzziness of Base Classifiers in Ensemble Learning[J]. IEEE Transactions on Fuzzy Systems, 2015, 23(5): 1638-1654.
Schapire R E.The Strength of Weak Learnability[J]. Machine Learning, 1990, 5(2): 197-227.
(Ji Changming, Zhou Ting, Xiang Tengfei, et al.Application of Support Vector Machine Based on Grid Search and Cross Validation in Implicit Stochastic Dispatch of Cascaded Hydropower Stations[J]. Electric Power Automation Equipment, 2014, 34(3): 125-131.)
Moro S, Cortez P, Rita P.A Data-Driven Approach to Predict the Success of Bank Telemarketing[J]. Decision Support Systems, 2014, 62: 22-31.