1 School of Management, Guangdong University of Technology, Guangzhou 510520, China 2 The First Affiliated Hospital of Guangzhou Medical University,Guangzhou 510120, China
[Objective] This paper tries to effectively predict stroke risks, aiming to improve the diagnoses, treatments and interventions of stroke. [Methods] Firstly, we collected about 6000 inpatient medical records from a top hospital. Then, we identified 12 risk factors affecting stroke with logistic regression modeling. Thirdly, we constructed a multi-layer neural network model to predict stroke risks. Finally, we implemented the model with Python to examine its effectiveness. [Results] I. Total cholesterol and low-density lipoprotein etc. are the most important risk factors affecting the onset of stroke. II. When the number of hidden layer neurons was 7, the risk prediction model accuracy reached 97.10%.[Limitations] We need to include more risk factors and use multiple machine learning models for comparative analyses. [Conclusion] The proposed model could effectively predict the stoke risks facing patients.
( Wang Longde, Liu Jianmin, Yang Yi , et al. The Prevention and Treatment of Stroke Still Face Huge Challenges——Brief Report on Stroke Prevention and Treatment in China, 2018[J]. Chinese Circulation Journal, 2019,34(2):105-119.)
[2]
Kim A S, Cahill E, Cheng N T . Global Stroke Belt: Geographic Variation in Stroke Burden Worldwide[J]. Stroke, 2015,46(12):3564-3570.
[3]
Moran A, Gu D, Zhao D , et al. Future Cardiovascular Disease in China: Markov Model and Risk Factor Scenario Projections from the Coronary Heart Disease Policy Model-China[J]. Circulation: Cardiovascular Quality and Outcomes, 2010,3(3):243-252.
[4]
Hijazi Z, Lindbäck J, Alexander J H , et al. The ABC (Age, Biomarkers, Clinical History) Stroke Risk Score: A BiomarkerBased Risk Score for Predicting Stroke in Atrial Fibrillation[J]. European Heart Journal, 2016,37(20):1582-1590.
[5]
Wang Y, Wang J, Cheng J , et al. Is the Population Detected by Screening in China Truly at High Risk of Stroke?[J]. Journal of Stroke and Cerebrovascular Diseases, 2018,27(8):2118-2123.
( Shao Zeguo, Chen Chen, Chen Wei . Analysis of Risk Factors of Daily Life Habits in Stroke Based on Optimal Decision Tree[J]. Modern Preventive Medicine, 2018,45(15):2689-2693.)
[7]
Chauhan S, Vig L, De Grazia M D F , et al. A Comparison of Shallow and Deep Learning Methods for Predicting Cognitive Performance of Stroke Patients from MRI Lesion Images[J]. Frontiers in Neuroinformatics. https://doi.org/10.3389/fninf. 2019. 00053.
[8]
Almadani O, Alshammari R . Prediction of Stroke Using Data Mining Classification Techniques[J]. International Journal of Advanced Computer Science and Applications, 2018,9(1):457-460.
[9]
Feigin V L, Mensah G A, Norrving B , et al. for the GBD 2013 Stroke Panel Experts Group. Atlas of the Global Burden of Stroke (1990-2013): The GBD 2013 Study[J]. Neuroepidemiology, 2015,45(3):230-236.
[10]
Jusuf M I, Machfoed M H, Keman S . Infarction Stroke Risk Prediction Model for Indonesian Population: A Case-Control Study[J]. Bangladesh Journal of Medical Science, 2016,15(2):269-274.
[11]
Aigner A, Grittner U, Rolfs A , et al. Contribution of Established Stroke Risk Factors to the Burden of Stroke in Young Adults[J]. Stroke, 2017,48(7):1744-1751.
[12]
Wang J, Wen X, Li W , et al. Risk Factors for Stroke in the Chinese Population: A Systematic Review and Meta-analysis[J]. Journal of Stroke and Cerebrovascular Diseases, 2017,26(3):509-517.
( Du Qiuming, Cao Shuhua, Wang Shuliang , et al. Analysis of Influencing Factors of Acute Cerebral Infarction in Patients with Hypertension[J]. Chinese Journal of Prevention and Control of Chronic Diseases, 2018,26(2):133-137.)
[14]
Navis A, Garcia-Santibanez R, Skliut M . Epidemiology and Outcomes of Ischemic Stroke and Transient Ischemic Attack in the Adult and Geriatric Population[J]. Journal of Stroke and Cerebrovascular Diseases, 2018,28(1):84-89.
( Li Min, Wang Chunxia, Xia Bing , et al. Risk Prediction Model for Stroke in Health Management Population[J]. Journal of Shandong University: Medical Sciences, 2017,55(6):93-97, 103.)
[16]
Cai R, Zhu B, Ji L , et al. An CNN-LSTM Attention Approach to Understanding User Query Intent from Online Health Communities [C]// Proceedings of 2017 IEEE International Conference on Data Mining Workshops (ICDMW). IEEE, 2017: 430-437.
[17]
Mackay J, Mensah G A, Greenlund K . The Atlas of Heart Disease and Stroke[M]. World Health Organization, 2004.
[18]
Huang S C, Huang Y F . Bounds on the Number of Hidden Neurons in Multilayer Perceptrons[J]. IEEE Transactions on Neural Networks, 1991,2(1):47-55.
[19]
Piri S, Delen D, Liu T , et al. A Data Analytics Approach to Building a Clinical Decision Support System for Diabetic Retinopathy: Developing and Deploying a Model Ensemble[J]. Decision Support Systems, 2017,101:12-27.
[20]
Yang X, Li J, Hu D , et al. Predicting the 10-year Risks of Atherosclerotic Cardiovascular Disease in Chinese Population: The China-PAR Project (Prediction for ASCVD Risk in China)[J]. Circulation, 2016,134(19):1430-1440.
[21]
Agarwal R, Dhar V . Big Data, Data Science, and Analytics: The Opportunity and Challenge for IS Research[J]. Information Systems Research, 2014,25(3):443-448.
[22]
Lin Y K, Chen H, Brown R A , et al. Health Care Predictive Analytics for Risk Profiling in Chronic Care: A Bayesian Multitask Learning Approach[J]. MIS Quarterly, 2017,41(2):473-495.