Predicting Time Series of Theft Crimes Based on LSTM Network
Yan Jinghua1,2,3(),Hou Miaomiao3
1National Science Library, Chinese Academy of Sciences, Beijing 100190, China 2Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China 3School of Information and Network Security, People’s Public Security University of China, Beijing 100038, China
[Objective] This paper tries to predict the daily number of theft activities. [Methods] We used LSTM network to analyze theft data from a large city in north China. First, we retrieved our data from January 1, 2005 to February 24, 2007 and from January 1, 2009 to January 7, 2011, respectively. Then, we set three different cases to examine the time series prediction of the daily number. Finally, we compared our results with those of ARIMA, Support Vector Regression, Random Forest and XGBoost with the same data set. [Results] The percentage root mean square error (PRMSE) of our model were 18.4%, 11.7% and 41.9%, respectively, which were better than those of ARIMA, Support Vector Regression, Random Forest or XGBoost model. [Limitations] More research is needed to predict the period when the number of theft crimes fluctuates dramatically. [Conclusions] The proposed model could improve the decision makings for community safety, police patrol and other specific missions.
颜靖华,侯苗苗. 基于LSTM网络的盗窃犯罪时间序列预测研究*[J]. 数据分析与知识发现, 2020, 4(11): 84-91.
Yan Jinghua,Hou Miaomiao. Predicting Time Series of Theft Crimes Based on LSTM Network. Data Analysis and Knowledge Discovery, 2020, 4(11): 84-91.
( Han Yishi, Fan Yingsheng, Li Guojun, et al. Analysis on the Growth Trend of Communication Network Fraud Based on ARIMA Model——Take Quzhou City of Zhejiang Province as an Example[J]. Theoretic Observation, 2017(5):101-103.)
[2]
屈茂辉, 郝士铭 . 基于 ARMA 模型的中国财产类犯罪人数预测研究[J]. 中国刑事法杂志, 2013,23(4):100-106.
[2]
( Qu Maohui, Hao Shiming. Research on the Prediction of the Number of Property Crimes in China Based on ARMA Model[J]. Criminal Science, 2013,23(4):100-106.)
[3]
Gorr W, Olligschlaeger A, Thompson Y. Short-Term Forecasting of Crime[J]. International Journal of Forecasting, 2003,19(4):579-594.
doi: 10.1016/S0169-2070(03)00092-X
[4]
卢睿. 基于灰色理论的网络犯罪形势预测[J]. 警察技术, 2015 ( 4):62-64.
[4]
( Lu Rui. Prediction of Cybercrime Situation Based on Grey Theory[J]. Police Technology, 2015(4):62-64.)
[5]
Alves L G, Ribeiro H V, Rodrigues F A. Crime Prediction Through Urban Metrics and Statistical Learning[J]. Physica A: Statistical Mechanics and Its Applications, 2018,505:435-443.
doi: 10.1016/j.physa.2018.03.084
( Chen Peng, Hu Xiaofeng, Chen Jianguo. The Application of Fuzzy Information Granulation and Support Vector Machine in Crime Forecasting[J]. Science Technology and Engineering, 2015,15(35):54-57, 63.)
( Yu Hongzhi, Liu Fengxin, Zou Kaiqi. Improved Fuzzy BP Neural Network and Its Application in Crime Prediction[J]. Journal of Liaoning Technical University(Natural Science), 2012(2):244-247.)
[8]
Dash S K, Safro I, Srinivasamurthy R S. Spatio-temporal Prediction of Crimes Using Network Analytic Approach[C]// Proceedings of 2018 IEEE International Conference on Big Data. DOI: 10.1109/BigData.2018.8622041.
( Xiao Yanhui, Wang Xin, Feng Wen’gang, et al. Predicting Crime Locations Based on Long Short Term Memory and Convolutional Neural Networks[J]. Data Analysis and Knowledge Discovery, 2018,2(10):15-20.)
[10]
Hochreiter S, Schmidhuber J. Long Short-Term Memory[J]. Neural Computation, 1997,9(8):1735-1780.
pmid: 9377276
[11]
Breuel T M. High Performance Text Recognition Using a Hybrid Convolutional-LSTM Implementation[C]// Proceedings of the 14th IAPR International Conference on Document Analysis & Recognition. 2017: 11-16.
[12]
Sun L F, Kang S K, Li K, et al. Voice Conversion Using Deep Bidirectional Long Short-Term Memory Based Recurrent Neural Networks[C]// Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2015: 4869-4873.
( Feng Chen, Chen Zhide. Application of Weighted Combination Model Based on XGBoost and LSTM in Sales Forecasting[J]. Computer Systems and Applications, 2019,28(10):226-232.)
( Zhou Rui, Wei Zhengying, Zhang Yubin, et al. Time Series Prediction of Tomato Yield Based on LSTM Recurrent Neural Network[J]. Water Saving Irrigation, 2018(8):66-70.)
[16]
Box G E, Jenkins G M, Reinsel G C, et al. Time Series Analysis: Forecasting and Control[M]. John Wiley & Sons, 2015.
( Gong An, Ma Guangming, Guo Wenting, et al. Nuclear Power Equipment Status Prediction Based on LSTM Recurrent Neural Network[J]. Computer Technology and Development, 2019,29(10):41-45.)
[18]
Ma X L, Tao Z M, Wang Y H, et al. Long Short-Term Memory Neural Network for Traffic Speed Prediction Using Remote Microwave Sensor Data[J]. Transportation Research Part C: Emerging Technologies, 2015,54:187-197.
doi: 10.1016/j.trc.2015.03.014
[19]
Srivastava N, Mansimov E, Salakhudinov R. Unsupervised Learning of Video Representations Using LSTMs[C]// Proceedings of the 32nd International Conference on Machine Learning. 2015: 843-852.
[20]
Hu X F, Li D, Huang H, et al. Modeling and Sensitivity Analysis of Transport and Deposition of Radionuclides from the Fukushima Dai-ichi Accident[J]. Atmospheric Chemistry & Physics, 2014,14(20):11065-11092.
[21]
Drucker H, Burges C J C, Kaufman L, et al. Support Vector Regression Machines[C]// Proceedings of the 9th International Conference on Neural Information Processing Systems. 1996.
[22]
Liaw A, Wiener M. Classification and Regression by Random Forest[J]. R News, 2002,2(3):18-22.
[23]
Chen T Q, Guestrin C. XGBoost: A Scalable Tree Boosting System[C]// Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016: 785-794.