|
|
Predicting Prices and Analyzing Features of Online Short-Term Rentals Based on XGBoost |
Cao Rui1,Liao Bin1(),Li Min1,2,Sun Ruina1,3,4 |
1College of Statistics and Data Science, Xinjiang University of Finance & Economics, Urumqi 830012, China 2School of Information Science and Engineering, Xinjiang University, Urumqi 830008, China 3Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China 4School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China |
|
|
Abstract [Objective] This paper proposed a model to predict prices and analyze properties of online short-term rentals based on XGBoost, aiming to address the issue of lacking reasonable pricing suggestion mechanism for housing with different characteristics. [Methods] We collected data from the Airbnb platform and used Lasso to extract features from these raw data as well as reduced their dimensions. Then, we input the extracted data to XGBoost and iteratively trained the prediction model. Finally, we used the SHAP value to interpret the model features. [Results] The RMSE, MAE and R-squared values of the proposed model were 0.091, 0.065 and 0.798 respectively after tuning the hyperparameters, which were better than those of the four existing models. [Limitations] Our new model could not merge the features of real-time online business data, which influenced the prediction accuracy. [Conclusions] The proposed model has good interpretability, and could identify the key factors affecting housing prices, which helps the landlords improve services.
|
Received: 29 November 2020
Published: 06 July 2021
|
|
Fund:National Natural Science Foundation of China(61562078);Tianshan Youth Program of Xinjiang(2018Q073) |
Corresponding Authors:
Liao Bin
E-mail: liaobin665@163.com
|
[1] |
吴新宇, 吴捷. 在线短租市场研究——以蚂蚁短租为例[J]. 中外企业家, 2018(35):77-78.
|
[1] |
(Wu Xinyu, Wu Jie. Research on the Online Short-term Rental Market--Case Study of Ant Short-term Rental[J]. Chinese Foreign Entrepreneurs, 2018 (35):77-78.)
|
[2] |
国家信息中心分享经济研究中心. 《中国共享住宿发展报告2020》[R]. 2020.
|
[2] |
(State Information Center Sharing Economic Research Center. China Shared Accommodation Development Report in 2020[R]. 2020.)
|
[3] |
王保乾, 邓菲. 基于消费者偏好选择的短租房市场定价因素研究[J]. 统计与信息论坛, 2018,33(7):92-99.
|
[3] |
(Wang Baoqian, Deng Fei. Research on Market Pricing Factors Based on Consumer Preferences to Choose Short Rental[J]. Statistics & Information Forum, 2018,33(7):92-99.)
|
[4] |
Wang D, Nicolau J L. Price Determinants of Sharing Economy Based Accommodation Rental: A Study of Listings from 33 Cities on Airbnb.com[J]. International Journal of Hospitality Management, 2017,62:120-131.
doi: 10.1016/j.ijhm.2016.12.007
|
[5] |
武亮. 共享经济下短租商业模式创新策略研究——基于途家短租模式的分析[J]. 价格理论与实践, 2019(1):149-152.
|
[5] |
(Wu Liang. Research on the Innovation Strategy of Short-term Business Model Under the Shared Economy——Analysis Based on Tujia Short-term Rental Model[J]. Price: Theory & Practice, 2019(1):149-152.)
|
[6] |
徐燕, 戴菲. 分享经济下在线短租商业模式画布创新研究——基于小猪短租商业模式与途家短租比较分析[J]. 价格理论与实践, 2019(6):137-140.
|
[6] |
(Xu Yan, Dai Fei. Research on Canvas Innovation of Online Short-Term Business Model Under the Sharing Economy——Based on the Comparative Analysis of the Short-Term Business Model of Piglet and the Short-Term Rent of Tujia[J]. Price: Theory & Practice, 2019(6):137-140.)
|
[7] |
李立威. 分享经济中多层信任的构建机制研究——基于Airbnb和小猪短租的案例分析[J]. 电子政务, 2019(2):101-107.
|
[7] |
(Li Liwei. Research on the Construction Mechanism of Multi-Layer Trust in the Sharing Economy——Based on the Cases of Airbnb and Xiaozhu Short-Term Rental[J]. E-Government, 2019(2):101-107.)
|
[8] |
赵建欣, 朱阁, 宋玲玉. 在线短租平台用户住宿决策影响因素研究[J]. 北京邮电大学学报(社会科学版), 2017,19(5):56-61.
|
[8] |
(Zhao Jianxin, Zhu Ge, Song Lingyu. Influencing Factors of User Decision via Online Short-rent Platform[J]. Journal of Beijing University of Posts and Telecommunications(Social Sciences Edition), 2017,19(5):56-61.)
|
[9] |
凌超, 张赞. “分享经济”在中国的发展路径研究——以在线短租为例[J]. 现代管理科学, 2014(10):36-38.
|
[9] |
(Ling Chao, Zhang Zan. Research on the Development Path of "Sharing Economy" in China—— Case Study of Online Short-term Rental[J]. Modern Management Science, 2014(10):36-38.)
|
[10] |
阮连法, 张跃威, 张鑫. 基于特征价格与SVM的二手房价格评估[J]. 技术经济与管理研究, 2008(5):75-78.
|
[10] |
(Ruan Lianfa, Zhang Yuewei, Zhang Xin. Price Appraisal of Second-hand Housing Based on Hedonic Price and SVM[J]. Journal of Technical Economics & Management, 2008(5):75-78.)
|
[11] |
徐戈, 张科. 基于随机森林模型的房产价格评估[J]. 统计与决策, 2014(17):22-25.
|
[11] |
(Xu Ge, Zhang Ke. Real Estate Price Evaluation Based on Random Forest Model[J]. Statistics & Decision, 2014(17):22-25.)
|
[12] |
唐晓彬, 张瑞, 刘立新. 基于蝙蝠算法SVR模型的北京市二手房价预测研究[J]. 统计研究, 2018,35(11):71-81.
|
[12] |
(Tang Xiaobin, Zhang Rui, Liu Lixin. Research on Forecast of Second-hand House Price in Beijing Based on SVR Model of Bat Algorithm[J]. Statistical Research, 2018,35(11):71-81.)
|
[13] |
董倩, 孙娜娜, 李伟. 基于网络搜索数据的房地产价格预测[J]. 统计研究, 2014,31(10):81-88.
|
[13] |
(Dong Qian, Sun Nana, Li Wei. Real Estate Price Prediction Based on Web Search Data[J]. Statistical Research, 2014,31(10):81-88.)
|
[14] |
邓磊. 基于机器学习的酒店价格预测分析[D]. 南京: 东南大学, 2017.
|
[14] |
(Deng Lei. The Analysis of Hotel Price Prediction Based on Machine Learning[D]. Nanjing: Southeast University, 2017.)
|
[15] |
Zhang H L, Zhang J, Lu S J, et al. Modeling Hotel Room Price with Geographically Weighted Regression[J]. International Journal of Hospitality Management, 2011,30(4):1036-1043.
doi: 10.1016/j.ijhm.2011.03.010
|
[16] |
夏学文. 商品房价格预测模型及其应用[J]. 统计学与应用, 2017,6(1):81-86.
|
[16] |
(Xia Xuewen. The Price Forecast Model of Commodity Houses and Its Application[J]. Statistics and Application, 2017,6(1):81-86.)
|
[17] |
龙会典, 张海燕. 基于ARIMA模型的广州市商品房价格预测[J]. 商业研究, 2007(7):211-213.
|
[17] |
(Long Huidian, Zhang Haiyan. Prediction of Commodity Housing Prices in Guangzhou Based on ARIMA Model[J]. Commercial Research, 2007(7):211-213.)
|
[18] |
谢勇, 项薇, 季孟忠, 等. 基于XGBoost和LightGBM算法预测住房月租金的应用分析[J]. 计算机应用与软件, 2019,36(9):151-155,191.
|
[18] |
(Xie Yong, Xiang Wei, Ji Mengzhong, et al. An Application and Analysis of Forecast Housing Rental Based on XGBoost and LightGBM Algorithms[J]. Computer Applications and Software, 2019,36(9):151-155,191.)
|
[19] |
Hu L R, He S J, Han Z X, et al. Monitoring Housing Rental Prices Based on Social Media: An Integrated Approach of Machine-Learning Algorithms and Hedonic Modeling to Inform Equitable Housing Policies[J]. Land Use Policy, 2019,82:657-673.
doi: 10.1016/j.landusepol.2018.12.030
|
[20] |
Parsa A B, Movahedi A, Taghipour H, et al. Toward Safer Highways, Application of XGBoost and SHAP for Real-Time Accident Detection and Feature Analysis[J]. Accident Analysis & Prevention, 2020,136:105405.
doi: 10.1016/j.aap.2019.105405
|
[21] |
Mangalathu S, Hwang S H, Jeon J S. Failure Mode and Effects Analysis of RC Members Based on Machine-learning-based SHapley Additive exPlanations (SHAP) Approach[J]. Engineering Structures, 2020,219:110927.
doi: 10.1016/j.engstruct.2020.110927
|
[22] |
Xu J S, Saleh M, Hatzopoulou M. A Machine Learning Approach Capturing the Effects of Driving Behaviour and Driver Characteristics on Trip-Level Emissions[J]. Atmospheric Environment, 2020,224:117311.
doi: 10.1016/j.atmosenv.2020.117311
|
[23] |
Sánchez-Franco M J, Alonso-Dos-Santos M. Exploring Gender-Based Influences on Key Features of Airbnb Accommodations[J/OL]. Economic Research-Ekonomska Istraživanja, https://doi.org/10.1080/1331677X.2020.1831943.
|
[24] |
Chen T Q, Guestrin C. XGBoost: A Scalable Tree Boosting System[C]// Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016: 785-794.
|
[25] |
朱明, 王春梅, 高翔, 等. XGBoost在卫星网络协调态势预测中的应用[J]. 小型微型计算机系统, 2019,40(12):2561-2565.
|
[25] |
(Zhu Ming, Wang Chunmei, Gao Xiang, et al. Application of XGBoost in the Prediction of Satellite Network Coordination Situation[J]. Journal of Chinese Computer Systems, 2019,40(12):2561-2565.)
|
[26] |
杨贵军, 徐雪, 赵富强. 基于XGBoost算法的用户评分预测模型及应用[J]. 数据分析与知识发现, 2019,3(1):118-126.
|
[26] |
(Yang Guijun, Xu Xue, Zhao Fuqiang. Predicting User Ratings with XGBoost Algorithm[J]. Data Analysis and Knowledge Discovery, 2019,3(1):118-126.)
|
[27] |
丁勇, 陈夕, 蒋翠清, 等. 一种融合网络表示学习与XGBoost的评分预测模型[J]. 数据分析与知识发现, 2020,4(11):52-62.
|
[27] |
(Ding Yong, Chen Xi, Jiang Cuiqing, et al. A Rating Prediction Model by Integrating Network Representation Learning and XGBoost[J]. Data Analysis and Knowledge Discovery, 2020,4(11):52-62.)
|
[28] |
Lundberg S M, Lee S I. A Unified Approach to Interpreting Model Predictions[C]// Proceedings of Annual Conference on Neural Information Processing Systems. 2017: 4765-4774.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|