|
|
Automatic Transferring Government Website E-Mails Based on Text Classification |
Wang Sidi1,2,Hu Guangwei1,2( ),Yang Siyu1,2,Shi Yun1 |
1School of Information Management, Nanjing University, Nanjing 210023, China 2Government Data Resources Institution of Nanjing University, Nanjing 210023, China |
|
|
Abstract [Objective] This research proposes a method to automatically transferring e-mails received by government websites, aiming to reduce labor costs of managing public email boxes. [Methods] First, we chose four representative classification algorithms, including Naïve Bayes, Decision Tree, Random Forest and Multi-Layer Perception, and compared their classification resutls of e-mails received by the websites of Mayor’s Offices in Beijing, Hefei and Shenzhen. Then, we designed a method of automatically transferring these emails. Finally, we gave suggestions on the application of our method in the real world settings. [Results] Multi-Layer Perception yielded the best performance in our study, with the macro average precision and recall reaching more than 0.85, and all micro average indicators reaching more than 0.93. Naïve Bayes took the second place. Random Forest had a high macro average precision, but poor recall score. Decision Tree had an average precision and recall results. [Limitations] We did not examine the impacts of skewed distribution of received emails and eliminated the departments receiving few emails. [Conclusions] The proposed method optimizes the operation of public e-mails, which improves the efficiency of online government and reduces administrative costs.
|
Received: 31 October 2019
Published: 07 July 2020
|
|
Corresponding Authors:
Hu Guangwei
E-mail: hugw@nju.edu.cn
|
[1] |
孙宗锋, 赵兴华. 网络情境下地方政府政民互动研究——基于青岛市市长信箱的大数据分析[J]. 电子政务, 2019(5):12-26.
|
[1] |
( Sun Zongfeng, Zhao Xinghua. A Study on the Interaction Between the Government and the People in the Internet-Based on the Big Data Analysis of the Mayor’s Mailbox of Qingdao[J]. E-Government, 2019(5):12-26.)
|
[2] |
于君博, 李慧龙, 于书鳗. “网络问政”中的回应性——对K市领导信箱的一个探索性研究[J]. 长白学刊, 2018(2):65-74.
|
[2] |
( Yu Junbo, Li Huilong, Yu Shuman. Responsiveness in “Governing Online”—An Exploratory Study on K City’s Leader Mailbox[J]. Changbai Journal, 2018(2):65-74.)
|
[3] |
郑俊田, 郜媛莹, 顾清. 地方政府权力清单制度体系建设的实践与完善[J]. 中国行政管理, 2016(2):6-9.
|
[3] |
( Zheng Juntian, Gao Yuanying, Gu Qing. Practice and Perfection of Local Governmental Administrative Power List System Construction[J]. Chinese Public Administration, 2016(2):6-9.)
|
[4] |
王珺. 基于文本特征识别的电子档案自动归类系统研究[J]. 现代电子技术, 2019,42(18):45-49.
|
[4] |
( Wang Jun. Research on Electronic Archive Automatic Classification System Based on Text Feature Recognition[J]. Modern Electronics Technique, 2019,42(18):45-49.)
|
[5] |
李湘东, 徐朋, 黄莉, 等. 基于KNN算法的文本自动分类方法研究——以学术期刊栏目自动归类为例[J]. 图书情报知识, 2010(4):71-76.
|
[5] |
( Li Xiangdong, Xu Peng, Huang Li, et al. Research of Journals Manuscript Categorization Based on KNN Algorithm[J]. Document, Information & Knowledge, 2010(4):71-76.)
|
[6] |
李成铭. 基于文本特征提取技术的在线人职匹配研究及应用[D]. 成都:电子科技大学, 2017.
|
[6] |
( Li Chengming. Research and Application of Talent Job Online Matching Based on Text Feature Extraction Technology[D]. Chengdu:University of Electronic Science and Technology of China, 2017.)
|
[7] |
王若佳, 张璐, 王继民. 基于机器学习的在线问诊平台智能分诊研究[J]. 数据分析与知识发现, 2019,3(9):88-97.
|
[7] |
( Wang Ruojia, Zhang Lu, Wang Jimin. Automatic Triage of Online Doctor Services Based on Machine Learning[J]. Data Analysis and Knowledge Discovery, 2019,3(9):88-97.)
|
[8] |
Kim K, Zzang S Y. Trigonometric Comparison Measure: A Feature Selection Method for Text Categorization[J]. Data & Knowledge Engineering, DOI: 10.1016/j.datak.2018.10.003.
doi: 10.1016/j.datak.2011.03.009
pmid: 21765568
|
[9] |
Ghareb A S, Bakara A A Al-Radaideh Q A, et al. Enhanced Filter Feature Selection Methods for Arabic Text Categorization[J]. International Journal of Information Retrieval Research (IJIRR), 2018,8(2):1-24.
|
[10] |
Hartmann J, Huppertz J, Schamp C, et al. Comparing Automated Text Classification Methods[J]. International Journal of Research in Marketing, 2019,36(1):20-38.
doi: 10.1016/j.ijresmar.2018.09.009
|
[11] |
田欢, 李红莲, 吕学强, 等. 基于改进BP神经网络的学术活动文本分类[J]. 北京信息科技大学学报(自然科学版), 2018,33(5):38-44.
|
[11] |
( Tian Huan, Li Honglian, Lv Xueqiang, et al. Text Categorization of Academic Activities Based on an Improved BP Neural Network[J]. Journal of Beijing Information Science & Technology University, 2018,33(5):38-44.)
|
[12] |
刘浏, 王东波. 基于论文自动分类的社科类学科跨学科性研究[J]. 数据分析与知识发现, 2018,2(3):30-38.
|
[12] |
( Liu Liu, Wang Dongbo. Identifying Interdisciplinary Social Science Research Based on Article Classification[J]. Data Analysis and Knowledge Discovery, 2018,2(3):30-38.)
|
[13] |
Gauld R, Flett J, McComb S, et al. How Responsive are Government Agencies When Contacted by Email? Findings from a Longitudinal Study in Australia and New Zealand[J]. Government Information Quarterly, 2016,33(2):283-290.
doi: 10.1016/j.giq.2016.03.004
|
[14] |
李慧龙, 于君博. 数字政府治理的回应性陷阱——基于东三省“地方领导留言板”的考察[J]. 电子政务, 2019(3):72-87.
|
[14] |
( Li Huilong, Yu Junbo. The Responsive Trap of Digital Government Governance-Based on the Investigation of “Message Board of Local Leaders” in Three Northeastern Provinces[J]. E-Government, 2019(3):72-87.)
|
[15] |
Ong C S, Wang S W. Managing Citizen-Initiated Email Contacts[J]. Government Information Quarterly, 2009,26(3):498-504.
doi: 10.1016/j.giq.2008.07.005
|
[16] |
胡佳妮, 徐蔚然, 郭军, 等. 中文文本分类中的特征选择算法研究[J]. 光通信研究, 2005(3):44-46.
|
[16] |
( Hu Jiani, Xu Weiran, Guo Jun, et al. Study on Feature Selection Methods in Chinese Text Categorization[J]. Study on Optical Communications, 2005(3):44-46.)
|
[17] |
张志飞, 苗夺谦, 高灿. 基于LDA 主题模型的短文本分类方法[J]. 计算机应用, 2013,33(6):1587-1590.
doi: 10.3724/SP.J.1087.2013.01587
|
[17] |
( Zhang Zhifei, Miao Duoqian, Gao Can. Short Text Classification Using Latent Dirichlet Allocation[J]. Journal of Computer Applications, 2013,33(6):1587-1590.)
doi: 10.3724/SP.J.1087.2013.01587
|
[18] |
Salton G, Wong A, Yang C S. A Vector Space Model for Automatic Indexing[J]. Communications of the ACM, 1975,18(11):613-620.
doi: 10.1145/361219.361220
|
[19] |
Manning C, Raghavan P, Schütze H. Introduction to Information Retrieval[M]. Cambridge University Press, 2008.
|
[20] |
Breiman L. Random Forests[J]. Machine Learning, 2001,45(1):5-32.
doi: 10.1023/A:1010933404324
|
[21] |
Breiman L, Friedman J, Stone C J, et al. Classification and Regression Trees[M]. CRC Press, 1984.
|
[22] |
Hinton G E. Connectionist Learning Procedures[J]. Artificial Intelligence, 1989,40(1-3):185-234.
doi: 10.1016/0004-3702(89)90049-0
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|