[Objective] This paper proposes a Document Influence Model (DIM) based on Dynamic Automatic Time, aiming to solve the time window dividing issue of dynamic topic model. [Methods] Firstly, we processed the text corpora with the traditional LDA model and word vector model. Secondly, we constructed a comprehensive index reflecting the differences between time windows and similarity within the time windows. Finally, we built a new model based on this index and conducted an empirical study with news corpus of the “Belt and Road” International Cooperation Summit Forum. [Results] The proposed model could quickly and effectively divide the time windows, which not only ensured the comparability of the topics under different windows, but also evaluated the influence factors of the document. [Limitations] We built the similarity index of time windows based on the traditional LDA model, which could be improved by the latest LDA models. [Conclusions] The new model is able to divide the time series text effectively, which improves the performance of traditional dynamic topic model.
王婷婷, 王宇, 秦琳杰. 基于动态主题模型的时间窗口划分研究*[J]. 数据分析与知识发现, 2018, 2(10): 54-64.
Wang Tingting,Wang Yu,Qin Linjie. Dividing Time Windows of Dynamic Topic Model. Data Analysis and Knowledge Discovery, 2018, 2(10): 54-64.
(Liao Junhua, Sun Keying, Zhong Lixia.Study on a Hot Topic Analysis System Based on Time Sliced Topic Model[J]. Library and Information Service, 2003, 57(9): 96-102, 118.)
doi: 10.7536/j.jssn.0252-3116.2013.09.016
[3]
Wang X R, McCallum A. Topics over Time: A Non-Markov Continuous-Time Model of Topical Trends[C]//Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY, USA: ACM, 2006: 424-433.
[4]
Ding W, Chen C.Dynamic Topic Detection and Tracking: A Comparison of HDP, C-word, and Cocitation Methods[J]. Journal of the Association for Information Science & Technology, 2014, 65(10): 2084-2097.
doi: 10.1002/asi.23134
[5]
Blei D M, Lafferty J D.Dynamic Topic Models[C]// Proceedings of the 23rd International Conference on Machine Learning. ACM, 2006: 113-120.
[6]
Derntl M, Günnemann N, Klamma R.A Dynamic Topic Model of Learning Analytics Research[C]// Proceedings of International Symposium on Instrumentation and Measurement, Sensor Network and Automation. IEEE, 2013: 436-439.
[7]
Ha T, Beijnon B, Kim S, et al.Examining User Perceptions of Smartwatch Through Dynamic Topic Modeling[J]. Telematics and Informatics, 2017, 34(7): 1262-1273.
doi: 10.1016/j.tele.2017.05.011
(Qi Yashuang,Zhu Na,Zhai Yujia.A Comparative Study on Topic Heats Evolution in the Field of Information Science Between the Domestic and Foreign Research Based on DTM[J]. Library and Information Service, 2016, 60(16): 99-109.)
(Li Chaoxiong, Huang Faliang, Wen Xiaoqian, et al.Evolution Analysis Method of Microblog Topic-Sentiment Based on Dynamic Topic Sentiment Combining Model[J]. Journal of Computer Applications, 2015, 35(10): 2905-2910.)
doi: 10.11772/j.issn.1001-9081.2015.10.2905
(Li Hui, Hu Yunfeng.Analyzing Online Reviews with Dynamic Sentiment Topic Model[J]. Data Analysis and Knowledge Discovery, 2017, 1(9): 74-82.)
[13]
Gerrish S M, Blei D M.A Language-based Approach to Measuring Scholarly Impact[C]//Proceedings of International Conference on Machine Learning. DBLP, 2010: 375-382.