New Technology of Library and Information Service  2011, Vol. 27 Issue (7/8): 104-109    DOI: 10.11925/infotech.1003-3513.2011.07-08.17
Topic Evolution Based on Seminal Document and Topic Model
Shan Bin, Li Fang
School of Electronic Information and Electrical Engineering, Shanghai Jiaotong University, Shanghai 200240, China
Abstract  This paper presents a new method for automatically inferring LDA topic evolution based on seminal documents. The semantic distribution of the seminal documents is used to guide the model of the successive time slice and to link topics between consecutive time slices. Experiments on the NIPS dataset and on Chinese newswire coverage of the NPC and CPPCC show that the method not only recovers correct evolutions in various forms but also avoids linking related topics that have no evolutionary relationship.
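The linking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not give the formula, so the sketch assumes that each seminal document has a topic mixture under the model of slice t and under the model of slice t+1, and that the link strength between two topics is the co-occurrence of those mixtures. The function name `link_topics`, the `threshold` parameter, and the toy data are all hypothetical.

```python
import numpy as np


def link_topics(theta_prev, theta_next, threshold=0.5):
    """Link topics of slice t to topics of slice t+1 via seminal documents.

    Assumption (not from the paper): link strength is the co-occurrence of
    topic weights over the shared seminal documents.

    theta_prev: (D, K1) topic mixtures of D seminal documents, slice-t model.
    theta_next: (D, K2) topic mixtures of the same documents, slice-(t+1) model.
    Returns the row-normalised (K1, K2) strength matrix and the list of
    (i, j) topic pairs whose strength reaches the threshold.
    """
    theta_prev = np.asarray(theta_prev, dtype=float)
    theta_next = np.asarray(theta_next, dtype=float)
    if theta_prev.shape[0] != theta_next.shape[0]:
        raise ValueError("both matrices must describe the same documents")

    # co[i, j] accumulates, over documents, weight on topic i times weight on topic j.
    co = theta_prev.T @ theta_next
    row_sums = co.sum(axis=1, keepdims=True)
    # Normalise each row so strengths for one slice-t topic sum to 1.
    strength = np.divide(co, row_sums, out=np.zeros_like(co), where=row_sums > 0)

    links = [(int(i), int(j)) for i, j in zip(*np.where(strength >= threshold))]
    return strength, links


# Toy data: three seminal documents, two topics in slice t, three in slice t+1.
theta_prev = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
theta_next = [[0.85, 0.10, 0.05], [0.70, 0.20, 0.10], [0.05, 0.05, 0.90]]
strength, links = link_topics(theta_prev, theta_next)
print(links)
```

In this toy setting the middle topic of the later slice receives no link, which mirrors the idea of not connecting topics that lack an evolutionary relationship; the threshold value is an arbitrary choice for illustration.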
Key words: LDA; Topic evolution; Seminal document; Topic model
Received: 11 May 2011      Published: 09 October 2011
TP393

Cite this article:

Shan Bin, Li Fang. Topic Evolution Based on Seminal Document and Topic Model. New Technology of Library and Information Service, 2011, 27(7/8): 104-109.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2011.07-08.17     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2011/V27/I7/8/104
