Please wait a minute...
Advanced Search
现代图书情报技术  2012, Vol. 28 Issue (4): 61-67     https://doi.org/10.11925/infotech.1003-3513.2012.04.10
  情报分析与研究 本期目录 | 过刊浏览 | 高级检索 |
科技文献话题演化研究
贺亮, 李芳
上海交通大学计算机科学与工程系 上海 200240
Topic Evolution in Scientific Literature
He Liang, Li Fang
Department of Computer Science & Engineering, Shanghai Jiaotong University, Shanghai 200240, China
全文: PDF (1365 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 提出一种研究话题演化的方法,利用LDA话题模型抽取科技文献的话题,通过计算话题的强度和特征词,研究话题的演化趋势。对NIPS 论文集与ACL论文集进行实验,结果显示了机器学习领域以及计算语言学领域的一些发展状况,从而验证该方法的可行性。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
贺亮
李芳
关键词 话题模型趋势分析话题演化    
Abstract:This paper uses LDA model to generate topics from the scientific literature,then calculates the strength and feature key to find the evolution trends of topics. The experiments on NIPS anthology and ACL anthology show the trends of machine learning and computational linguistics, and also prove the feasibility of the proposed calculating method.
Key wordsTopic model    Trend analysis    Topic evolution
收稿日期: 2012-01-30      出版日期: 2012-05-20
: 

TP391

 
基金资助:

本文系国家自然科学基金项目“新闻话题线索与主题的探测研究”(项目编号:60873134)的研究成果之一。

引用本文:   
贺亮, 李芳. 科技文献话题演化研究[J]. 现代图书情报技术, 2012, 28(4): 61-67.
He Liang, Li Fang. Topic Evolution in Scientific Literature. New Technology of Library and Information Service, 2012, 28(4): 61-67.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2012.04.10      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2012/V28/I4/61
[1] Blei D M, Ng A Y,Jordan M I. Latent Dirichlet Allocation[J]. The Journal of Machine Learning Research, 2003, 3:993-1022.

[2] Blei D M, Lafferty J D. A Correlated Topic Model of Science[J]. The Annals of Applied Statistics, 2007,1(1): 17-35.

[3] Blei D M,Lafferty J D. Dynamic Topic Models[C]. In: Proceedings of the 23rd International Conference on Machine Learning. 2006: 113-120.

[4] Rosen-Zvi M, Grifiths T, Steyvers M,et al. The Author-Topic Model for Authors and Documents[C]. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. 2004.

[5] Tang J, Zhang J, Yao L M,et al. ArnetMiner: Extraction and Mining of Academic Social Networks[C].In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD).2008:990-998.

[6] Daud A, Li J Z, Zhou L Z, et al. Exploiting Temporal Authors Interests via Temporal-Author-Topic Modeling[C].In:Proceedings of the 5th International Conference on Advanced Data Mining and Applications.Berlin,Heidelberg:Springer-Verlag,2009:435-443.

[7] He Q, Chen B,Pei J,et al. Detecting Topic Evolution in Scientific Literature: How Can Citations Help[C]. In:Proceedings of the 18th ACM Conference on Information and Knowledge Management. 2009:957-966.

[8] Jo Y, Lagoze C, Giles C L. Detecting Research Topics via the Correlation Between Graphs and Texts[C]. In:Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2007:370-379.

[9] Wang X R,McCallum A. Topic Over Time: A Non-Markov Continuous-Time Model of Topical Trends[C]. In:Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2006.

[10] Griffiths T L,Steyvers M. Finding Scientific Topics[J]. Proceeding of the National Academy of Science,2004,101(S1):5228-5235.

[11] Hall D, Jurafsky D, Manning C D. Studying the History of Ideas Using Topic Models[C]. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing.2008:363-371.

[12] Ramage D, Rosen E, Chuang J, et al.Topic Modeling for the Social Sciences[C]. In: Proceedings of NIPS 2009 Workshop on Applications for Topic Models: Text and Beyond,Whistler, Canada. 2009.

[13] 楚克明,李芳. 基于LDA话题关联的话题演化[J]. 上海交通大学学报 ,2010,44(11): 1501-1506. (Chu Keming, Li Fang. Topic Evolution Based on LDA and Topic Association[J]. Journal of Shanghai Jiaotong University,2010,44(11): 1501-1506.)

[14] 贺亮,李芳. 基于话题模型的科技文献话题发现和趋势分析[C]. 见: 全国第十一届计算语言学学术会议(CNCCL-2011)论文集 .2011. (He Liang, Li Fang. Topic Discovery and Trend Analysis in Scientific Literature Based on Topic Model[C].In: Proceedings of the 11th Chinese National Conference on Computational Linguistics.2011.)

[15] Advances in Neural Information Processing Systems [DB/OL]. [2011-11-09].http://books.nips.cc.

[16] ACL Anthology[DB/OL].[2011-11-09].http://www.aclweb.org/anthology/.

[17] GibbsLDA + +: A C/C + + Implementation of Latent Dirichlet Allocation[EB/OL]. [2011-11-09].http://gibbslda.sourceforge.net/.
[1] 徐建民,张丽青,王苗. 基于贝叶斯网络的静态话题追踪模型*[J]. 数据分析与知识发现, 2020, 4(2/3): 200-206.
[2] 张佩瑶,刘东苏. 基于词向量和BTM的短文本话题演化分析*[J]. 数据分析与知识发现, 2019, 3(3): 95-101.
[3] 汪鸿沁泠,巴志超,李纲. 微信群会话话题强度计算及演化分析*[J]. 数据分析与知识发现, 2019, 3(2): 33-42.
[4] 李纲,陈思菁,毛进,谷岩松. 自然灾害事件微博热点话题的时空对比分析 *[J]. 数据分析与知识发现, 2019, 3(11): 1-15.
[5] 王璟琦, 李锐, 吴华意. 基于空间自相关的网络舆情话题演化时空规律分析*[J]. 数据分析与知识发现, 2018, 2(2): 64-73.
[6] 单斌, 李芳. 基于种子文档LDA话题的演化研究[J]. 现代图书情报技术, 2011, 27(7/8): 104-109.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn