融合语义相似度的商务情报链接分析算法研究

doi:10.11925/infotech.1003-3513.2013.03.05

现代图书情报技术

2013, Vol. 29

Issue (3): 27-32 https://doi.org/10.11925/infotech.1003-3513.2013.03.05

知识组织与知识管理

本期目录 | 过刊浏览 | 高级检索

融合语义相似度的商务情报链接分析算法研究

何超, 张玉峰

武汉大学信息管理学院武汉 430072

Research on Business Intelligence Link Analysis Algorithm Combining Semantic Similarity

He Chao, Zhang Yufeng

School of Information Management, Wuhan University, Wuhan 430072, China

摘要
参考文献
相关文章
Metrics

全文: PDF (667 KB) HTML
输出: BibTeX | EndNote (RIS)

摘要针对传统链接分析算法存在的链接丢失问题和语义异构问题,设计基于语义相似度的商务情报链接分析算法。该算法综合应用锚链文本和锚链结构信息解决链接丢失问题,应用领域本体提供语义知识解决语义异构问题。实验结果表明,该算法能够显著提高商务情报分析结果的准确性。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	何超
	张玉峰

关键词 ：商务情报, 语义相似度, 链接分析

Abstract：A business intelligence link analysis algorithm based on semantic similarity is designed for the problem of link lost and semantic heterogeneity in the traditional link analysis algorithm. The algorithm exploits anchor chain text and structure synthetically to solve link lost, uses semantic knowledge presented by domain Ontology to solve semantic heterogeneity. The experiment results show that the model and the algorithm achieve a good expected effect and can raise the accuracy and efficiency of business intelligence analysis.

Key words： Business intelligence Semantic similarity Link analysis

收稿日期: 2013-01-31 出版日期: 2013-05-14

G350

基金资助:本文系教育部博士研究生学术新人奖基金项目“基于数据挖掘的商务情报分析方法研究”(项目编号:5052012104001)和国家自然科学基金项目“企业竞争情报智能分析模型与方法研究”(项目编号:71073121)的研究成果之一。

引用本文:

何超, 张玉峰. 融合语义相似度的商务情报链接分析算法研究[J]. 现代图书情报技术, 2013, 29(3): 27-32.
He Chao, Zhang Yufeng. Research on Business Intelligence Link Analysis Algorithm Combining Semantic Similarity. New Technology of Library and Information Service, 2013, 29(3): 27-32.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2013.03.05 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2013/V29/I3/27

[1] Getoor L. Link Mining: A New Data Mining Challenge[J]. ACM SIGKDD Explorations Newsletter, 2003, 5(1): 84-89.
[2] Zhang X, Zhao C F, Wang P, et al. Mining Link Patterns in Linked Data[C]. In: Proceedings of WAIM. 2012: 83-94.
[3] Thelwall M. 链接分析:信息科学的研究方法[M].孙建军,李江,张煦译.南京:东南大学出版社,2009:3-7.(Thelwall M. Link Analysis: An Information Science Approach[M].Translated by Sun Jianjun, Li Jiang, Zhang Xu. Nanjing: Southeast University Press, 2009: 3-7.)
[4] Kleinberg J M. Authoritative Sources in a Hyperlinked Environment [C]. In: Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms(SODA ’98). 1998: 668-677.
[5] Brin S, Page L. The Anatomy of a Large-scale Hypertextual Web Search Engine[J]. Computer Networks and ISDN Systems, 1998, 30(1): 107-117.
[6] Stattner E, Collard M. MAX-FLMin: An Approach for Mining Maximal Frequent Links and Generating Semantical Structures from Social Networks[C]. In: Proceedings of the 23rd International Conference on DEXA, Vienna, Austria. 2012: 468-483.
[7] 马丽.融入语义相似度的HITS算法研究及实现[D].南京:南京理工大学, 2011.(Ma Li. Research and Realization of HITS Algorithm Integrating Semantic Similarity[D]. Nanjing: Nanjing University of Science and Technology, 2011.)
[8] 马慧芳,史忠植.一种高效稳定的链接分析算法[J]. 计算机应用研究 ,2011,28(2):488-491.(Ma Huifang, Shi Zhongzhi. Efficient and Stable Link Analysis Algorithm[J]. Application Research of Computers, 2011,28(2):488-491.)
[9] 郑庆华,刘均,田锋,等.Web知识挖掘:理论、方法与应用[M].北京:科学出版社,2010:78-102.(Zheng Qinghua, Liu Jun, Tian Feng, et al. Web Knowledge Mining: The Theory, Method and Application[M]. Beijing: Science Press, 2010:78-102.)
[10] 张玉峰,周磊,王志芳,等.领域本体构建与可视化展示研究[J]. 情报理论与实践 ,2012,35(10):95-98.(Zhang Yufeng, Zhou Lei, Wang Zhifang, et al. Research on Domain Ontology Construction and Visualization [J]. Information Studies: Theory & Application, 2012,35(10):95-98.)
[11] 张乃洲,李石君,余伟,等.使用联合链接相似度评估爬取Web资源[J]. 计算机学报 ,2010,33(12):2267-2280.(Zhang Naizhou, Li Shijun, Yu Wei, et al. Using a Joint Link Similarity Evaluation Based Method for Crawling the Resources on Web[J].Chinese Journal of Computers, 2010,33(12):2267-2280.)
[12] 王志芳.基于领域本体的文本挖掘算法与实验研究[D].武汉:武汉大学,2011.(Wang Zhifang. Research on Text Mining Algorithm and Experiment Based on Domain Ontology[D]. Wuhan: Wuhan University, 2011.)
[13] Pant G, Menczer F. Topical Crawling for Business Intelligence [C]. In: Proceedings of the 7th European Conference on Research and Advanced Technology for Digital Libraries. 2003: 233-244.)

[1]	严娇,马静,房康. 基于融合共现距离的句法网络下文本语义相似度计算 ^*[J]. 数据分析与知识发现, 2019, 3(12): 93-100.
[2]	史昱天, 朱庆华, 赵宇翔, 陈晓威. 基于链接分析法对国内网络直播平台综合影响力的评价研究^*[J]. 数据分析与知识发现, 2017, 1(9): 40-48.
[3]	陈二静, 姜恩波. 文本相似度计算方法研究综述[J]. 数据分析与知识发现, 2017, 1(6): 1-11.
[4]	翟东升, 蔡文浩, 张杰, 李振飞. 改进的中文商标语义相似度计算方法研究[J]. 数据分析与知识发现, 2017, 1(11): 19-28.
[5]	刘健,毕强,刘庆旭,王福. *数字文献资源内容服务推荐研究^——基于本体规则推理和语义相似度计算**[J]. 现代图书情报技术, 2016, 32(9): 70-77.
[6]	巴志超,李纲,朱世伟. 基于语义网络的研究兴趣相似性度量方法^*[J]. 现代图书情报技术, 2016, 32(4): 81-90.
[7]	毕强, 刘健, 鲍玉来. 基于语义相似度的文本聚类研究^*[J]. 数据分析与知识发现, 2016, 32(12): 9-16.
[8]	刘怀亮, 杜坤, 秦春秀. 基于知网语义相似度的中文文本分类研究[J]. 现代图书情报技术, 2015, 31(2): 39-45.
[9]	范雪雪, 王志荣, 徐晤, 梁银, 马小虎. 基于医学本体的术语相似度算法研究[J]. 现代图书情报技术, 2015, 31(12): 57-64.
[10]	胡吉明, 肖璐. 向量空间模型文本建模的语义增量化改进研究[J]. 现代图书情报技术, 2014, 30(10): 49-55.
[11]	孙海霞, 李军莲, 李丹亚, 吴英杰, 李晓瑛. 基于CMeSH语义系统的领域自由词-主题词语义映射研究[J]. 现代图书情报技术, 2013, 29(11): 46-51.
[12]	马军红. 分阶段融合的文本语义相似度计算方法[J]. 现代图书情报技术, 2013, 29(10): 20-26.
[13]	王莉. 基于关键词链的动态分面研究[J]. 现代图书情报技术, 2012, 28(7): 76-81.
[14]	邢美凤. 科技文献关键词冗余解决方案研究[J]. 现代图书情报技术, 2012, 28(1): 34-39.
[15]	徐健张智雄肖卓邓昭俊. 科技术语语义相似度计算方法研究综述[J]. 现代图书情报技术, 2010, 26(7/8): 51-57.

Viewed

Full text

Abstract

Cited

Shared

Discussed