New Technology of Library and Information Service (现代图书情报技术)  2007, Vol. 2 Issue (6): 33-37     https://doi.org/10.11925/infotech.1003-3513.2007.06.08
Section: Knowledge Organization and Knowledge Management
Study on Chinese Document Copy Detection
Geng Chong (耿崇), Xue Dejun (薛德军)
(China Academic Journal (CD) Publishing House, Beijing 100084, China)
Abstract

This paper surveys methods of document copy detection (DCD), compares their technical characteristics, and uses an experimental system to demonstrate the strengths and weaknesses of each method. Building on the large-scale CNKI repositories, it implements a copy detection system for Chinese documents. Finally, it analyzes open problems in current document copy detection and outlines follow-up work.

Key words: Document copy detection; Plagiarism detection
Received: 2007-04-24      Published: 2007-06-25
CLC number: TP391.1
Corresponding author: Geng Chong     E-mail: gengchong@gmail.com
About the authors: Geng Chong, Xue Dejun
Cite this article:
Geng Chong, Xue Dejun. Study on Chinese Document Copy Detection. New Technology of Library and Information Service, 2007, 2(6): 33-37.
Link to this article:
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2007.06.08      or      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2007/V2/I6/33

