New Technology of Library and Information Service  2007, Vol. 2 Issue (6): 33-37    DOI: 10.11925/infotech.1003-3513.2007.06.08
Study on Chinese Document Copy Detection
Geng Chong, Xue Dejun
(China Academic Journal (CD) Publishing House, Beijing 100084, China)

This paper discusses different methods of Document Copy Detection (DCD), compares the features of these techniques using an experimental system, and builds a DCD system based on the CNKI repositories. Finally, the paper gives a number of recommendations for further work in the field of DCD.

Key words: Document copy detection; Plagiarism detection
Received: 24 April 2007      Published: 25 June 2007


Corresponding author: Geng Chong
About the authors: Geng Chong, Xue Dejun

Cite this article:

Geng Chong, Xue Dejun. Study on Chinese Document Copy Detection. New Technology of Library and Information Service, 2007, 2(6): 33-37.


