Please wait a minute...
New Technology of Library and Information Service  2007, Vol. 2 Issue (6): 33-37    DOI: 10.11925/infotech.1003-3513.2007.06.08
Current Issue | Archive | Adv Search |
Study on Chinese Document Copy Detection
Geng Chong   Xue Dejun
(China Academic Journal (CD) Publishing House,    Beijing    100084,China)
Download: PDF(503 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

This paper discusses different methods of Document Copy Detection(DCD), compares with each other in the features of DCD techniques through experiment system, and builds a DCD system based on the CNKI repositories. At last, the paper gives a number of recommendations for further work in the field of DCD.

Key wordsDocument copy detection      Plagiarism detection     
Received: 24 April 2007      Published: 25 June 2007
: 

TP391.1

 
Corresponding Authors: Geng Chong     E-mail: gengchong@gmail.com
About author:: Geng Chong,Xue Dejun

Cite this article:

Geng Chong,Xue Dejun. Study on Chinese Document Copy Detection. New Technology of Library and Information Service, 2007, 2(6): 33-37.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2007.06.08     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2007/V2/I6/33

1Brin S, Davis J, Garcia-Molina H. Copy Detection Mechanisms for Digital Documents. In: Proc.SIGMOD'95, New York: ACM Press, 1995: 398-409
2Shivakumar N, Garcia-Molina H.SCAM:A Copy Detection Mechanism for Digital Documents. http://stanford.edu/pub/papers/scam.ps(Accessed Apr.30,2007)
3Si A, Leong H V, Lau R W H. CHECK: A Document Plagiarism Detection System. Proceedings of the ACM Symposium on Applied Computing, New York: ACM Press, 1997:70-77
4Kang N,  Gelbukh A F, Han S Y. PPChecker: Plagiarism Pattern  Checkerin DocumentCopy Detection. Ninth International Conference on TSD. Springer Berlin/Heidelberg,2006: 661-667
5The Plagiarism Resource Site Charlottesville,Virginia.http://plagiarism.phys.virginia.edu/(Accessed Apr.30,2007)
6Meter Project.http://www.dcs.shef.ac.uk/nlp/meter/(Accessed Apr.30,2007)
7鲍军鹏, 沈钧毅等. 自然语言文档复制检测研究综述. 软件学报, 2003,14(10):1753-1761
8金博, 史彦军等. 中文文档复制检测系统研究 .计算机工程, 2005,31(19): 79-81
9史彦军, 滕弘飞等. 抄袭论文识别研究与进展. 大连理工大学学报, 2005,45(1):50-57
10鲍军鹏, 沈钧毅等. 一个基于网格的文本复制检测系统. 微电子学与计算机, 2004,21(9):7-10
11Forman G, Eshghi K, Chiocchetti S. Finding Similar Files in Large Document Repositories .Conference on Knowledge Discovery in Data, New York: ACM Press,  2005: 394-400
12Turnitin.http://www.turnitin.com(Accessed Apr.30,2007)
13Copyscape.http://www.copyscape.com(Accessed Apr.30,2007)
14稿件检查软件出炉,协助美国新闻界打击内容剽.http://digi.it.sohu.com/20050907/n240352816.shtml (Accessed Apr.30,2007)
15Chen X, Francia B, Li M, et al. Shared Information and Program Plagiarism Detection. IEEE Trans. Inform. Theory, 2004,50(7): 1545-1551
16Song Q B, Shen J Y. On Illegal Coping and Distributing Detection Mechanism for Digital Goods. Journal of Computer Research and Development, 2001,38(1):121-125
17Welcome to Glatt Plagiarism Services, Inc.http://www.plagiarism.com(Accessed Apr.30,2007)
18Sven Meyer zu Eissen, Benno Stein. Intrinsic Plagiarism Detection, (ECIR-06), Springer, 2006: 565-569
19张庆国, 薛德军等.海量数据集上基于特征组合的关键词自动抽取. 情报学报,2006,25(5):587-593

[1] Liu Huoyu, Wang Dongbo. Research and Implementation of Data Preprocessing Oriented to Paper Similarity Detection[J]. 现代图书情报技术, 2015, 31(5): 50-56.
[2] Qin Xinguo. Research on the Copy Detection Based on the Similarity of Sentences[J]. 现代图书情报技术, 2007, 2(11): 63-66.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn