Research on Extracting E-mail Information Based on Structure of MIME Mail
Hu Yan, Teng Guifa, Dong Sufen, Wang Dan
(School of Information Science and Technology, Agricultural University of Hebei, Baoding 071001, China)
Abstract To extract E-mail information accurately, the structure and content features of E-mail are analyzed, and an E-mail preprocessing system based on the structure of MIME mail is designed. Using block processing and feature identification, the system copes with the informal style of E-mail text and filters out reply lines and advertising lines. The system achieves the expected goal of extracting E-mail information quickly and accurately.
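The block processing and line filtering described in the abstract can be illustrated with a minimal sketch, assuming Python's standard `email` package. The marker lists `REPLY_PREFIXES` and `AD_MARKERS` and all function names are hypothetical placeholders; the abstract does not state which features the authors' system actually uses.

```python
from email import message_from_string

# Hypothetical features: the abstract does not list the real ones.
REPLY_PREFIXES = (">",)                        # quoted reply lines
AD_MARKERS = ("unsubscribe", "sent from my")   # advertising / footer lines


def extract_text_blocks(msg):
    """Walk the MIME tree and return each decoded text/plain body as a block."""
    blocks = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # containers hold no text themselves
        if part.get_content_type() != "text/plain":
            continue  # skip HTML, images, and other non-text parts
        if part.get_content_disposition() == "attachment":
            continue  # skip attached text files
        payload = part.get_payload(decode=True)  # undo base64 / quoted-printable
        charset = part.get_content_charset() or "utf-8"
        blocks.append(payload.decode(charset, errors="replace"))
    return blocks


def filter_lines(text):
    """Drop blank lines, reply lines, and lines containing an ad marker."""
    kept = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith(REPLY_PREFIXES):
            continue
        if any(marker in stripped.lower() for marker in AD_MARKERS):
            continue
        kept.append(stripped)
    return kept


def extract_email_text(raw):
    """Parse a raw message and return the retained body lines."""
    msg = message_from_string(raw)
    lines = []
    for block in extract_text_blocks(msg):
        lines.extend(filter_lines(block))
    return lines
```

Calling `extract_email_text(raw)` on the raw text of a message returns the body lines that survive both filters. Simple prefix and keyword matching stands in here for the feature identification step, which in the authors' system may be considerably richer.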
Received: 04 January 2008
Published: 25 May 2008
Corresponding Authors:
Hu Yan
E-mail: katehu_2001@163.com
About authors: Hu Yan, Teng Guifa, Dong Sufen, Wang Dan