This paper firstly generalizes the formats of Chinese time words and numerals appearing in the text. Based on them, this paper then sets up a rule sets for recognition, proposes a method about Chinese time words and numerals based on rules and discusses its application value in competitive intelligence analysis as well as machine translation field at last.
高霄云,杨建林 . 基于规则的中文时间词和数词的自动识别算法[J]. 现代图书情报技术, 2007, 2(3): 46-50.
Gao Xiaoyun,Yang Jianlin . Chinese Time Words and Numerals Automatic Segmentation Method Based on Rules. New Technology of Library and Information Service, 2007, 2(3): 46-50.
1余战秋.中文分词技术及其应用初探.电脑知识与技术,2004(32):81-83
2孙茂松,邹嘉彦.汉语自动分词研究评述.当代语言学,2001,3(1):22-32
3温有奎.基于知识元的文本知识标引.情报学报,2006,25(3):282-288
4Regina Barzilay, Noemie Elhadad, and Kathleen R. McKeown. Sentence Ordering in Multidocument Summarization. In: Proceedings of the 1st Human Language Technology Conference. San Diego, California, 2001
5孙广范,宋金平,袁琦.机器翻译中规则和模板的协调方法研究.中文信息学报,2006(20):31-35
6张江.基于规则的分词方法.计算机与现代化,2005,(4):18-20
7郑泽之,张普,杨建国.基于语料库的字母词语自动提取研究.中文信息学报,2005,19(2):78-85