Study on Noun Phrases of the “N₁+N₂” Structure in Search Engine Query Logs

doi:10.11925/infotech.1003-3513.2010.12.10

New Technology of Library and Information Service (现代图书情报技术), 2010, Vol. 26, Issue 12: 58-63

Information Analysis and Research


Liu Zhijie, Lv Xueqiang, Cheng Tao

Chinese Information Processing Research Center, Beijing Information Science & Technology University, Beijing 100101, China


Abstract

Based on search engine query logs and the characteristics of the corpus itself, this paper gives a comprehensive description of noun phrases of the “N₁+N₂” structure, including the characteristics of each component and the syntactic functions of the phrase. It also presents basic methods for mining and proofreading noun phrases of this type. Analysis of the experimental results further illustrates the important role that phrase research plays in search engines.

Keywords: “N₁+N₂” structure, query log, noun phrase, syntactic function

Received: 2010-10-26; Published: 2011-01-07

H08

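Funding:

This work is one of the research outcomes of the National Social Science Foundation of China project “Research on the Grammatical Theory and Construction Methods of Phrase Dictionaries for Search Engines” (Grant No. 09CYY021).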

Cite this article:

Liu Zhijie, Lv Xueqiang, Cheng Tao. Study on Noun Phrase of “N₁+N₂” Structure in Search Engine Query Logs[J]. New Technology of Library and Information Service, 2010, 26(12): 58-63.

Link to this article:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2010.12.10 or https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2010/V26/I12/58

