New Technology of Library and Information Service  2009, Vol. 25 Issue (4): 50-56    DOI: 10.11925/infotech.1003-3513.2009.04.10
The POS Mining Study on Search Engine’s Query Log
Lai Maosheng  Qu Peng
(Department of Information Management, Peking University, Beijing 100871, China)
The paper analyzes the query logs of the Sogou search engine from March 2007. POS tagging is applied to the queries to identify the characteristics of the high-frequency POS results. Web users rely primarily on nouns in their queries, with verbs in a complementary role; other parts of speech seldom appear. The function words (empty words) of natural language, such as “的”, do not appear very often in the high-frequency POS results. Web queries differ from natural language in syntax to a certain degree, yet share some characteristics with it. Web users use nouns to perform concept-focused retrieval, and keywords remain the primary method of searching the Web. The high-frequency results of POS tagging partially obey Zipf’s law.
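As an illustration of the kind of analysis the abstract describes, the following minimal sketch counts POS patterns over already-tagged queries and estimates how closely their frequencies follow Zipf’s law. It is not the authors’ method: the function names `pos_pattern_counts` and `zipf_slope`, the simplified tag set (`n`, `v`), and the hand-tagged toy queries are all assumptions made for illustration. A slope near -1 on the log-log rank-frequency plot would be consistent with Zipf’s law.

```python
import math
from collections import Counter


def pos_pattern_counts(tagged_queries):
    """Count how often each POS sequence (e.g. 'n+v') occurs across queries.

    Each query is a list of (token, tag) pairs produced by some POS tagger.
    """
    return Counter("+".join(tag for _, tag in query) for query in tagged_queries)


def zipf_slope(frequencies):
    """Least-squares slope of log(frequency) against log(rank).

    Needs at least two frequencies; a value close to -1 suggests Zipf-like behavior.
    """
    freqs = sorted(frequencies, reverse=True)
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Hypothetical, hand-tagged toy queries (simplified tags: n = noun, v = verb).
toy_queries = [
    [("北京", "n"), ("天气", "n")],
    [("下载", "v"), ("音乐", "n")],
    [("手机", "n")],
    [("电影", "n")],
    [("地图", "n"), ("下载", "v")],
    [("小说", "n")],
]

counts = pos_pattern_counts(toy_queries)
slope = zipf_slope(counts.values())
print(counts.most_common())
print(f"log-log slope: {slope:.3f}")
```

On a real log, the tagged queries would come from a Chinese POS tagger, and the slope would be fitted only over the high-frequency patterns, since the abstract reports that Zipf’s law holds only partially.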

Key words: Log mining; Part-of-speech tagging; Language behavior; POS distribution; Query syntax
Received: 16 February 2009      Published: 25 April 2009


Corresponding Author: Qu Peng
About authors: Lai Maosheng, Qu Peng

Cite this article:

Lai Maosheng, Qu Peng. The POS Mining Study on Search Engine’s Query Log. New Technology of Library and Information Service, 2009, 25(4): 50-56.

