New Technology of Library and Information Service  2007, Vol. 2 Issue (8): 59-62    DOI: 10.11925/infotech.1003-3513.2007.08.14
Study of Statistics-rule Based Hierarchical Web Page Classification
Tan Jinbo1   Yang Xiaojiang2   Li Yi2
1(Educational Technology Department, Shandong Normal University, Jinan 250014, China)
2(Educational Technology Department, Nanjing Normal University, Nanjing 210097, China)
Statistics-based classification methods are commonly used in hierarchical Web classification. However, the precision of statistics-based methods often drops when categories are very similar to each other, because their features overlap. By the nature of hierarchical Web classification, categories sharing the same parent (e.g., leaf categories in the hierarchy) are often very similar to each other. To improve classification precision, the paper proposes using rule-based classification methods on top of statistics-based methods in hierarchical Web classification. Experiments show that the proposed methods perform well on the authors' education Web collections.
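
The combination described in the abstract can be illustrated with a minimal sketch. It is one possible reading of the idea, not the authors' implementation: a simple weighted term-count score stands in for the statistics-based step and picks a parent category, and hand-written keyword rules then separate the similar sibling leaves under that parent. The category names, term weights, rules, and function names below are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical parent categories with made-up term weights.
# This stands in for a trained statistical classifier.
PARENT_PROFILES = {
    "courses": {"course": 2.0, "lecture": 1.5, "syllabus": 1.5, "exam": 1.0},
    "research": {"paper": 2.0, "journal": 1.5, "conference": 1.5, "citation": 1.0},
}

# Hypothetical hand-written rules for the sibling leaves of each parent.
# Each rule is (required terms, leaf category); the first matching rule wins.
LEAF_RULES = {
    "courses": [
        ({"video", "lecture"}, "courses/video-lectures"),
        ({"syllabus"}, "courses/syllabi"),
    ],
    "research": [
        ({"conference"}, "research/proceedings"),
        ({"journal"}, "research/journals"),
    ],
}


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def statistical_parent(tokens):
    """Statistical step: pick the parent with the highest weighted term count."""
    counts = Counter(tokens)
    scores = {
        parent: sum(weight * counts[term] for term, weight in profile.items())
        for parent, profile in PARENT_PROFILES.items()
    }
    return max(scores, key=scores.get)


def rule_based_leaf(parent, tokens):
    """Rule step: choose among sibling leaves; fall back to the parent."""
    present = set(tokens)
    for required, leaf in LEAF_RULES[parent]:
        if required <= present:  # all required terms occur in the page
            return leaf
    return parent  # no rule fired, so keep the parent-level label


def classify(text):
    tokens = tokenize(text)
    parent = statistical_parent(tokens)
    return rule_based_leaf(parent, tokens)


print(classify("Video lecture for the algebra course"))
print(classify("Journal paper with citation list"))
```

In this sketch the rules only run inside the parent chosen by the statistical step, which is where the abstract says feature overlap hurts precision most.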

Key words: Statistics-based classification; Rule-based classification; Hierarchical Web classification; Statistics-rule based classification
Received: 11 June 2007      Published: 25 August 2007


Corresponding Author: Tan Jinbo
About the authors: Tan Jinbo, Yang Xiaojiang, Li Yi

Tan Jinbo, Yang Xiaojiang, Li Yi. Study of Statistics-rule Based Hierarchical Web Page Classification. New Technology of Library and Information Service, 2007, 2(8): 59-62.

