Data Analysis and Knowledge Discovery  2017, Vol. 1 Issue (2): 58-63    DOI: 10.11925/infotech.2096-3467.2017.02.08
Identifying Chinese Microblog Author Gender Based on Dependency
Ruihua Qi
School of Software, Dalian University of Foreign Languages, Dalian 116044, China
[Objective] This paper proposes a new method to identify the gender of Chinese microblog authors using dependency features. [Methods] This study collected public posts from Tencent Microblogs, extracted their dependency features, and compared them with existing vocabulary, structural, function-word, and part-of-speech features. [Results] In a controlled experiment, the proposed method obtained the highest precision, recall, and F-measure. [Limitations] The new method needs to be examined on a larger corpus. [Conclusions] Of the feature sets compared, the proposed method is the most effective way to identify the gender of microblog authors.

Key words: Dependency; Chinese Microblog; Gender Identification
Received: 06 October 2016      Published: 27 March 2017

Ruihua Qi. Identifying Chinese Microblog Author Gender Based on Dependency. Data Analysis and Knowledge Discovery, 2017, 1(2): 58-63.

