A Method for Generating Co-occurrence Matrix of Mass Data Based on Hadoop
Yang Daiqing1,2, Zhang Zhixiong1
1 (National Science Library, Chinese Academy of Sciences, Beijing 100190, China)
2 (Institute of Scientific and Technical Information of China, Beijing 100038, China)
Abstract: Mass data processing is a focal point of information technology. This paper introduces the architecture of Hadoop, an open-source parallel computing system, analyzes the MapReduce programming framework built on Hadoop, and proposes a method for generating a co-occurrence matrix from mass data through multiple MapReduce operations.
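
The abstract does not spell out the individual MapReduce steps, so the following is only a minimal sketch of how a co-occurrence matrix could be built in two chained passes, not the authors' method. It assumes each input record is the list of terms in one document, and it simulates the map, shuffle, and reduce phases in memory instead of running on Hadoop. All names (run_mapreduce, pair_mapper, pair_reducer, row_mapper, row_reducer, cooccurrence_matrix) are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def run_mapreduce(records, mapper, reducer):
    """In-memory stand-in for one MapReduce job: map, shuffle by key, reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    output = []
    for key in sorted(groups):
        output.extend(reducer(key, groups[key]))
    return output


# Job 1 (assumed): count how often each unordered term pair shares a document.
def pair_mapper(doc_terms):
    for a, b in combinations(sorted(set(doc_terms)), 2):
        yield (a, b), 1


def pair_reducer(pair, counts):
    yield pair, sum(counts)


# Job 2 (assumed): regroup the pair counts by row term to form matrix rows.
def row_mapper(pair_count):
    (a, b), n = pair_count
    yield a, (b, n)
    yield b, (a, n)  # emit both directions so the matrix is symmetric


def row_reducer(term, entries):
    yield term, dict(sorted(entries))


def cooccurrence_matrix(docs):
    pair_counts = run_mapreduce(docs, pair_mapper, pair_reducer)
    return dict(run_mapreduce(pair_counts, row_mapper, row_reducer))


# Toy input: three documents, each given as a list of terms.
docs = [
    ["hadoop", "mapreduce", "matrix"],
    ["hadoop", "mapreduce"],
    ["matrix", "hadoop"],
]
matrix = cooccurrence_matrix(docs)
```

Here matrix maps each term to a sparse row of co-occurrence counts. Splitting the work into two jobs keeps every reducer simple: the first only sums, the second only regroups, and this is one plausible reading of "multiple MapReduce operations".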
Received: 28 March 2009
Published: 25 April 2009
Corresponding Author:
Yang Daiqing
E-mail: yangdq@mail.las.ac.cn
About authors: Yang Daiqing, Zhang Zhixiong