hadoop-user mailing list archives

From Gokula Krishnan <gokula.p.krish...@gmail.com>
Subject Need details how hadoop 2.x perform better than hadoop 1.x
Date Fri, 06 Dec 2013 09:59:05 GMT
The setup consists of Hadoop 1.0.1 and HBase 0.94.x. Loading 10 TB of raw
data into HDFS and then into HBase takes a considerable amount of time
(using hadoop shell -copyFromLocal and a Pig script to load HBase).

My first question: will moving to Hadoop 2.x improve performance? If so,
please provide relevant links or docs that explain how this is achieved.

My second question: I do not need my data sorted while loading it into
HBase, so how can I disable the sort at the Mapper and at the Reducer?

Any other thoughts are welcome.

Thanks in advance.
