hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Arun Vasu <arun...@gmail.com>
Subject Sorting huge text files in Hadoop
Date Fri, 15 Feb 2013 14:39:09 GMT

Is it possible to sort a huge text file lexicographically using a mapreduce
job which has only map tasks and zero reduce tasks?

The records of the text file is separated by new line character and the
size of the file is around 1 Terra Byte.

It will be great if any one can suggest a way to achieve sorting on this
huge file.

Thanks in advance,

View raw message