hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Azuryy Yu <azury...@gmail.com>
Subject Re: Sorting huge text files in Hadoop
Date Sat, 16 Feb 2013 01:13:27 GMT
This is a typical total sort using map/reduce.  it can be done with both
map and reduce.


On Fri, Feb 15, 2013 at 10:39 PM, Arun Vasu <arunvpy@gmail.com> wrote:

> Hi,
>
> Is it possible to sort a huge text file lexicographically using a
> mapreduce job which has only map tasks and zero reduce tasks?
>
> The records of the text file is separated by new line character and the
> size of the file is around 1 Terra Byte.
>
> It will be great if any one can suggest a way to achieve sorting on this
> huge file.
>
> Thanks in advance,
> Arun
>
>

Mime
View raw message