hadoop-common-user mailing list archives

From Marcus Herou <marcus.he...@tailsweep.com>
Subject Re: Sort by value
Date Thu, 09 Jul 2009 20:17:50 GMT
Yep, figured that.

On Thu, Jul 9, 2009 at 7:09 PM, Owen O'Malley <omalley@apache.org> wrote:

> You need two jobs:
>
> 1. map: line -> line, 1, combiner & reducer: sum values, sort by line
> 2. map: line, count -> count, line & reducer: count, line -> line, count
>
> So job 1 looks like word count and job 2 sorts it by the counts.
>
> -- Owen
>
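
The two jobs quoted above can be sketched in memory. This is only a rough illustration in plain Python, not the Hadoop API: `run_job`, `count_map`, `count_reduce`, `swap_map`, `swap_reduce` and `sort_by_count` are made-up names, and the "shuffle" is just a sort on the key. It assumes ascending order by count is what is wanted.

```python
from itertools import groupby
from operator import itemgetter


def run_job(records, mapper, reducer):
    """Simulate one MapReduce job in memory: map, shuffle (sort by key), reduce."""
    mapped = [kv for record in records for kv in mapper(record)]
    mapped.sort(key=itemgetter(0))  # the shuffle orders by key only, never by value
    output = []
    for key, group in groupby(mapped, key=itemgetter(0)):
        output.extend(reducer(key, [value for _, value in group]))
    return output


# Job 1 (word-count style): map line -> (line, 1), reduce by summing the ones.
def count_map(line):
    yield line, 1


def count_reduce(line, ones):
    yield line, sum(ones)


# Job 2: map (line, count) -> (count, line) so the shuffle sorts by count,
# then the reducer swaps each pair back to (line, count).
def swap_map(pair):
    line, count = pair
    yield count, line


def swap_reduce(count, lines):
    for line in lines:
        yield line, count


def sort_by_count(lines):
    counts = run_job(lines, count_map, count_reduce)   # sorted by line
    return run_job(counts, swap_map, swap_reduce)      # sorted by count


if __name__ == "__main__":
    for line, count in sort_by_count(["b", "a", "b", "c", "a", "b"]):
        print(line, count)
```

The second pass is needed because the framework sorts on the key only, so the count has to become the key before anything is ordered by it. On a real cluster with more than one reducer, each output file would be sorted only within itself, so a single reducer or a total-order partitioner would presumably be needed for one globally sorted result.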



-- 
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
marcus.herou@tailsweep.com
http://www.tailsweep.com/
