hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Counting no. of keys.
Date Sun, 02 Aug 2009 06:08:31 GMT
Sure.  Write a word count map-reduce program.  The mapper outputs the key
from the sequence file as the output key and includes a count.  Then you do
the normal combiner and reducer from a normal word count program.

On Sat, Aug 1, 2009 at 9:53 PM, prashant ullegaddi <prashullegaddi@gmail.com
> wrote:

> Hi,
>
> I've say 800 sequence files written using SequenceFileOutputFormat. Is
> there
> any way to know
> no. of unique keys in those sequence files?
>
> Thanks,
> Prashant.
>



-- 
Ted Dunning, CTO
DeepDyve

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message