hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "W.P. McNeill" <bill...@gmail.com>
Subject Re: What's the easiest way to count the number of <Key, Value> pairs in a directory?
Date Fri, 20 May 2011 17:33:49 GMT
The keys are Text and the values are large custom data structures serialized
with Avro.

I also have counters for the job that generates these files that gives me
this information but sometimes...Well, it's a long story.  Suffice to say
that it's nice to have a post-hoc method too.  :-)

The identity mapper sounds like the way to go.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message