hadoop-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rakesh Davanum <rakesh...@gmail.com>
Subject Restricting number of records from map output
Date Wed, 12 Jan 2011 18:03:56 GMT

I have a sort job consisting of only the Mapper (no Reducer) task. I want my
results to contain only the top n records. Is there any way of restricting
the number of records that are emitted by the Mappers?

Basically I am looking to see if there is an equivalent of achieving
the behavior similar to LIMIT in SQL queries.

Thanks & Regards,

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message