hadoop-user mailing list archives

From Jens Scheidtmann <jens.scheidtm...@gmail.com>
Subject Re: Understanding Sys.output from mapper & partitioner
Date Fri, 29 Mar 2013 20:10:09 GMT
Dear Sai Sai,

you wrote:
> key = 0 value = 10    10
> key = 6 value = 20    200
> ...

The key passed to your mapper is the byte offset at which the respective
line starts in your input file; the value is the text of that line.
See TextInputFormat docs here:
http://hadoop.apache.org/docs/stable/api/org/apache/hadoop/mapred/TextInputFormat.html

I guess TextInputFormat is being used as the default here, since you didn't
specify a different input format (e.g. another FileInputFormat subclass).
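
To make the offsets concrete, here is a small plain-Java sketch (no Hadoop
involved, so it only imitates how the keys come about). It assumes your
input lines are tab-separated and end with a single "\n"; the class and
method names are made up for illustration. Under that assumption the first
line "10<TAB>10" occupies 5 bytes plus the newline, so the second line
starts at byte 6, which would match the keys you saw.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Hadoop: computes the byte offset at
// which each line of a text starts, similar to the keys a mapper sees
// when lines are read one at a time from a text file.
class LineOffsets {

    // Returns the starting byte offset of every line in the given content.
    static List<Long> offsets(String content) {
        byte[] bytes = content.getBytes(StandardCharsets.UTF_8);
        List<Long> result = new ArrayList<>();
        long lineStart = 0;
        for (int i = 0; i < bytes.length; i++) {
            if (bytes[i] == '\n') {
                result.add(lineStart);   // a line ends here, record where it began
                lineStart = i + 1;       // the next line starts after the newline
            }
        }
        if (lineStart < bytes.length) {
            result.add(lineStart);       // last line without a trailing newline
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed sample input: two tab-separated lines, "\n" line endings.
        String content = "10\t10\n20\t200\n";
        String[] lines = content.split("\n");
        List<Long> keys = offsets(content);
        for (int i = 0; i < keys.size(); i++) {
            System.out.println("key = " + keys.get(i) + " value = " + lines[i]);
        }
    }
}
```

If your file used "\r\n" line endings or multi-byte characters, the offsets
would shift accordingly, since they count bytes and not characters or lines.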

Best regards,

Jens




