hadoop-common-user mailing list archives

From Jim Falgout <jim.falg...@pervasive.com>
Subject RE: Quick question
Date Fri, 18 Feb 2011 19:55:14 GMT
That's right as far as your mapper is concerned. The splits themselves are still computed by byte offset, but TextInputFormat handles records that cross split boundaries.
What your mapper will see is "whole" records.
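
To illustrate the boundary handling, here is a minimal Python sketch. It is not Hadoop's code; it only imitates the usual line-reader convention, assuming that a reader whose split does not start at offset 0 discards everything up to the first newline, and that every reader finishes the line it is in even if that line runs past the end of its split. The names read_split and make_splits are hypothetical helpers made up for this example.

```python
def make_splits(size: int, n: int) -> list[tuple[int, int]]:
    """Cut the byte range [0, size) into at most n contiguous (start, end) splits."""
    step = -(-size // n)  # ceiling division
    return [(s, min(s + step, size)) for s in range(0, size, step)]


def read_split(data: bytes, start: int, end: int) -> list[bytes]:
    """Return the complete lines that belong to the byte split [start, end)."""
    pos = start
    if start != 0:
        # Skip the first line seen from this offset: the previous split's
        # reader is responsible for it (it reads past its own end).
        nl = data.find(b"\n", start)
        if nl == -1:
            return []
        pos = nl + 1
    records = []
    # A line that begins at or before `end` belongs to this split,
    # even if its bytes extend beyond `end`.
    while pos <= end and pos < len(data):
        nl = data.find(b"\n", pos)
        if nl == -1:
            records.append(data[pos:])  # last line without a trailing newline
            break
        records.append(data[pos:nl])
        pos = nl + 1
    return records


# 2000 lines, 20 byte-based splits, as in the question.
data = b"".join(b"line %d\n" % i for i in range(2000))
per_mapper = [read_split(data, s, e) for s, e in make_splits(len(data), 20)]
all_lines = [rec for recs in per_mapper for rec in recs]
assert all_lines == data.splitlines()  # every line seen exactly once, in order
```

The split boundaries here fall in the middle of lines, yet each simulated mapper still gets only complete lines and no line is lost or duplicated.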

-----Original Message-----
From: maha [mailto:maha@umail.ucsb.edu] 
Sent: Friday, February 18, 2011 1:14 PM
To: common-user
Subject: Quick question

Hi all,

  I want to check if the following statement is right:

 If I use TextInputFormat to process a text file with 2000 lines (each ending with \n) with
20 mappers, then each map will receive a sequence of COMPLETE LINES.

In other words, the input is not split byte-wise but by lines.

Is that right?


Thank you,
Maha

