hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Arun C Murthy <...@hortonworks.com>
Subject Re: question about file input format
Date Thu, 18 Aug 2011 00:10:37 GMT
What file format do you want to use ?

If it's Text or SequenceFile, or any other existing derivative of FileInputFormat, just override
isSplittable and rely on the actual RecordReader.

Arun

On Aug 17, 2011, at 3:58 PM, Zhixuan Zhu wrote:

> I'm new Hadoop and currently using Hadoop 0.20.2 to try out some simple
> tasks. I'm trying to send each whole file of the input directory to the
> mapper without splitting them line by line. How should I set the input
> format class? I know I could derive a customized FileInputFormat class
> and override the isSplitable function. But I have no idea how to
> implement around the record reader. Any suggestion or a sample code will
> be greatly appreciated. 
> 
> Thanks in advance,
> Grace


Mime
View raw message