hadoop-hdfs-user mailing list archives

From Harsh J <ha...@cloudera.com>
Subject Re: Splitting input file
Date Wed, 22 Aug 2012 03:57:49 GMT
Hi Grandl,

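You can set "textinputformat.record.delimiter" to "." in your job
configuration to have records from a text file split at periods instead
of newlines. Isn't that sufficient? The delimiter is applied when the job
reads the file, not when the file is added to HDFS, and you do not need
to write any special InputFormat for text files this way.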
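
To make the effect concrete, here is a small plain-Java sketch of what records look like once a custom delimiter replaces the newline. It is not Hadoop code: DelimiterSketch and splitRecords are hypothetical names used only for illustration, and the sketch assumes that the delimiter is dropped from each record and that newlines are kept as ordinary characters. In a real job driver, the property would typically be set with conf.set("textinputformat.record.delimiter", ".") on the job's Configuration before the job is created.

```java
import java.util.ArrayList;
import java.util.List;

public class DelimiterSketch {

    // Hypothetical helper (not part of Hadoop): cuts the text at every
    // occurrence of the delimiter. The delimiter itself is not included in
    // the records, and newlines stay inside a record as ordinary characters.
    static List<String> splitRecords(String text, String delimiter) {
        if (delimiter.isEmpty()) {
            throw new IllegalArgumentException("delimiter must not be empty");
        }
        List<String> records = new ArrayList<>();
        int start = 0;
        int hit;
        while ((hit = text.indexOf(delimiter, start)) >= 0) {
            records.add(text.substring(start, hit));
            start = hit + delimiter.length();
        }
        // Trailing text with no closing delimiter still forms a record.
        if (start < text.length()) {
            records.add(text.substring(start));
        }
        return records;
    }

    public static void main(String[] args) {
        String text = "First sentence. Second one\nspans two lines. Third";
        for (String record : splitRecords(text, ".")) {
            System.out.println("[" + record + "]");
        }
    }
}
```

Note that a bare "." delimiter also cuts at abbreviations and decimal numbers ("e.g.", "3.14"), and each record keeps the whitespace that followed the previous period, so a mapper would usually trim its input.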

On Wed, Aug 22, 2012 at 8:51 AM, Grandl Robert <rgrandl@yahoo.com> wrote:
> Hi,
> I think there have been many discussions about splitting an input file based
> on custom delimiters.
> However, I am not sure whether there is a simple way to split a text input
> file at the end of sentences (.) without writing any custom splitting code or
> so. Can I simply specify such a delimiter when I add the input into HDFS?
> Thanks,
> Robert

Harsh J
