hadoop-hdfs-user mailing list archives

From Manish Verma <manish.lifepa...@gmail.com>
Subject Re: Using MS DOS File in Hive
Date Wed, 26 Mar 2014 03:55:51 GMT
Hi Andrew,
Some of the field values in this file contain LF characters. I was trying to
find a way that does not require preprocessing the file to make it conform to
Unix-style line endings. I believe that by writing your own file format/splitter
classes you can use any delimiter in a MapReduce input file, but I could not
find how to specify CRLF as the record delimiter in Hive.
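
To illustrate why the embedded LFs matter: a blanket CRLF-to-LF conversion would turn every LF inside a field into a record boundary. The sketch below is only an assumption about what a preprocessing step could look like, not something Hive or Hadoop provides. It splits on CRLF only and escapes each embedded LF as the two characters backslash and n before writing LF-terminated records. The function names and the escape choice are hypothetical, and the whole input is held in memory, so it suits a small test more than a large HDFS file.

```python
from typing import List


def split_crlf_records(data: bytes) -> List[bytes]:
    """Split on CRLF only; a bare LF stays inside its record."""
    records = data.split(b"\r\n")
    # Input that ends with CRLF leaves one empty trailing piece; drop it.
    if records and records[-1] == b"":
        records.pop()
    return records


def to_unix_records(data: bytes, lf_replacement: bytes = b"\\n") -> bytes:
    """Rewrite CRLF-delimited data as LF-delimited records.

    Each embedded LF is replaced with lf_replacement (by default the two
    bytes backslash and n, an assumed escape) so that it cannot be
    mistaken for a record boundary afterwards.
    """
    cleaned = [rec.replace(b"\n", lf_replacement) for rec in split_crlf_records(data)]
    return b"".join(rec + b"\n" for rec in cleaned)


if __name__ == "__main__":
    # Hypothetical sample: record 1 has an LF inside its second field.
    sample = b"1,first line\nsecond line\r\n2,plain value\r\n"
    for record in split_crlf_records(sample):
        print(record)
    print(to_unix_records(sample))
```

The escaped LFs would still have to be turned back into real newlines at query time if the original field values are needed.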


On Tue, Mar 25, 2014 at 7:36 PM, Andrew Ehrlich <andrew@aehrlich.com> wrote:

> I think you can only use "\n" to denote new lines in Hive. What if you
> replaced each CRLF sequence with a "\n" in the data pipeline into HDFS, or
> with a MapReduce job after the files are in HDFS?
> On Tue, Mar 25, 2014 at 6:15 PM, Manish Verma <manish.lifepart2@gmail.com> wrote:
>> I am trying to build a Hive table on an MS-DOS file (each record ends with a
>> CRLF sequence). Does anyone know how to do this?
>> Thanks
>> Manish
