hive-user mailing list archives

From Mark Grover <mgro...@oanda.com>
Subject Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.
Date Mon, 04 Jun 2012 16:26:47 GMT
Hi Tabriz,
As far as I know, newlines are the only supported way to separate records right now. As a
corollary, if a single logical record spans multiple lines, you will have to remove the
extra newlines so that all of it ends up in the same record.

So, to get around it, you can do one of two things:
1) Pre-process your files so that each record sits on a single line, with newlines appearing
only between records.
2) As Ed Capriolo suggested in a previous email thread, you could try to use streaming, parse
out your XML there, and emit multiple records.
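
For option 1, a rough Python sketch is below. It is only an illustration under assumptions taken from the original message: the raw export terminates records with '\001' and separates fields with '|'. The function name flatten_records is made up for this example, and replacing each embedded newline with a single space is just one possible choice (escaping it would work too). It also assumes the whole input fits in memory and that '|' never appears inside the XML values.

```python
RECORD_SEP = "\x01"  # assumed record terminator in the raw export ('\001')


def flatten_records(raw, record_sep=RECORD_SEP, replacement=" "):
    """Return the input with one newline-terminated line per record.

    Hypothetical helper: embedded line breaks inside a record are replaced
    with `replacement`, so Hive's default '\n' record terminator can be used.
    """
    lines = []
    for chunk in raw.split(record_sep):
        # Drop line breaks that merely surround the record terminator.
        record = chunk.strip("\r\n")
        if not record:
            continue  # e.g. the empty chunk after the last terminator
        # Normalize Windows/Mac line endings, then flatten the record.
        flat = record.replace("\r\n", "\n").replace("\r", "\n")
        lines.append(flat.replace("\n", replacement))
    return "".join(line + "\n" for line in lines)


if __name__ == "__main__":
    # Two sample records whose XML fields contain embedded newlines.
    sample = "1|t1|<a>\n<b/>\n</a>|<ok/>\x012|t2|<c/>|<d>\n</d>\x01"
    print(flatten_records(sample), end="")
```

With the files flattened this way, the table could presumably be created with the same statement minus the LINES TERMINATED BY '\001' clause, since '\n' then really does mark the end of each record.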

Mark

----- Original Message -----
From: "tabraiz anwer" <tabraizanwer@yahoo.com>
To: "hive group" <user@hive.apache.org>
Sent: Monday, June 4, 2012 12:08:12 PM
Subject: Problem: LINES TERMINATED BY only supports newline '\n' right now.



Hi,
I tried to create the table with "LINES TERMINATED BY '\001'"
and it gives me this error:


Error in semantic analysis: 3:66 LINES TERMINATED BY only supports newline '\n' right now.
Error encountered near token ''\001'' 


CREATE TABLE xmlgw4 ( transactionid string, typeid string, 
sentxml string,receivedxml string ) 
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\001' 
STORED AS TEXTFILE; 



Instead of '\n' I am using '\001' because I have an XML value that I want to store in Hive,
and it includes \n characters.
Hive version: hive-0.8.1

Any suggestions?
Regards. 
