hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Namit Jain <nj...@fb.com>
Subject Re: tons of bugs and problem found
Date Fri, 28 Jan 2011 00:12:52 GMT
Hi Laurent,

1. Are you saying that _top.sql did not exist in the home directory.
Or that, _top.sql existed, but hive was not able to read it after loading


2. I don¹t think reserved words are documented somewhere. Can you file a
jira
   for this ?

3. The bad row is printed in the task log.

1. 2011-01-27 11:11:07,046 INFO org.apache.hadoop.fs.FSInputChecker: Found
checksum error: b[1024,
1536]=7374796c653d22666f6e742d73697a653a20313270743b223e3c623e266e6273703b2
66e6273703b266e6273703b202a202838302920416d69656e733a3c2f623e3c2f7370616e3e
3c2f7370616e3e5c6e20203c2f703e5c6e20203c703e5c6e202020203c7370616e207374796
c653d22666f66742d66616d696c793a2068656c7665746963613b223e3c7370616e20737479
6c653d22666f6e742d73697a653a20313270743b223e3c623e266e6273703b266e6273703b2
66e6273703b266e6273703b266e6273703b266e6273703b266e6273703b266e6273703b266e
6273703b206f203132682c2050697175652d6e6971756520646576616e74206c65205265637
46f7261742e3c2f623e3c2f7370616e3e3c2f7370616e3e5c6e20203c2f703e5c6e20203c70
3e5c6e202020203c7370616e207374796c653d22666f6e742d66616d696c793a2068656c766
5746963613b223e3c7370616e207374796c653d22666f6e742d73697a653a20313270743b22
3e3c623e266e6273703b266e6273703b266e6273703b266e6273703b266e6273703b266e627
3703b266e6273703b266e6273703b266e6273703b206f2031346833302c204d6169736f6e20
6465206c612063756c747572652e3c2f623e3c2f7370616e3e3c2f7370616e3e5c6e20203c2
f703e5c6e20203c703e5c6e202020203c7370616e207374796c653d

2. org.apache.hadoop.fs.ChecksumException: Checksum error:
/blk_2466764552666222475:of:/user/hive/warehouse/article/article.copy at
23446528

3.         at 
org.apache.hadoop.fs.FSInputChecker.verifySum(FSInputChecker.java:27




Thanks,
-namit





On 1/27/11 7:50 AM, "Laurent Laborde" <kerdezixe@gmail.com> wrote:

>Friendly greetings !
>I am very frustrated about hive. i like the program but i find a
>problem avery day i try to use it and i can't find any support on irc.
>so here i am !
>
>1st problem :
>hive> create table top_test like top;
>OK
>
>hive> load data local inpath
>'/home/ker2x/over-blog/archive-20101213/_top.sql' into table top_test;
>Copying data from file:/home/ker2x/over-blog/archive-20101213/_top.sql
>Loading data to table top_test
>OK
>Time taken: 0.832 seconds
>
>hive> select count(1) from top_test;
>...
>OK
>0
>
>if the file loaded being with a "_" there is no error but it's
>impossible to do any query on it.
>
>of course if i do :
>hive> load data local inpath
>'/home/ker2x/over-blog/archive-20101213/top.sql' into table top_test;
>Copying data from file:/home/ker2x/over-blog/archive-20101213/top.sql
>Loading data to table top_test
>OK
>Time taken: 0.908 seconds
>
>hive> select count(1) from top_test;
>...
>OK
>1460112
>Time taken: 34.779 seconds
>
>it works without problem ...
>
>The first time i got this problem i lost many hours (loading + query)
>because the imported file was tens of GB :(
>
>
>2nd problem :
>hive> create table testing (data string);
>FAILED: Parse Error: line 1:22 mismatched input 'data' expecting
>Identifier in column specification
>
>can't create a column named "data". is it documented anywhere ?
>same problem with a column named "date".
>same problem with a column named "comment".
>same problem with a column named "timestamp".
>
>3rd problem :
>i'm loading a big file, no problem.
>now i run *any* query on this table and here is the result :
>http://pastebin.com/mE8Gzdbq
>Look like a line or a column doesn't have the expected format. But...
>which line ? which column ?
>i'm on this problem since days :(((
>
>Can you help please ?
>thank you.
>
>-- 
>Laurent "ker2x" Laborde
>Sysadmin & DBA at http://www.over-blog.com/


Mime
View raw message