hive-user mailing list archives

From John Sichi <jsi...@fb.com>
Subject Re: Hive/HBase integration issue.
Date Thu, 18 Nov 2010 19:33:20 GMT
As noted at the link below, when Hive writes to HBase with INSERT OVERWRITE, existing rows whose keys match the new rows are overwritten, but existing rows with other keys are not deleted.

http://wiki.apache.org/hadoop/Hive/HBaseIntegration#Overwrite

There is not yet any deletion support.
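
To make the behaviour above concrete, here is a toy model in Python. It is only a sketch: a plain dict keyed by row key stands in for the HBase table, `insert_overwrite` is a hypothetical helper, and the sample keys and URLs are made up. None of this is the Hive or HBase API.

```python
# Toy model only: a dict keyed by row key stands in for an HBase table.
# insert_overwrite is a hypothetical helper, not a Hive or HBase call.

def insert_overwrite(table, rows):
    """Write each (key, value) pair by key; never delete anything."""
    for key, value in rows:
        table[key] = value  # matching key: replaced; new key: added
    return table

pagedim = {}

# First load: two rows.
insert_overwrite(pagedim, [(0, "a.html"), (1, "b.html")])

# Second load: key 1 is overwritten, key 2 is added, key 0 stays.
insert_overwrite(pagedim, [(1, "b2.html"), (2, "c.html")])

print(pagedim)  # {0: 'a.html', 1: 'b2.html', 2: 'c.html'}
```

In this model the second load behaves like an insert for new keys and like an update for matching keys, and nothing is removed, which is why a row can currently only be replaced, not deleted.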

JVS

On Nov 18, 2010, at 1:00 AM, afancy wrote:

> Hi, 
> 
> Does the INSERT clause have to include OVERWRITE, which means that the new data will overwrite the previous data? How can I perform a plain INSERT instead of an OVERWRITE?
> BTW: How can I implement a DELETE operation? Thanks
> 
> afancy
> 
> 
> -------------------------------------------------------
> hive> insert OVERWRITE table  pagedim select 0, url, strToint('2'), 'domain', 'serversion' from downloadlog;
> Total MapReduce jobs = 2
> Launching Job 1 out of 2
> Number of reduce tasks is set to 0 since there's no reduce operator
> Starting Job = job_201011121525_0006, Tracking URL = http://localhost:50030/jobdetails.jsp?jobid=job_201011121525_0006
> Kill Command = /home/xiliu/hadoop-0.20.2/bin/../bin/hadoop job  -Dmapred.job.tracker=localhost:54311 -kill job_201011121525_0006
> 2010-11-18 09:55:52,155 Stage-1 map = 0%,  reduce = 0%
> 2010-11-18 09:55:55,169 Stage-1 map = 100%,  reduce = 0%
> 2010-11-18 09:55:58,200 Stage-1 map = 100%,  reduce = 100%
> Ended Job = job_201011121525_0006
> Ended Job = 487027960, job is filtered out (removed at runtime).
> Launching Job 2 out of 2
> Number of reduce tasks is set to 0 since there's no reduce operator
> Starting Job = job_201011121525_0007, Tracking URL = http://localhost:50030/jobdetails.jsp?jobid=job_201011121525_0007
> Kill Command = /home/xiliu/hadoop-0.20.2/bin/../bin/hadoop job  -Dmapred.job.tracker=localhost:54311 -kill job_201011121525_0007
> 2010-11-18 09:56:04,701 Stage-2 map = 0%,  reduce = 0%
> 2010-11-18 09:56:07,723 Stage-2 map = 100%,  reduce = 0%
> 2010-11-18 09:56:10,751 Stage-2 map = 100%,  reduce = 100%
> Ended Job = job_201011121525_0007
> Loading data to table pagedim
> 1000 Rows loaded to pagedim
> OK
> Time taken: 23.194 seconds
> hive> insert table  pagedim select 0, url, strToint('2'), 'domain', 'serversion' from downloadlog;
> FAILED: Parse Error: line 1:7 mismatched input 'table' expecting OVERWRITE in insert clause
> 

