hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nitin Pawar <nitinpawar...@gmail.com>
Subject Re: hive task fails when left semi join
Date Tue, 16 Jul 2013 07:29:10 GMT
Can you try map only join?
Your one table is just 1k records .. map join will help you run it faster
and hopefully you will not hit memory condition


On Tue, Jul 16, 2013 at 12:56 PM, <kira.wang@xiaoi.com> wrote:

> Hello,****
>
> ** **
>
> I am trying to filter out some records in a table in hive.****
>
> The number of lines in this table is 4billions+, ****
>
> I make a left semi join between above table and a small table with 1k
> lines.****
>
> ** **
>
> However, after 3 hours job running, it turns out a fail status.****
>
> ** **
>
> My question are as follows,****
>
> **1.     **How could I address this problem and final solve it?****
>
> **2.     **Is there any other good methods could filter out records with
> give conditions?****
>
> ** **
>
> The following picture is a snapshot of the failed job.****
>
> ****
>
> ** **
>



-- 
Nitin Pawar

Mime
View raw message