hive-user mailing list archives

From Prem Yadav <ipremya...@gmail.com>
Subject Re: Hive huge 'startup time'
Date Fri, 18 Jul 2014 13:52:56 GMT
Maybe you can post your partition structure and the query. Over-partitioning
the data is one of the reasons this happens.
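
As a rough way to check for over-partitioning, one could count the partition
directories and data files under the table's location. The sketch below is
only an illustration. It assumes the table directory is reachable as a local
path (for example through a mount), and that partitions use the usual
key=value directory naming. The helper name partition_stats is hypothetical
and is not part of Hive.

```python
import os

def partition_stats(table_root):
    """Count leaf partition directories and data files under table_root.

    Assumes a Hive-style layout of nested key=value directories.
    Hypothetical helper for illustration, not a Hive API.
    """
    partitions = 0
    files = 0
    for dirpath, dirnames, filenames in os.walk(table_root):
        # Ignore marker and hidden files such as _SUCCESS or .crc files.
        data_files = [f for f in filenames if not f.startswith(("_", "."))]
        files += len(data_files)
        # A leaf partition is a key=value directory with no subdirectories.
        if "=" in os.path.basename(dirpath) and not dirnames:
            partitions += 1
    avg = files / partitions if partitions else 0.0
    return {"partitions": partitions, "files": files, "files_per_partition": avg}
```

Many thousands of partitions holding only a few small files each would hint
that the table is over-partitioned, since more partitions and files generally
mean more metadata and file listing work before the first job is submitted.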


On Fri, Jul 18, 2014 at 2:36 PM, diogo <diogo@uken.com> wrote:

> This is probably a simple question, but I'm noticing that for queries that
> run on 1+TB of data, it can take Hive up to 30 minutes to actually start
> the first map-reduce stage. What is it doing? I imagine it's gathering
> information about the data somehow, since this 'startup' time is clearly a
> function of the amount of data I'm trying to process.
>
> Cheers,
>
