hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Owen O'Malley" <>
Subject Re: Partition performance
Date Wed, 03 Jul 2013 14:56:54 GMT
On Wed, Jul 3, 2013 at 5:19 AM, David Morel <> wrote:

> That is still not really answering the question, which is: why is it slower
> to run a query on a heavily partitioned table than it is on the same number
> of files in a less heavily partitioned table.

According to Gopal's investigations in, each time Hive plans a
query, it does a query per a partition to the backing SQL database. That
would explain a lot of the latency for tables with large numbers of

-- Owen

View raw message