hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hive QA (JIRA)" <>
Subject [jira] [Commented] (HIVE-14199) Enable Bucket Pruning for ACID tables
Date Sat, 13 Aug 2016 09:28:20 GMT


Hive QA commented on HIVE-14199:

Here are the results of testing the latest attachment:

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 7 failed/errored test(s), 10471 tests executed
*Failed tests:*

Test results:
Console output:
Test logs:

Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 7 tests failed

This message is automatically generated.

ATTACHMENT ID: 12823552 - PreCommit-HIVE-MASTER-Build

> Enable Bucket Pruning for ACID tables
> -------------------------------------
>                 Key: HIVE-14199
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Transactions
>            Reporter: Saket Saurabh
>            Assignee: Saket Saurabh
>         Attachments: HIVE-14199.01.patch, HIVE-14199.02.patch, HIVE-14199.03.patch
> Currently, ACID tables do not benefit from the bucket pruning feature introduced in HIVE-11525.
The reason for this has been the fact that bucket pruning happens at split generation level
and for ACID, traditionally the delta files were never split. The parallelism for ACID was
then restricted to the number of buckets. There would be as many splits as the number of buckets
and each worker processing one split would inevitably read all the delta files for that bucket,
even when the query may have originally required only one of the buckets to be read.
> However, HIVE-14035 now enables even the delta files to be also split. What this means
is that now we have enough information at the split generation level to determine appropriate
buckets to process for the delta files. This can efficiently allow us to prune unnecessary
buckets for delta files and will lead to good performance gain for a large number of selective
queries on ACID tables.

This message was sent by Atlassian JIRA

View raw message