hadoop-hive-dev mailing list archives

From "Joydeep Sen Sarma (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (HIVE-110) table/partition compaction utility
Date Wed, 03 Dec 2008 22:41:44 GMT

     [ https://issues.apache.org/jira/browse/HIVE-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joydeep Sen Sarma resolved HIVE-110.

    Resolution: Duplicate

> table/partition compaction utility
> ----------------------------------
>                 Key: HIVE-110
>                 URL: https://issues.apache.org/jira/browse/HIVE-110
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Joydeep Sen Sarma
> In many cases, we cannot avoid producing a large number of output files in a final result table. This puts pressure on the namenode and also makes downstream queries potentially inefficient (until we have CombineFileInputFormat). Users would like a simple utility in Hive that reduces the number of files by concatenating them. It should operate on a per-partition basis.
> Long term - Hive should do this kind of compaction automatically - either online or offline.
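
As a rough illustration of the per-partition concatenation requested above, here is a minimal sketch. It is not Hive's or Hadoop's API: it assumes a partition is a plain local directory of line-oriented text files, where byte-level concatenation is safe, and the names compact_partition and part-compacted are hypothetical.

```python
import os
import shutil
import tempfile


def compact_partition(partition_dir, output_name="part-compacted"):
    """Concatenate the data files of one partition directory into one file.

    Hypothetical helper: assumes line-oriented text files on a local
    filesystem. Returns the number of data files found in the partition.
    """
    # Skip hidden and underscore-prefixed entries, which conventionally
    # hold metadata rather than table data.
    names = sorted(
        n for n in os.listdir(partition_dir)
        if os.path.isfile(os.path.join(partition_dir, n))
        and not n.startswith((".", "_"))
    )
    if len(names) <= 1:
        return len(names)  # nothing to compact

    # Write to a hidden temporary file first, so an interrupted run never
    # leaves a half-written file under the final name.
    fd, tmp_path = tempfile.mkstemp(dir=partition_dir, prefix=".compact-")
    with os.fdopen(fd, "wb") as out:
        for n in names:
            with open(os.path.join(partition_dir, n), "rb") as src:
                shutil.copyfileobj(src, out)

    # Remove the small inputs, then move the merged file into place.
    for n in names:
        os.remove(os.path.join(partition_dir, n))
    os.replace(tmp_path, os.path.join(partition_dir, output_name))
    return len(names)
```

Calling compact_partition once per partition directory keeps the operation on a per-partition basis. Formats with headers or block structure could not be merged by simple byte concatenation and would need a format-aware rewrite.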

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
