Return-Path: Delivered-To: apmail-hive-dev-archive@www.apache.org Received: (qmail 79335 invoked from network); 15 Dec 2010 21:22:26 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 15 Dec 2010 21:22:26 -0000 Received: (qmail 57570 invoked by uid 500); 15 Dec 2010 21:22:25 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 57542 invoked by uid 500); 15 Dec 2010 21:22:25 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 57534 invoked by uid 500); 15 Dec 2010 21:22:25 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 57531 invoked by uid 99); 15 Dec 2010 21:22:25 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Dec 2010 21:22:25 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Dec 2010 21:22:23 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oBFLM25H021714 for ; Wed, 15 Dec 2010 21:22:02 GMT Message-ID: <8985946.145571292448122116.JavaMail.jira@thor> Date: Wed, 15 Dec 2010 16:22:02 -0500 (EST) From: "Ning Zhang (JIRA)" To: hive-dev@hadoop.apache.org Subject: [jira] Updated: (HIVE-1806) The merge criteria on dynamic partitons should be per partiton In-Reply-To: <18222401.268461290540193656.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HIVE-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ning Zhang updated HIVE-1806: ----------------------------- Attachment: HIVE-1806.patch > The merge criteria on dynamic partitons should be per partiton > -------------------------------------------------------------- > > Key: HIVE-1806 > URL: https://issues.apache.org/jira/browse/HIVE-1806 > Project: Hive > Issue Type: Bug > Reporter: Ning Zhang > Assignee: Ning Zhang > Attachments: HIVE-1806.patch > > > Currently the criteria of whether a merge job should be fired on dynamic generated partitions are is the average file size of files across all dynamic partitions. It is very common that some dynamic partitions contains mostly large files and some contains mostly small files. Even though the average size of the total files are larger than the hive.merge.smallfiles.avgsize, we should merge those partitions containing small files only. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.