Date: Mon, 16 Jul 2012 11:15:35 +0000 (UTC)
From: "Namit Jain (JIRA)"
To: hive-dev@hadoop.apache.org
Subject: [jira] [Commented] (HIVE-3260) support bucketed mapjoin where the small table has different number of buckets for different partitons

[ https://issues.apache.org/jira/browse/HIVE-3260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13415004#comment-13415004 ]

Namit Jain commented on HIVE-3260:
----------------------------------

I agree - this code is not the difficult part.
But we need to check every place where partition metadata is used, such as sampling. Let us defer it for now and get back to it later. Do you want to open the jira for now?

> support bucketed mapjoin where the small table has different number of buckets for different partitons
> ------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-3260
>                 URL: https://issues.apache.org/jira/browse/HIVE-3260
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Namit Jain
>
> Consider the following scenario:
> A (1 partition) join B (2 partitions)
> A has 2 buckets, whereas B has 2 and 4 buckets for different partitions.
> The bucketed mapjoin should still work.
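
The scenario in the issue description can be sketched as a per-partition bucket mapping. This is only an illustration, not Hive's actual implementation: it assumes rows land in bucket hash(key) % num_buckets, that A is the big table and B the small one, and that the two bucket counts are multiples of each other. The function names and the partition names p1 and p2 are hypothetical.

```python
def matching_small_buckets(big_bucket, big_count, small_count):
    """Return the small-table bucket indices that can hold keys
    found in bucket `big_bucket` of the big table.

    Assumes (not confirmed by the issue) bucket = hash(key) % num_buckets.
    """
    if big_count % small_count == 0:
        # Small table has fewer (or equal) buckets: exactly one candidate.
        return [big_bucket % small_count]
    if small_count % big_count == 0:
        # Small table has more buckets: several candidates per big bucket.
        return [j for j in range(small_count) if j % big_count == big_bucket]
    raise ValueError("bucket counts must be multiples of each other")


def bucket_mapping(big_count, small_partition_counts):
    """Build {big_bucket: {partition: [small_buckets]}} where each
    small-table partition may have its own bucket count."""
    return {
        big_bucket: {
            part: matching_small_buckets(big_bucket, big_count, count)
            for part, count in small_partition_counts.items()
        }
        for big_bucket in range(big_count)
    }


# A has 2 buckets; B has partitions with 2 and 4 buckets (hypothetical names).
mapping = bucket_mapping(2, {"p1": 2, "p2": 4})
for big_bucket, per_partition in mapping.items():
    print(big_bucket, per_partition)
```

Under these assumptions the mapping has to be computed per partition of the small table rather than once per table, which is why the partition-level bucket metadata matters.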