Message-ID: <554221621.16021262590734872.JavaMail.jira@brutus.apache.org>
Date: Mon, 4 Jan 2010 07:38:54 +0000 (UTC)
From: "Ning Zhang (JIRA)"
To: hive-dev@hadoop.apache.org
Reply-To: hive-dev@hadoop.apache.org
Subject: [jira] Resolved: (HIVE-900) Map-side join failed if there are large number of mappers
In-Reply-To: <546728803.1256345039494.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/HIVE-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ning Zhang resolved HIVE-900.
-----------------------------

    Resolution: Won't Fix

> Map-side join failed if there are large number of mappers
> ---------------------------------------------------------
>
>                 Key: HIVE-900
>                 URL: https://issues.apache.org/jira/browse/HIVE-900
>             Project: Hadoop Hive
>          Issue Type: Improvement
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>
> A map-side join is efficient when joining a huge table with a small table, because each mapper can read the small table into main memory and perform the join locally. However, if too many mappers are generated for the map join, a large number of them will simultaneously send requests to read the same block of the small table. Hadoop currently has an upper limit on the number of requests for the same block (250?). If that limit is reached, a BlockMissingException is thrown, which causes many mappers to be killed. Retrying does not solve the problem; it makes it worse.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
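
For readers unfamiliar with the technique named in the issue, here is a minimal sketch of a map-side (in-memory hash) join. It is plain Python, not Hive's implementation. The function names `build_hash_table` and `map_side_join` and the sample rows are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def build_hash_table(small_rows, key_index=0):
    """Load the whole small table into memory, keyed on the join column.

    In a real job every mapper performs this load independently, which is
    why many mappers end up reading the same small-table data at once.
    """
    table = defaultdict(list)
    for row in small_rows:
        table[row[key_index]].append(row)
    return table


def map_side_join(big_rows, small_table, key_index=0):
    """Stream one split of the big table and probe the in-memory table.

    This is an inner join: big-table rows without a match are dropped.
    """
    for row in big_rows:
        for match in small_table.get(row[key_index], ()):
            yield row + match


# Hypothetical sample data: a small dimension table and one big-table split.
small = [(1, "US"), (2, "CN")]
big_split = [(1, "click"), (2, "view"), (3, "click"), (1, "view")]

joined = list(map_side_join(big_split, build_hash_table(small)))
```

No shuffle or reduce phase is needed, since each mapper joins its own split locally. The cost is that the small table is loaded once per mapper, so the number of concurrent reads of the small table grows with the number of mappers, which is the contention the issue describes.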