Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm
Reply-To: mapreduce-issues@hadoop.apache.org
Message-ID: <15650356.98981283282395988.JavaMail.jira@thor>
Date: Tue, 31 Aug 2010 15:19:55 -0400 (EDT)
From: "Namit Jain (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Subject: [jira] Created: (MAPREDUCE-2046) A input split cannot be less than a dfs block
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394

A input split cannot be less than a dfs block
----------------------------------------------

                 Key: MAPREDUCE-2046
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2046
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
            Reporter: Namit Jain


I ran into this
while testing some Hive features. Whether we use HiveInputFormat or CombineHiveInputFormat, a split cannot be smaller than the DFS block size. This is a problem if we want to increase the block size for older data to reduce memory consumption on the NameNode, because larger blocks then force larger splits. It would be useful if the input split size were independent of the DFS block size.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
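
To illustrate the constraint described in this issue, here is a minimal arithmetic sketch. It is not Hadoop or Hive code: the function names, the `block_is_floor` flag, and the file and block sizes are all hypothetical, and the model simply assumes the reported behavior (a requested split size below the block size is raised to the block size).

```python
def effective_split_size(requested_split, block_size, block_is_floor=True):
    """Split size actually used (hypothetical model, not Hadoop's code).

    With block_is_floor=True this mirrors the behavior reported in the
    issue: a split can never be smaller than one DFS block.
    """
    if block_is_floor:
        return max(requested_split, block_size)
    return requested_split


def num_splits(file_size, split_size):
    """Number of splits needed to cover file_size bytes (ceiling division)."""
    return (file_size + split_size - 1) // split_size


MB = 1024 * 1024
file_size = 1024 * MB   # illustrative 1 GB file
requested = 64 * MB     # split size the job would like to use
old_block = 64 * MB     # illustrative original block size
new_block = 512 * MB    # illustrative larger block size for older data

# Original block size: the requested split size is honored.
splits_old = num_splits(file_size, effective_split_size(requested, old_block))

# Larger block size with the block as a floor: far fewer, larger splits.
splits_new = num_splits(file_size, effective_split_size(requested, new_block))

# What the issue asks for: split size independent of the block size.
splits_decoupled = num_splits(
    file_size, effective_split_size(requested, new_block, block_is_floor=False)
)

print(splits_old, splits_new, splits_decoupled)
```

Under these assumed numbers, raising the block size cuts the number of splits (and so the available map parallelism) unless the split size is decoupled from the block size, which is the improvement requested here.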