Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 94080 invoked from network); 5 Sep 2007 20:23:56 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 5 Sep 2007 20:23:56 -0000 Received: (qmail 96794 invoked by uid 500); 5 Sep 2007 20:23:49 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 96772 invoked by uid 500); 5 Sep 2007 20:23:49 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 96763 invoked by uid 99); 5 Sep 2007 20:23:49 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Sep 2007 13:23:49 -0700 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Sep 2007 20:23:54 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id E31A0714209 for ; Wed, 5 Sep 2007 13:23:33 -0700 (PDT) Message-ID: <18219431.1189023813926.JavaMail.jira@brutus> Date: Wed, 5 Sep 2007 13:23:33 -0700 (PDT) From: "Owen O'Malley (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-1838) Files created with an pre-0.15 gets blocksize as zero, causing performance degradation In-Reply-To: <4315729.1188974793507.JavaMail.root@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12525207 ] Owen O'Malley commented on HADOOP-1838: --------------------------------------- I'd much rather have the upgrade set the blocksize to the default block size in the case of single block files, rather leave 0 as a special value. The problem with special values is that they need to be tested for in every single use of the field and are thus much much harder to maintain. > Files created with an pre-0.15 gets blocksize as zero, causing performance degradation > -------------------------------------------------------------------------------------- > > Key: HADOOP-1838 > URL: https://issues.apache.org/jira/browse/HADOOP-1838 > Project: Hadoop > Issue Type: Bug > Components: dfs > Affects Versions: 0.15.0 > Reporter: dhruba borthakur > Assignee: dhruba borthakur > Priority: Blocker > Fix For: 0.15.0 > > Attachments: blockSizeZero.patch > > > HADOOP-1656 introduced the support for storing block size persistently as inode metadata. Previously, if the file has only one block then it was not possible to accurately determine the blocksize that the application has requested at file-creation time. > The upgrade of an older layout to the new layout kept the blocksize as zero for single-block files that were upgraded to the new layout. This was done to indicate the DFS really does not know the "true" blocksize of this file. This caused map-reduce to determine that a split is 1 byte in length! -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.