Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A1E369F86 for ; Thu, 9 Aug 2012 13:12:18 +0000 (UTC) Received: (qmail 98566 invoked by uid 500); 9 Aug 2012 13:12:13 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 98473 invoked by uid 500); 9 Aug 2012 13:12:13 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 98466 invoked by uid 99); 9 Aug 2012 13:12:13 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Aug 2012 13:12:13 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=FSL_RCVD_USER,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [130.203.14.76] (HELO arlo.cse.psu.edu) (130.203.14.76) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Aug 2012 13:12:05 +0000 Received: from pool-98-111-120-203.hrbgpa.fios.verizon.net ([98.111.120.203] helo=[192.168.1.50]) by arlo.cse.psu.edu with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1SzSWQ-0001Pw-NR for user@hadoop.apache.org; Thu, 09 Aug 2012 09:11:30 -0400 Message-ID: <5023B6AB.807@cse.psu.edu> Date: Thu, 09 Aug 2012 09:10:03 -0400 From: "Ellis H. Wilson III" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:10.0.6esrpre) Gecko/20120807 Thunderbird/10.0.6 MIME-Version: 1.0 To: user@hadoop.apache.org Subject: fs.local.block.size vs file.blocksize Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: ehw111@cse.psu.edu X-PSUCSE-Spam-Score: -1.4 X-PSUCSE-Spam-Level: - X-Virus-Checked: Checked by ClamAV on apache.org Hi all! Can someone please briefly explain the difference? I do not see deprecated warnings for fs.local.block.size when I run with them set and I see two copies of RawLocalFileSystem.java (the other is local/RawLocalFs.java). The things I really need to get answers to are: 1. Is the default boosted to 64MB from Hadoop 1.0 to Hadoop 2.0? I believe it is, but want validation on that. 2. Which one controls shuffle block-size? 3. If I have a single machine non-distributed instance, and point it at file://, do both of these control the persistent data's block size or just one of them or what? 4. Is there any way to run with say a 512MB blocksize for the persistent data and the default 64MB blocksize for the shuffled data? Thanks! ellis