Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 57310 invoked from network); 3 Apr 2009 01:45:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 3 Apr 2009 01:45:15 -0000 Received: (qmail 8031 invoked by uid 500); 3 Apr 2009 01:45:12 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 7932 invoked by uid 500); 3 Apr 2009 01:45:12 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 7922 invoked by uid 99); 3 Apr 2009 01:45:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Apr 2009 01:45:12 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of javateck@gmail.com designates 209.85.220.176 as permitted sender) Received: from [209.85.220.176] (HELO mail-fx0-f176.google.com) (209.85.220.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Apr 2009 01:45:04 +0000 Received: by fxm24 with SMTP id 24so798460fxm.29 for ; Thu, 02 Apr 2009 18:44:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=2YLimhpdLX2S2OdqZbjOn1nEATGts38oKVXNdhRPpac=; b=cHTk5fdef4RruLxsVVk/gv5JPCk6jW1aoDfAIuS1G1QLWh92J+f3/0NTND5D8TXttv /0jtM8TNu3T5RXihg/th+gxm5ITuKLd4caq/yE/fKPy36P2MHVyAvEHFP/N14GHx8uOa 4dJyNCbkclz/DFD45tIv7W6DX7DNG4uL6hYfU= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=qJwe+0zEYIMitvZkvbhhmtuDWlk1w6OWjKKT8lngD7aMCx4qMqmW8zymDnulIAB/Um bQ4SVRw3C3KPupCxkyp8cWUiORzgoEzJcxlGEIKjp5PQaLzu+dV7FQtgNPuDogQV5/kD HKeUEqDd6JoFdaaXV8GnHb+6K9tFNrytEv+cE= MIME-Version: 1.0 Received: by 10.204.118.138 with SMTP id v10mr208033bkq.208.1238723084184; Thu, 02 Apr 2009 18:44:44 -0700 (PDT) Date: Thu, 2 Apr 2009 18:44:44 -0700 Message-ID: <5ad58bec0904021844x43898f82vad65321bec58260@mail.gmail.com> Subject: HDFS data block clarification From: javateck javateck To: core-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=0016e6d77d9b686c3704669cb143 X-Virus-Checked: Checked by ClamAV on apache.org --0016e6d77d9b686c3704669cb143 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Can someone tell whether a file will occupy one or more blocks? for example, the default block size is 64MB, and if I save a 4k file to HDFS, will the 4K file occupy the whole 64MB block alone? so in this case, do I do need to configure the block size to 10k if most of my files are less than 10K? thanks, --0016e6d77d9b686c3704669cb143--