Date: Fri, 7 May 2010 16:52:28 +0600
Subject: data locality on HDFS
From: momina khan
To: core-dev@hadoop.apache.org

Hi,

I am trying to figure out how Hadoop uses data locality to schedule map tasks on the nodes that locally store the map input. Going through the code, I keep going in circles between a couple of files without really getting anywhere. That is to say, I can't locate the HDFS API or function that returns the list of nodes storing the replicas of, say, a given block.

I go from FSNamesystem.java to DFSClient.java to BlocksWithLocations.java to DatanodeDescriptor.java and then back again, without ever reaching the HDFS interface that reports which nodes store a block's replicas.

Could someone please help?

momina