Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 1233 invoked from network); 17 Sep 2008 14:43:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 17 Sep 2008 14:43:14 -0000 Received: (qmail 64475 invoked by uid 500); 17 Sep 2008 14:43:05 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 64440 invoked by uid 500); 17 Sep 2008 14:43:05 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 64429 invoked by uid 99); 17 Sep 2008 14:43:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 Sep 2008 07:43:05 -0700 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [65.99.197.50] (HELO s01.igfoo.com) (65.99.197.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 Sep 2008 14:42:05 +0000 Received: from localhost (localhost [127.0.0.1]) by s01.igfoo.com (Postfix) with ESMTP id 8801AF34165 for ; Wed, 17 Sep 2008 09:42:05 -0500 (CDT) X-DKIM: Sendmail DKIM Filter v2.5.4 s01.igfoo.com 8801AF34165 DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=igfoo.com; s=mail; t=1221662525; bh=mrJaLU9ZXNV21shnboLOGXcdKRUvOblzbmfRz1Nz3zc=; h=Message-ID:Date:From:MIME-Version:To:Subject:References: In-Reply-To:Content-Type:Content-Transfer-Encoding; b=OGtJA2pYRbzL UYermQjW2ZoYw/RLlAeise8Pktoe168u+JkL583v/1U8sO6NY7uD2sQxoorRccJ2jsA 5q3uBhbXES44ZiTVjoT+Wyueoa+RwmUb0rIMmgMqD95qA8GC+PUbfg9/DOR2oCDJeVc BAj8uWltb7e0RsgUN0hbKkZB0= X-Virus-Scanned: Debian amavisd-new at s01.igfoo.com Received: from s01.igfoo.com ([127.0.0.1]) by localhost (s01.igfoo.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3Kg4bbVHthYj for ; Wed, 17 Sep 2008 09:42:00 -0500 (CDT) Received: from [192.168.1.200] (unknown [63.133.162.98]) by s01.igfoo.com (Postfix) with ESMTPSA id AF808F3411E for ; Wed, 17 Sep 2008 09:42:00 -0500 (CDT) X-DKIM: Sendmail DKIM Filter v2.5.4 s01.igfoo.com AF808F3411E DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=igfoo.com; s=mail; t=1221662520; bh=mrJaLU9ZXNV21shnboLOGXcdKRUvOblzbmfRz1Nz3zc=; h=Message-ID:Date:From:MIME-Version:To:Subject:References: In-Reply-To:Content-Type:Content-Transfer-Encoding; b=srs46DKbuNqD zMcqHHwRISVdY92C6dl3604tP+lzfjrGzelyefGs9HViwLyqsg/kpYWhehgudKHTedd Eho3gaEnALY7Ce42aKfriHJzEUqlpn94A+4I3Z42aEqTQcDsKx2baIDVHNYr0Qa2p8x WNhJKNag/dHGCVGJSbEJw6PWA= Message-ID: <48D11736.9000305@apache.org> Date: Wed, 17 Sep 2008 09:41:58 -0500 From: Dennis Kubes User-Agent: Thunderbird 2.0.0.16 (X11/20080724) MIME-Version: 1.0 To: core-user@hadoop.apache.org Subject: Re: scp to namenode faster than dfs put? References: <200809171743.37525.pvvpr@research.iiit.ac.in> In-Reply-To: <200809171743.37525.pvvpr@research.iiit.ac.in> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org While an scp will copy data to the namenode machine, it does *not* store the data in dfs, it simply copies the data to namenode machine. This is the same as copying data to any other machine. The data isn't in DFS and is not accessible from DFS. If the box running the namenode fails you lose your data. The reason put is slower is that the data is actually being stored into the DFS on multiple machines in block format. It is then accessible from programs accessing the DFS such as MR jobs. Dennis Prasad Pingali wrote: > Hello, > I observe that scp of data to the namenode is faster than actually putting > into dfs (all nodes coming from same switch and have same ethernet cards, > homogenous nodes)? I understand that "dfs -put" breaks the data into blocks > and then copies to datanodes, but shouldn't that be atleast as fast as > copying data to namenode from a single machine, if not faster? > > thanks and regards, > Prasad Pingali, > IIIT Hyderabad. > >