Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 41207 invoked from network); 27 Mar 2008 19:53:01 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 27 Mar 2008 19:53:01 -0000 Received: (qmail 18205 invoked by uid 500); 27 Mar 2008 19:52:57 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 18184 invoked by uid 500); 27 Mar 2008 19:52:57 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Delivered-To: moderator for core-dev@hadoop.apache.org Received: (qmail 99052 invoked by uid 99); 27 Mar 2008 19:43:27 -0000 X-ASF-Spam-Status: No, hits=3.8 required=10.0 tests=RCVD_NUMERIC_HELO,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) X-USANET-Routed: 3 gwsout-vs R:localhost:1825 X-USANET-Source: 165.212.116.254 IN jean-pierre.ocalan@247realmedia.com GW2.EXCHPROD.USA.NET X-USANET-MsgId: XID318mcATQ29164Xo2 Subject: [Map/Reduce][HDFS] From: Jean-Pierre To: core-dev@hadoop.apache.org, core-user@hadoop.apache.org Content-Type: text/plain Content-Transfer-Encoding: 7bit Date: Thu, 27 Mar 2008 15:41:59 -0400 Message-Id: <1206646919.32242.0.camel@JPBeast> Mime-Version: 1.0 X-Mailer: Evolution 2.12.1 X-OriginalArrivalTime: 27 Mar 2008 19:42:51.0962 (UTC) FILETIME=[BE9339A0:01C89042] X-Virus-Checked: Checked by ClamAV on apache.org Hello, I'm working on large amount of logs, and I've noticed that the distribution of data on the network (./hadoop dfs -put input input) takes a lot of time. Let's says that my data is already distributed among the network, is there anyway to say to hadoop to use the already existing distribution ?. Thanks -- Jean-Pierre