Date: Wed, 22 Jan 2014 11:01:16 -0800 (PST)
From: Jeff N
To: dev@accumulo.apache.org
Subject: Re: Rack and Datacenter Awareness
Message-ID: <1390417276828-7225.post@n5.nabble.com>
References: <1390254996382-7193.post@n5.nabble.com>

@Adam I am currently interested in the latter half of your second question.
My main interest lies in determining how to optimize data processing. If I have two data centers that are geographically far apart and I am working on a local machine but need data from the second data center, how do I have the processing occur in the second data center? The constraints on this problem include not knowing which HDFS node holds the data, only that it is within the network topology I currently reside in. Furthermore, the question pertains to Map/Reduce jobs that use the AccumuloInputFormat. Is it possible to have the distant data center run my Mapper and send me the resulting data set, instead of running the Mapper locally and making numerous network queries?