Return-Path: X-Original-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E1945DC8A for ; Sat, 22 Sep 2012 03:24:31 +0000 (UTC) Received: (qmail 11372 invoked by uid 500); 22 Sep 2012 03:24:31 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 11168 invoked by uid 500); 22 Sep 2012 03:24:30 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 11141 invoked by uid 99); 22 Sep 2012 03:24:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 22 Sep 2012 03:24:30 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbounour@ddn.com designates 74.62.46.229 as permitted sender) Received: from [74.62.46.229] (HELO mail.datadirectnet.com) (74.62.46.229) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 22 Sep 2012 03:24:24 +0000 Received: from LAX-EX-CAHT2.datadirect.datadirectnet.com (10.8.103.82) by dermtp01.datadirect.datadirectnet.com (10.8.16.38) with Microsoft SMTP Server (TLS) id 8.3.192.1; Fri, 21 Sep 2012 20:24:02 -0700 Received: from LAX-EX-MB2.datadirect.datadirectnet.com ([fe80::96:2379:f1b2:ef2d]) by LAX-EX-CAHT2.datadirect.datadirectnet.com ([fe80::5097:7f65:334d:ade5%12]) with mapi id 14.02.0298.004; Fri, 21 Sep 2012 20:24:00 -0700 From: Joe Bounour To: "hdfs-dev@hadoop.apache.org" Subject: Re: data center aware hadoop? Thread-Topic: data center aware hadoop? Thread-Index: AQHNj6xCRLWD4cTt0USS9jWUA/jxT5eWAvoA///DcAA= Date: Sat, 22 Sep 2012 03:24:00 +0000 Message-ID: In-Reply-To: <1868241821.57598398.1348272550538.JavaMail.root@vmware.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: user-agent: Microsoft-MacOutlook/14.2.3.120616 x-originating-ip: [10.8.103.201] Content-Type: text/plain; charset="us-ascii" Content-ID: <53A9C139C75F3648A258A738D8232888@ddn.com> Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Hello Interesting topic but is it really a general use case? The average HDFS cluster today is lower than 100 nodes , still, it is a lot of stored data. You would have to synchronize petabytes over high latency networks and I would assume, you are using HDFS as an archive (meaning you do not replace the content often). The Social network companies cycles logs in HDFS because the most recent data is their focus. HDFS Site protection would have to be async mode for sure (performance) and dealing with data consistency will have to be handle as well which is never simple; Could use WAN accelerator, of course it is all doable. HDFS has already protection (3 replica), disaster recovery is relevant for the namenode, hopefully you would not lose HDFS content Enterprise requirements from Ops would look at SAN solution for Datanodes and replicate the storage array or do a backup of it; if you cannot use SAN and stuck with DAS, make more copies to have more protection level (be pragmatic and save $$) Maybe I am missing the point below, why is it really needed -J On 9/21/12 5:09 PM, "Jun Ping Du" wrote: >Hi Sujee, > HDFS today doesn't consider too much on data center level reliability >(although it is supposed to extend to data center layer in topology but >never honored in replica policemen/balancer/task scheduling policy) and >performance is part of concern to cross data center (assume cross-dc >bandwidth is lower than within data center). However, in future, I think >we should deliver a solution to enable data center level disaster >recovery even performance is downgrade. My several years experience in >delivering enterprise software is: it is best to let customer to make >trade-off decision on performance and reliability, and engineering effort >is to provide options. >BTW, HDFS HA is a protection of key nodes from SPOF but not handle the >whole data center shutdown. > >Thanks, > >Junping > >----- Original Message ----- >From: "Sujee Maniyam" >To: "hdfs-dev" >Sent: Tuesday, September 11, 2012 7:29:39 AM >Subject: data center aware hadoop? > >HI devs >now that hfds HA is is a reality, how about HDFS spanning multiple >data centers? Are there any discussions / work going on in this area? > >It could be a single cluster spanning multiple data centers or having >a 'standby cluster' in another data center. > >curious, and thanks for your time! > >regards >Sujee Maniyam >http://sujee.net