Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 15479 invoked from network); 25 May 2010 10:04:37 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 25 May 2010 10:04:37 -0000 Received: (qmail 18307 invoked by uid 500); 25 May 2010 10:04:36 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 18037 invoked by uid 500); 25 May 2010 10:04:33 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 18029 invoked by uid 99); 25 May 2010 10:04:32 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 May 2010 10:04:32 +0000 X-ASF-Spam-Status: No, hits=-2.0 required=10.0 tests=AWL,RCVD_IN_DNSWL_MED,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [192.6.10.60] (HELO tobor.hpl.hp.com) (192.6.10.60) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 May 2010 10:04:24 +0000 Received: from localhost (localhost [127.0.0.1]) by tobor.hpl.hp.com (Postfix) with ESMTP id CE000B7E48 for ; Tue, 25 May 2010 11:04:01 +0100 (BST) X-Virus-Scanned: amavisd-new at hplb.hpl.hp.com Received: from tobor.hpl.hp.com ([127.0.0.1]) by localhost (tobor.hpl.hp.com [127.0.0.1]) (amavisd-new, port 10024) with LMTP id rQpt+rX-voBq for ; Tue, 25 May 2010 11:03:51 +0100 (BST) Received: from 0-imap-br1.hpl.hp.com (0-imap-br1.hpl.hp.com [16.25.144.60]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by tobor.hpl.hp.com (Postfix) with ESMTPS id D986FB7C6D for ; Tue, 25 May 2010 11:03:50 +0100 (BST) MailScanner-NULL-Check: 1275386617.53637@tfPlj9dxNcrkHzeTvUjaDQ Received: from [16.25.175.158] (morzine.hpl.hp.com [16.25.175.158]) by 0-imap-br1.hpl.hp.com (8.14.1/8.13.4) with ESMTP id o4PA3a1o002163 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Tue, 25 May 2010 11:03:37 +0100 (BST) Message-ID: <4BFBA078.5000505@apache.org> Date: Tue, 25 May 2010 11:03:36 +0100 From: Steve Loughran User-Agent: Thunderbird 2.0.0.24 (X11/20100228) MIME-Version: 1.0 To: general@hadoop.apache.org Subject: Re: Active-Active Performance References: <590CDE7A083C4142AF05E96F2FB543BC9733D2@r-exchange.cardlink.local> <590CDE7A083C4142AF05E96F2FB543BC9733F9@r-exchange.cardlink.local> In-Reply-To: <590CDE7A083C4142AF05E96F2FB543BC9733F9@r-exchange.cardlink.local> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-HPL-MailScanner-Information: Please contact the ISP for more information X-MailScanner-ID: o4PA3a1o002163 X-HPL-MailScanner: Found to be clean X-HPL-MailScanner-From: stevel@apache.org Anthony Ikeda wrote: > Thanks Hemanth, > > In regards to different locations of the HADOOP home this is low > priority more for testing not production. I was trying to install HADOOP > for testing over 2 machines with only a Windows XP machine running > Cygwin and a Mac running Darwin. Not a priority. Things are much easier if -all your machines have the same OS, disk structure -you are running on linux -you use some CM tool to automate setup/deploy, pushing out of config files Start now, start with VMWare or virtualbox images now, so you learn about management sooner rather than later > In regards to my last question about operating in a detached fashion, we > are trying to factor in what happens when the link between both sites is > cut. Will both sites operate independently until the connection is > re-established? Is there any particular setup required to ensure we can > cover this scenario or is it an out-of-the-box feature? HDFS and the MapReduce engine is designed to run on a single datacentre with high bandwidth, high reliability links, current releases assume the facility is secure and all users are trusted. The key SPOF, the Namenode, doesn't do failover, so when it goes down or the network partitions, all machines that cannot see the NN poll and spin until it comes back -which can take a while, unless you have a secondary namenode to keep the persistent files up to date. the workers all assume that the hostname and IPAddr of the namenode doesn't change, and never reread their config. You could use DNS to do failover, but you have to tune the JVMs to not cache IP addresses for very long. To do cross site stuff you'd need a separate HDFS filesystem per site, synchronisation of data now becomes a task for the higher level apps. I don't know what HBase, Cassandra or other column DB tools do here. -steve