Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 306CC305C for ; Fri, 6 May 2011 12:56:49 +0000 (UTC) Received: (qmail 50355 invoked by uid 500); 6 May 2011 12:56:48 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 50282 invoked by uid 500); 6 May 2011 12:56:48 -0000 Mailing-List: contact hdfs-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-user@hadoop.apache.org Delivered-To: mailing list hdfs-user@hadoop.apache.org Received: (qmail 50274 invoked by uid 99); 6 May 2011 12:56:48 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 May 2011 12:56:48 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of doducthanhbk@gmail.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qy0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 May 2011 12:56:43 +0000 Received: by qyk30 with SMTP id 30so3080146qyk.14 for ; Fri, 06 May 2011 05:56:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; bh=G+PJsJSfAw0uYIy2nbOjGhp+ybuih507dJRGq1ojX0I=; b=aF0OejeYaYs4qbClcCXdEOyWkvtFRr6PE7bO1YvD4gEuC1q7ovceYvQCMGsQRPmW1/ K1E3AdLs1tyxxtt6LjW83RVF9Fix+pFy7H4hon8ITSbplzjXSWoxNvxW12Vj9dVYq4W0 M5olArvDzUBm87CD2GPIdLP/hnz20MnDpgPmw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type; b=vEvblN2QBNt+pXCqL/mAh64La22tdeJZ6P4azXoAbwrL2HtJpHVYlDgSdJgfZ0ixhE beBsWwFMqFfi68lV37pavv0gyzKIcTFIv8cxx/ZyiR6n+6lT19Kq0FZEhmOAf+istqdP /MGwAE/JgaEwUhp77wr5ezfGyxkYyUYKVRG8g= MIME-Version: 1.0 Received: by 10.224.207.202 with SMTP id fz10mr3361533qab.168.1304686582820; Fri, 06 May 2011 05:56:22 -0700 (PDT) Sender: doducthanhbk@gmail.com Received: by 10.224.67.213 with HTTP; Fri, 6 May 2011 05:56:22 -0700 (PDT) In-Reply-To: References: Date: Fri, 6 May 2011 07:56:22 -0500 X-Google-Sender-Auth: Z9PPcp_tocPxXQKQS7jDf_bTKDk Message-ID: Subject: Re: experience with Backup Node From: Thanh Do To: hdfs-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=20cf300fb38f4ff03804a29b04ff --20cf300fb38f4ff03804a29b04ff Content-Type: text/plain; charset=UTF-8 I see ... Thanks for useful feedback, Todd! On Fri, May 6, 2011 at 7:34 AM, Todd Lipcon wrote: > Hi Thanh, > > No, I doubt that anybody is running BackupNode in production, since it's > only part of 0.21, and in my opinion an incomplete implementation. A few of > the deficiencies I'm aware of: > > - Like you said, edits are transferred by synchronous RPC from the NN. As > far as I know, there are no timeouts enabled on these RPCs, so if the > backupnode hangs, so will the primary. In the case of a BN crash, the > primary will hang for many minutes before noticing. > - The BN doesn't provide hot standby since it doesn't yet receive block > reports. > > The fact that the RPCs are synchronous seems unavoidable if you want to be > able to do a failover without any lost edits. But without timeouts, it's a > bit scary. > > Some work will be going on in trunk to address high availability over the > next several months - we'd definitely appreciate your expertise in failure > injection, etc, being applied to the new code as it goes in! > > -Todd > > On Thu, May 5, 2011 at 9:28 AM, Thanh Do wrote: > >> hi all, >> >> any body deploy the Backup Node in your system. >> I am curious about the impact of the Backup Node >> to the NameNode throughput. >> >> To my understanding, NameNode streams edits >> log operation to the BackupNode (by an RPC call), >> and only return once that operation has been applied >> to the in memory state of the Backup Node. >> >> Will this RPC call slow down the NameNode a little bit. >> >> Thanks >> Thanh >> > > > > -- > Todd Lipcon > Software Engineer, Cloudera > --20cf300fb38f4ff03804a29b04ff Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
I see ...

Thanks for useful feedback, Todd!
On Fri, May 6, 2011 at 7:34 AM, Todd Lipcon <todd@cloudera.com> wrote:
Hi Thanh,

No, I doubt th= at anybody is running BackupNode in production, since it's only part of= 0.21, and in my opinion an incomplete implementation. A few of the deficie= ncies I'm aware of:

- Like you said, edits are transferred by synchronous R= PC from the NN. As far as I know, there are no timeouts enabled on these RP= Cs, so if the backupnode hangs, so will the primary. In the case of a BN cr= ash, the primary will hang for many minutes before noticing.
- The BN doesn't provide hot standby since it doesn't yet rece= ive block reports.

The fact that the RPCs are sync= hronous seems unavoidable if you want to be able to do a failover without a= ny lost edits. But without timeouts, it's a bit scary.

Some work will be going on in trunk to address high ava= ilability over the next several months - we'd definitely appreciate you= r expertise in failure injection, etc, being applied to the new code as it = goes in!

-Todd

<= div class=3D"gmail_quote">On Thu, May 5, 2011 at 9:28 AM, Thanh Do <
than= hdo@cs.wisc.edu> wrote:
hi all,

any body deploy the Backup Node in your system.<= /div>
I am curious about the impact of=C2=A0the Backup Node
t= o the NameNode throughput.

To my understanding, Na= meNode streams edits
log operation to the BackupNode (by an RPC call),
and only r= eturn once that operation has been applied
to the in memory state= of the Backup Node.

Will this RPC call slow down = the NameNode a little bit.

Thanks
Thanh



--
Todd Lipcon=
Software Engineer, Cloudera

--20cf300fb38f4ff03804a29b04ff--