Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B80E3D761 for ; Mon, 20 May 2013 22:49:03 +0000 (UTC) Received: (qmail 43710 invoked by uid 500); 20 May 2013 22:49:01 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 43665 invoked by uid 500); 20 May 2013 22:49:01 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 43655 invoked by uid 99); 20 May 2013 22:49:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 May 2013 22:49:01 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of varun@pinterest.com designates 209.85.210.182 as permitted sender) Received: from [209.85.210.182] (HELO mail-ia0-f182.google.com) (209.85.210.182) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 May 2013 22:48:55 +0000 Received: by mail-ia0-f182.google.com with SMTP id z3so8187778iad.13 for ; Mon, 20 May 2013 15:48:34 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=KytR4z53PlSaRCy0hG6mnmY3vuGaPyjIQbmUGcY1+Vo=; b=YkWLG++5a9QrAX8khzwzNYI/bVujVjLBUmcdeBI0hBUpmm/f4tCrzmYd8QMZYLEGzQ a9M9kAv31iw0/aZYnKibx6lX7g0MjTtYRWHadS+meeX0Ub+PZqNLKeSVBywB+6MCuOhC 5yoKzhiO4tb8MdZO36E631fGdlfamC6mPLZJKfanjt37jKFLrhVgM1hDsQ5oiESBBTKb 1MSGWgweBU6+CHt5H9HBM9GJc8hqiCGDC/snLoYA92tWG9I59wA6WM0OquReOOhnrlVn AloIXWrFcDs9NjYxwo2YWzl/SAdwxyltKwKe6yD3kfluG5a6JwQU1wmzFfbpP9W1C2Qh ECKA== MIME-Version: 1.0 X-Received: by 10.50.17.234 with SMTP id r10mr6331407igd.102.1369090114675; Mon, 20 May 2013 15:48:34 -0700 (PDT) Received: by 10.231.84.6 with HTTP; Mon, 20 May 2013 15:48:34 -0700 (PDT) In-Reply-To: References: <1369021773.11235.YahooMailNeo@web140605.mail.bf1.yahoo.com> Date: Mon, 20 May 2013 15:48:34 -0700 Message-ID: Subject: Re: Questions about HBase replication From: Varun Sharma To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=14dae9340ff5f37eab04dd2e210a X-Gm-Message-State: ALoCoQkA2meleMCvOCPc2FdGMnzhAey4zVNL2d+ZtHcCvy2T7ssnOy3nvU2dU+FvuZpOO3XsxDRa X-Virus-Checked: Checked by ClamAV on apache.org --14dae9340ff5f37eab04dd2e210a Content-Type: text/plain; charset=ISO-8859-1 Thanks JD for the response... I was just wondering if issues have ever been seen with regards to moving over a large number of WAL(s) entirely from one region server to another since that would double the replication related load on the one server which takes over. Another side question: After the WAL has been replicated - is it purged immediately or soonish from the zookeeper ? Thanks Varun On Mon, May 20, 2013 at 9:57 AM, Jean-Daniel Cryans wrote: > On Mon, May 20, 2013 at 12:35 AM, Varun Sharma > wrote: > > Hi Lars, > > > > Thanks for the response. > > > > Regarding #2 again, so if RS1 failed, then the following happens... > > 1) RS2 takes over its logs... > > 2) Master renames the log containing directory to have a -splitting in > the > > path > > 3) Does RS2 already know about the "-splitting" path ? > > It will look at all the possible locations. See > ReplicationSource.openReader > > > > > Also on a related note, was there a reason that we have all region > servers > > watching all other region server's queue of logs. Otherwise, couldn't the > > master have done the reassignment of outstanding logs to other region > > servers more fairly upon failure ? > > I think I did it like that because it was easier since the region > server has to be told to grab the queue(s) anyway. > > > > > Thanks > > Varun > > > > > > On Sun, May 19, 2013 at 8:49 PM, lars hofhansl wrote: > > > >> #1 yes > >> #2 no > >> > >> :) > >> > >> Now, there are scenarios where inconsistencies can happen. The edits are > >> not necessarily shipped in order when there are failures. > >> So it is possible to have some Puts at T1 and some Deletes at T2 (T1 < > >> T2), and end up with the deletes shipped first. > >> Now imagine a compaction happens at the slave after the Deletes are > >> shipped to the slave, but before the Puts are shipped... The Puts will > >> reappear. > >> > >> -- Lars > >> > >> > >> > >> ________________________________ > >> From: Varun Sharma > >> To: user@hbase.apache.org > >> Sent: Sunday, May 19, 2013 12:13 PM > >> Subject: Questions about HBase replication > >> > >> > >> Hi, > >> > >> I have a couple of questions about HBase replication... > >> > >> 1) When we ship edits to slave cluster - do we retain the timestamps in > the > >> edits - if we don't, I can imagine hitting some inconsistencies ? > >> > >> 2) When a region server fails, the master renames the directory > containing > >> WAL(s). Does this impact reading of those logs for replication ? > >> > >> Thanks > >> Varun > >> > --14dae9340ff5f37eab04dd2e210a--