Date: Thu, 16 Jul 2015 20:41:35 -0700
Subject: Re: [DISCUSSION] Merge of the hbase-11339 mob branch into master.
From: Ted Yu
To: "dev@hbase.apache.org", "Du, Jingcheng"
Cc: "hbase-dev@hadoop.apache.org"

Jingcheng:
Let us know if you need anything for composition of the vote thread.

On Tue, Jul 14, 2015 at 3:58 PM, Andrew Purtell wrote:

> That's great, and I also appreciate the time spent discussing on this
> thread. I plan to vote yes.
>
> On Tue, Jul 14, 2015 at 3:57 PM, Jonathan Hsieh wrote:
>
> > Since Jingcheng did the majority of the work, I'd like him to start
> > the vote thread. It should happen in the next few days.
> >
> > Jon
> >
> > On Tue, Jul 14, 2015 at 3:53 PM, Andrew Purtell wrote:
> >
> > > Start a vote thread? This says DISCUSSION
> > >
> > > On Tue, Jul 14, 2015 at 2:05 PM, Ted Yu wrote:
> > >
> > > > There have been several iterations since the first mega patch
> > > > was posted.
> > > >
> > > > Currently QA run is green.
> > > >
> > > > IntegrationTestIngestWithMOB has been run which passes.
> > > >
> > > > I want to give +1 for merging to master branch.
> > > >
> > > > On Tue, Jul 7, 2015 at 9:37 PM, Anoop John wrote:
> > > >
> > > > > +1
> > > > >
> > > > > We will work on making it a Vote thread.
> > > > >
> > > > > -Anoop-
> > > > >
> > > > > On Wed, Jul 8, 2015 at 9:46 AM, ramkrishna vasudevan
> > > > > <ramkrishna.s.vasudevan@gmail.com> wrote:
> > > > >
> > > > > > +1 to start the voting again.
> > > > > >
> > > > > > On Wed, Jul 8, 2015 at 9:07 AM, Ted Yu wrote:
> > > > > >
> > > > > > > Looks like all MOB-related JIRAs have been resolved.
> > > > > > >
> > > > > > > Should the voting process be resumed?
> > > > > > >
> > > > > > > Cheers
> > > > > > >
> > > > > > > On Thu, May 28, 2015 at 5:48 PM, Anoop John
> > > > > > > <anoop.hbase@gmail.com> wrote:
> > > > > > >
> > > > > > > > Yes Andy. The sweep tool is completely optional now. We
> > > > > > > > have a chore doing the compaction, like we trigger auto
> > > > > > > > major compaction. We can configure the interval. Auto
> > > > > > > > can be turned off and user can explicitly call also. We
> > > > > > > > have shell and API support.
> > > > > > > >
> > > > > > > > Anoop
> > > > > > > >
> > > > > > > > On Friday, May 29, 2015, Andrew Purtell wrote:
> > > > > > > > > MOB references in cells won't find their value if the
> > > > > > > > > MOB hfile has been corrupted. Dealing with that would
> > > > > > > > > be like any other corrupted HFile, understood. The
> > > > > > > > > dangling references make thinking about (partial)
> > > > > > > > > recovery and repair interesting.
> > > > > > > > >
> > > > > > > > > On Thursday, May 28, 2015, Jingcheng Du
> > > > > > > > > <jingcheng.du@intel.com> wrote:
> > > > > > > > >
> > > > > > > > >> Andrew Purtell wrote
> > > > > > > > >> > HBCK can check and sideline dangling reference
> > > > > > > > >> > files. I think of MOB files as "core enough"
> > > > > > > > >> > auxiliary files that need some support. I suppose
> > > > > > > > >> > unlike reference files their presence or absence
> > > > > > > > >> > won't produce a region open failure, we would see
> > > > > > > > >> > dangling pointers later when trying to service
> > > > > > > > >> > queries. (Yes?) Will that abort the RS? Pardon the
> > > > > > > > >> > ignorant question, normally I could check the code
> > > > > > > > >> > but I'm at the airport on a phone.
> > > > > > > > >>
> > > > > > > > >> Thanks for comments!
> > > > > > > > >> In mob, usually the reference cells are committed
> > > > > > > > >> after the mob files are done. I think it hardly
> > > > > > > > >> happens that a reference cell cannot find its mob
> > > > > > > > >> file.
> > > > > > > > >> Even if there's a dangling reference cell, the RS
> > > > > > > > >> won't be aborted, an empty cell is returned instead.
> > > > > > > > >>
> > > > > > > > >>
> > > > > > > > >> Andrew Purtell wrote
> > > > > > > > >> > On that subject, I should file follow up issues for
> > > > > > > > >> > more check and repair options for HFiles. We should
> > > > > > > > >> > be able to detect missing or corrupt files of all
> > > > > > > > >> > variety: HFile, reference, MOB. This may require an
> > > > > > > > >> > expensive scan over lots of files, but this is like
> > > > > > > > >> > fsck full disk surface scans and those have similar
> > > > > > > > >> > costs. Providing MR based tools is fine but we
> > > > > > > > >> > should have multithreaded tools that can stand in
> > > > > > > > >> > if a MR runtime is not available. Import, Export,
> > > > > > > > >> > VerifyReplication...all of these tools are in a
> > > > > > > > >> > different, lesser, class than integrity and repair
> > > > > > > > >> > tools, in my opinion. Since MOB will likely be
> > > > > > > > >> > merged into trunk by then I'll be sure to include
> > > > > > > > >> > it. I agree it's not fair to ask more of MOB than
> > > > > > > > >> > what we have now for HFile.
> > > > > > > > >>
> > > > > > > > >> To detect the corrupt files, some code is needed in
> > > > > > > > >> the file checker to check mob files after knowing
> > > > > > > > >> it's a mob-enabled column.
> > > > > > > > >> To detect the missing or dangling reference cells, I
> > > > > > > > >> think we have to do a full-table scan, like what is
> > > > > > > > >> done now in HFile.main.
> > > > > > > > >> We can do that.
> > > > > > > > >>
> > > > > > > > >> --
> > > > > > > > >> View this message in context:
> > > > > > > > >> http://apache-hbase.679495.n3.nabble.com/DISCUSSION-Merge-of-the-hbase-11339-mob-branch-into-master-tp4071644p4071911.html
> > > > > > > > >> Sent from the HBase Developer mailing list archive at
> > > > > > > > >> Nabble.com.
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Best regards,
> > > > > > > > >
> > > > > > > > >    - Andy
> > > > > > > > >
> > > > > > > > > Problems worthy of attack prove their worth by hitting
> > > > > > > > > back. - Piet Hein (via Tom White)
> > >
> > > --
> > > Best regards,
> > >
> > >    - Andy
> > >
> > > Problems worthy of attack prove their worth by hitting back. - Piet
> > > Hein (via Tom White)
> >
> > --
> > // Jonathan Hsieh (shay)
> > // HBase Tech Lead, Software Engineer, Cloudera
> > // jon@cloudera.com // @jmhsieh
>
> --
> Best regards,
>
>    - Andy
>
> Problems worthy of attack prove their worth by hitting back. - Piet Hein
> (via Tom White)