Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3ABC01810F for ; Thu, 21 May 2015 18:21:51 +0000 (UTC) Received: (qmail 56928 invoked by uid 500); 21 May 2015 18:21:50 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 56850 invoked by uid 500); 21 May 2015 18:21:50 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 56839 invoked by uid 99); 21 May 2015 18:21:50 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 21 May 2015 18:21:50 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id CC748C1385 for ; Thu, 21 May 2015 18:21:49 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.98 X-Spam-Level: ** X-Spam-Status: No, score=2.98 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id gnF2YTKVMoRL for ; Thu, 21 May 2015 18:21:41 +0000 (UTC) Received: from mail-la0-f41.google.com (mail-la0-f41.google.com [209.85.215.41]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 504D343BAB for ; Thu, 21 May 2015 18:21:41 +0000 (UTC) Received: by laat2 with SMTP id t2so108735686laa.1 for ; Thu, 21 May 2015 11:20:55 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; bh=B5o6ZdgduxzihWAKzTfBXWKBkKSfeUjJ7fxXQP8NL9I=; b=Ywasy5F4QV4SmmwYJF27jJBcaPrmo3fkbqVWrqxST8dvINfZQNm1SqOiyrWErGYJ87 a/HLq+bTHfrUI+5XXUniwN0m2/6y7yXs/XlFoxojGULU1+KlpA75TPZtOdaI9ul+g/W7 f3wuNlX+wibLDp8ixUI8fU4JwSbDIBpzkO6MvZhxdWGMYt8crjwOqsMyjuqn+ZR3rTG+ W1OTpRey9yuUTgw969WQA8OY4l8rSV6+ZHcKfYH56365IR+clf1FHPlSt63iVXSBzEsi 9rvXRPv1VMf5yseZbwIVltLkZTG73gjcV6mRVKUw3Wr6QrYtJSSUD1IEoQAeXJBRmWU+ Qlow== X-Gm-Message-State: ALoCoQlz84XfId9iLlU3lD9clceRywJ7jTGL4ox+QqJNZVZf69dRaCi/iB4dzQB7lMxmpgFMaPF3 X-Received: by 10.152.5.225 with SMTP id v1mr3262442lav.111.1432232454889; Thu, 21 May 2015 11:20:54 -0700 (PDT) MIME-Version: 1.0 Received: by 10.25.84.78 with HTTP; Thu, 21 May 2015 11:20:34 -0700 (PDT) In-Reply-To: References: From: Jonathan Hsieh Date: Thu, 21 May 2015 11:20:34 -0700 Message-ID: Subject: Re: [DISCUSSION] Merge of the hbase-11339 mob branch into master. To: "dev@hbase.apache.org" Content-Type: multipart/alternative; boundary=089e013d1e0eb5b6e305169b9ae1 --089e013d1e0eb5b6e305169b9ae1 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Not that I know of. Here's a quick high level comparison based on my read of the ozone doc are api differences: - ozone is described an s3 clone while hbase mob is similar to a rdbms's blob feature. - ozone names - 3-64 bytes, hbase mob names - hbase qual limit -(64k i think) - ozone object sizes - 100k's to 100M's; hbase mob - 10k's-10'sM (tested at 50M) - ozone - hash partitioning, hbase mob - range partitioning It isn't clear to me how they handle updates and deletes, hdfs snapshots and other hdfs backup mechanisms. Jon. On Wed, May 20, 2015 at 8:10 PM, =E5=BC=A0=E9=93=8E = wrote: > Is there any comparison between HBASE-11339 and HDFS-7240? > Is their 'Object' a super set of our 'Medium Object'? > > 2015-05-21 10:38 GMT+08:00 Ted Yu : > > > This is a useful feature, Jon. > > > > I went over the mega-patch and left some comments on review board. > > > > I noticed that hbck was not included in the patch. Neither did I find a > > sub-task of HBASE-11339 that covers hbck. > > > > Do you or Jingcheng plan to add MOB-aware capability for hbck ? > > > > Cheers > > > > On Wed, May 20, 2015 at 9:21 AM, Jonathan Hsieh > wrote: > > > > > Hi folks, > > > > > > The Medium Object (MOB) Storage feature (HBASE-11339[1]) is modified > I/O > > > and compaction path that allows individual moderately sized values > > > (10k-10MB) to be stored so that write amplification is reduced when > > > compared to the normal I/O path. At a high level, it provides > alternate > > > flush and compaction mechanisms that segregates large cells into a > > separate > > > area where they are not subject to potentially frequent compaction an= d > > > splits that can be encountered in the normal I/O path. A more detaile= d > > > design doc can be found on the hbase-11339 jira. > > > > > > Jingcheng Du has been working on the mob feature for a while and Anoo= p, > > Ram > > > and I have been shepherding him through the design revisions and > > > implementation of the feature in the hbase-11339 branch.[2] > > > > > > The branch we are proposing to merge into master is compatible with > > HBase's > > > core functionality including snapshots, replication, shell support, > > behaves > > > well with table alters, bulk loads and does not require external MR > > > processes. It has been documented, and subject to many integration te= st > > > runs (ITBLL, ITAcidGuarantees, ITIngest) including fault injection. > > > Performance testing of the feature shows what can be a 2x-3x throughp= ut > > > improvement for workloads that contain mobs. These results can be see= n > on > > > the hbase 2.0 panel discussion slides from hbasecon (once published). > > > > > > Recently there have been some hfile encryption related shortcomings > that > > we > > > could address in branch or in master. > > > > > > Earlier iterations of the feature has been tested in production by > users > > > that Jingcheng has been responsible for. A version has also been > > deployed > > > at users I have been responsible for. Some of the folks from Huawei > > > (ashutosh) have also been submitting the recent encryption bug report= s > > > against the hbase-11339 branch so there is some evidence of usage by > > them. > > > > > > The four of us (Jingcheng, Ram, Anoop and I) are satisfied with the > > > feature and feel it is a good time to call a merge vote. Ive posted = a > > > megapatch version for folks who want to peruse the code. [3] > > > > > > What do you all think? > > > > > > Thanks, > > > Jingcheng, Jon, Ram, and Anoop. > > > > > > [1] https://issues.apache.org/jira/browse/HBASE-11339 > > > [2] https://github.com/apache/hbase/tree/hbase-11339 > > > [3] https://reviews.apache.org/r/34475/ > > > -- > > > // Jonathan Hsieh (shay) > > > // HBase Tech Lead, Software Engineer, Cloudera > > > // jon@cloudera.com // @jmhsieh > > > > > > --=20 // Jonathan Hsieh (shay) // HBase Tech Lead, Software Engineer, Cloudera // jon@cloudera.com // @jmhsieh --089e013d1e0eb5b6e305169b9ae1--