Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3A29F1871B for ; Mon, 21 Sep 2015 15:24:57 +0000 (UTC) Received: (qmail 32422 invoked by uid 500); 21 Sep 2015 15:24:55 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 32349 invoked by uid 500); 21 Sep 2015 15:24:55 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 32337 invoked by uid 99); 21 Sep 2015 15:24:55 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 21 Sep 2015 15:24:55 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 57707F4F4C for ; Mon, 21 Sep 2015 15:24:54 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 4.463 X-Spam-Level: **** X-Spam-Status: No, score=4.463 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URI_HEX=1.313] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id UqRyA35OfgZd for ; Mon, 21 Sep 2015 15:24:45 +0000 (UTC) Received: from mail-io0-f181.google.com (mail-io0-f181.google.com [209.85.223.181]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 4682E20382 for ; Mon, 21 Sep 2015 15:24:45 +0000 (UTC) Received: by iofh134 with SMTP id h134so123391219iof.0 for ; Mon, 21 Sep 2015 08:24:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=PKsjG/0c/C40hATyQrGFuG1BBV1bkxtl/oy2zHxB3fo=; b=pHfMWitHkiAZdvPyJk8HTxC7sfk2eIm1drEcZv4Hg3Oachh0tZr3EQaGE+xPPRdOUn pQGUUDbdTnQnUY1/X2N9nLvYcAKxep8jFAzbIkkX3grKUMnp7vK4QkFYjGXEQwAhL6h1 PnvDg5aB57rmhuEVWIgzDne5DU8hmjvwDaIFirQ9BBfYknoNfpUBzDXXHEXFPv0vYkcK s2OQULkbmj5DVYS1tH0p1OYWwdrpwRgzAay41H+iDgT+irdRB0oHwPhBTe5Rhqo06bOv mJWWCLwFMaWG/QnIAL9hJUuijKDRCpWaRsiMrjetC0Wp27ZowkomJ5sJCV+C5hACbVd2 uk6Q== MIME-Version: 1.0 X-Received: by 10.107.131.134 with SMTP id n6mr26174684ioi.192.1442849084167; Mon, 21 Sep 2015 08:24:44 -0700 (PDT) Received: by 10.36.219.133 with HTTP; Mon, 21 Sep 2015 08:24:44 -0700 (PDT) In-Reply-To: <45B48B4E-4363-4F82-B548-2C72628082E1@gmail.com> References: <1441319118144-4074407.post@n3.nabble.com> <1441320221821-4074410.post@n3.nabble.com> <1442371454468-4074696.post@n3.nabble.com> <1442547425371-4074727.post@n3.nabble.com> <1442553377273-4074729.post@n3.nabble.com> <1442553564111-4074730.post@n3.nabble.com> <1442554255593-4074731.post@n3.nabble.com> <45B48B4E-4363-4F82-B548-2C72628082E1@gmail.com> Date: Mon, 21 Sep 2015 23:24:44 +0800 Message-ID: Subject: Re: Multiwal performance with HBase 1.x From: Yu Li To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a113ea628205b800520437b26 --001a113ea628205b800520437b26 Content-Type: text/plain; charset=UTF-8 @Ted, Sure, I've opened HBASE-14457 as an umbrella for all works done on multiwal and please allow me to give a more detailed sharing there, in format of documents rather than emails. :-) @Jingcheng and @Vlad, any comments/sharing from you will be warmly welcomed in the JIRA. :-) Best Regards, Yu On 21 September 2015 at 19:35, Ted Yu wrote: > Thanks for sharing, Yu. > > Images didn't go through. Can you use third party site for sharing ? > > Cheers > > > On Sep 20, 2015, at 11:09 PM, Yu Li wrote: > > > > Hi Vlad, > > > > >> the existing write performance is more than adequate (avg load per RS > usually less than 1MB/sec) > > We have some different user scenarios and I'd like to share with you. We > are using hbase to store data for building search index, and features like > pv/uv of each online item will be recorded, so the write load would reach > as high as 10MB/s (below is a screenshot of the ganglia metrics data) per > RS. OTOH, as a database I think the online write performance of HBase is as > important as read, bulkload is for offline and it cannot resolve all > problem. > > > > > > Another advantage of using multiple wal is that we could do > user/business level isolation on wal. For example you could use one > namespace per business and one wal group per namespace, and you could > replicate only the data for the business in need. > > > > Regarding compaction IO, as I mentioned before, we could use tiered > storage to prevent compaction to affect wal sync. This way we've observed > an obvious improvement on the avg mutate RT, from 0.5ms to 0.3ms on our > online cluster, FYI. > > > > Best Regards, > > Yu > > > >> On 19 September 2015 at 00:55, Vladimir Rodionov < > vladrodionov@gmail.com> wrote: > >> Hi, Jingcheng > >> > >> You postpone compaction until your test completes by setting number of > >> blocking stores to 120. That is kind of cheating :) > >> As I said previously, in a long run, compaction rules the world - not > >> number of wal files. In a real production setting, the existing write > >> performance > >> is more than adequate (avg load per RS usually less than 1MB/sec). > Multiwal > >> has probably its value if someone need to load quick large volume of > data, > >> but ... why do not use bulk load instead? > >> > >> Thank for letting us know that beefy servers with 8 SSDs can sustain > such a > >> huge load. > >> > >> -Vlad > >> > >> > >> > >> On Thu, Sep 17, 2015 at 10:30 PM, Jingcheng Du > >> wrote: > >> > >> > More information for the test. > >> > I use ycsb 0.3.0 for the test. > >> > The command line is "./ycsb load hbase-10 -P ../workloads/workload > -threads > >> > 200 -p columnfamily=family -p clientbuffering=true -s > workload.dat" > >> > The workload is, the data size is slightly less than 1TB: > >> > fieldcount=5 > >> > fieldlength=200 > >> > recordcount=1000000000 > >> > maxexecutiontime=86400 > >> > > >> > > >> > > >> > -- > >> > View this message in context: > >> > > http://apache-hbase.679495.n3.nabble.com/Multiwal-performance-with-HBase-1-x-tp4074403p4074731.html > >> > Sent from the HBase User mailing list archive at Nabble.com. > >> > > > > --001a113ea628205b800520437b26--