Date: Fri, 29 Apr 2016 13:12:12 +0800
Subject: Re: [DISCUSS] Make AsyncFSWAL the default WAL in 2.0
From: 张铎
To: dev@hbase.apache.org
Reply-To: dev@hbase.apache.org

2016-04-29 11:47 GMT+08:00 Ted Yu:

> Last comment on HDFS-916 was from 2010.
>
> Suggest making a new issue or reviving discussion on HDFS-916 (currently
> assigned to Todd).
>
> bq. The fallback implementation is not aimed at good performance
>
> For more than two weeks, I have been working with Azure Data Lake
> developers so that all hbase system tests pass on ADLS - there were subtle
> differences between ADLS and hdfs.
>
> If switching to AsyncWAL gives either WASB or ADLS subpar performance, it
> would make upgrading to hbase 2.x unacceptable for their users.
>
You can still use FSHLog; it has not been removed...

But yes, this is a good point on how we choose default configs in HBase.
Should it be a config that performs normally in every case, or a config
that performs much better under the main scenario but worse in other
scenarios...

>
> On Thu, Apr 28, 2016 at 8:39 PM, 张铎 wrote:
>
> > 2016-04-29 11:35 GMT+08:00 Ted Yu:
> >
> > > bq. AsyncFSOutput will be in HDFS-3.0
> > >
> > > Is there an HDFS JIRA for the above? Can you share the number?
> > >
> > I have not filed a new one, but there are a bunch of related issues
> > already, such as this one:
> > https://issues.apache.org/jira/browse/HDFS-916
> >
> > > bq. Just wrap FSDataOutputStream to make it act like an asynchronous
> > > output
> > >
> > > Can you be a bit more specific?
> > > HBase currently works with WASB and Azure Data Lake. Does the above
> > > mean their performance would suffer?
> > >
> > Yes, the performance will suffer...
> > The fallback implementation is not aimed at good performance, just at
> > compatibility with any FileSystem implementation.
> >
> > > On Thu, Apr 28, 2016 at 8:30 PM, 张铎 wrote:
> > >
> > > > Inline comments.
> > > > Thanks,
> > > >
> > > > 2016-04-29 10:57 GMT+08:00 Sean Busbey:
> > > >
> > > > > I am nervous about having default out-of-the-box new HBase users
> > > > > reliant on a bespoke HDFS client, especially given Hadoop's
> > > > > compatibility promises and history. Answers for these questions
> > > > > would make me more confident:
> > > > >
> > > > > 1) Where are we on getting the client-side changes to HDFS
> > > > > pushed back upstream?
> > > > >
> > > > No progress yet... Here I want to tell a good story that HBase
> > > > already uses it as the default :)
> > > >
> > > > > 2) How well do we detect when our FS is not HDFS and what does
> > > > > fallback look like?
> > > > >
> > > > Just wrap FSDataOutputStream to make it act like an asynchronous
> > > > output (call hflush in a separate thread). I think the
> > > > performance is not good.
> > > >
> > > > > 3) Will this mean altering the versions of Hadoop we label as
> > > > > supported for HBase 2.y+?
> > > > >
> > > > I have tested with hadoop versions from 2.4.x to 2.7.x, so I
> > > > don't think we need to change the supported versions?
> > > >
> > > > > 4) How are we going to ensure our client remains compatible
> > > > > with newer Hadoop releases?
> > > > >
> > > > We cannot ensure that; HDFS always breaks HBase at a new
> > > > release...
> > > > I need to test AsyncFSWAL on every new 2.x release and make it
> > > > compatible with that version. And back to #1, I think we should
> > > > make sure that the AsyncFSOutput will be in HDFS-3.0. And in
> > > > HBase-3.0, we can introduce a new 'AsyncFSWAL' that uses the
> > > > AsyncFSOutput in HDFS.
> > > >
> > > > > On Thu, Apr 28, 2016 at 9:42 PM, Duo Zhang wrote:
> > > > > > Six months after I filed HBASE-14790...
> > > > > >
> > > > > > Now the AsyncFSWAL is ready. The WALPE result shows that it
> > > > > > is *1.4x~3.7x* faster than FSHLog. The ITBLL result shows
> > > > > > that it is *not worse* than FSHLog (the master branch is not
> > > > > > that stable itself...).
> > > > > >
> > > > > > More details can be found on HBASE-15536.
> > > > > >
> > > > > > So here we propose to change the default WAL from FSHLog to
> > > > > > AsyncFSWAL. Suggestions are welcome.
> > > > > >
> > > > > > Thanks.
> > > > >
> > > > > --
> > > > > busbey
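
The fallback described in the thread (wrap a synchronous stream and call
hflush in a separate thread) can be sketched roughly as follows. This is only
an illustration under stated assumptions, not the actual HBase code:
SyncOutput and AsyncOutputWrapper are hypothetical names, and a small
interface stands in for FSDataOutputStream so the example needs nothing
beyond the JDK.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

/** Hypothetical stand-in for the blocking calls of a stream such as FSDataOutputStream. */
interface SyncOutput {
    void write(byte[] data) throws IOException;
    void hflush() throws IOException;
}

/** Hypothetical wrapper: buffers writes in memory and flushes them on a separate thread. */
final class AsyncOutputWrapper implements AutoCloseable {
    private final SyncOutput out;
    // A single flusher thread keeps flushes in submission order.
    private final ExecutorService flusher = Executors.newSingleThreadExecutor();
    private ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    private long bufferedTotal = 0;

    AsyncOutputWrapper(SyncOutput out) {
        this.out = out;
    }

    /** Buffers data; nothing reaches the underlying stream until flush() is called. */
    synchronized void write(byte[] data) {
        buffer.write(data, 0, data.length);
        bufferedTotal += data.length;
    }

    /**
     * Hands the buffered bytes to the flusher thread and returns immediately.
     * The future completes with the total number of bytes covered by this flush.
     */
    synchronized CompletableFuture<Long> flush() {
        final byte[] pending = buffer.toByteArray();
        buffer = new ByteArrayOutputStream();
        final long lengthAfterFlush = bufferedTotal;
        final CompletableFuture<Long> result = new CompletableFuture<>();
        flusher.execute(() -> {
            try {
                out.write(pending);  // blocking write on the flusher thread
                out.hflush();        // blocking flush on the flusher thread
                result.complete(lengthAfterFlush);
            } catch (IOException | RuntimeException e) {
                result.completeExceptionally(e);
            }
        });
        return result;
    }

    @Override
    public void close() {
        flusher.shutdown();
    }
}
```

Because every flush still goes through the blocking write and hflush calls,
just on another thread, a wrapper like this gives the caller an asynchronous
interface without making the underlying I/O any faster, which is consistent
with the remark above that the fallback is for compatibility rather than
performance.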