Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 3576B200BC3 for ; Fri, 18 Nov 2016 18:20:34 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 3415B160B04; Fri, 18 Nov 2016 17:20:34 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 5905E160B03 for ; Fri, 18 Nov 2016 18:20:33 +0100 (CET) Received: (qmail 82573 invoked by uid 500); 18 Nov 2016 17:20:31 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 81810 invoked by uid 99); 18 Nov 2016 17:20:30 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Nov 2016 17:20:30 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 6CB89C083A; Fri, 18 Nov 2016 17:20:30 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.689 X-Spam-Level: * X-Spam-Status: No, score=1.689 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001, T_REMOTE_IMAGE=0.01] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id UpaAzn9cOqcE; Fri, 18 Nov 2016 17:20:28 +0000 (UTC) Received: from mail-qt0-f182.google.com (mail-qt0-f182.google.com [209.85.216.182]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id E14B05F3F4; Fri, 18 Nov 2016 17:20:27 +0000 (UTC) Received: by mail-qt0-f182.google.com with SMTP id w33so160919995qtc.3; Fri, 18 Nov 2016 09:20:27 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=iCbCpsmCk1JrMGtC7WXlGca/WRGsXe1fcIYMcMTVQMY=; b=Ya047+5LcI/oEd5UyXJtzcfVO7hZtV3ztsgOsBruwFWLsxa+9ALdJy9dxd/2JRY2Ho cO0ZVRj0QBdvtly7Xy+z8E5U+V/pi5+gVWEI0Gh811ImT/R20X3EMt4m0MD3oHzHdB47 J2Jvo8usVTj46+gC7XgUW3cpDdwV9cVN6/TggUi6jGkst/vnfnc2OkU7xkeJEB1ark6g clou0D3jIRq3Cpz15cWTe/CtpW25QM1SL4UwVOMB4RnaxnotXcN3ur19cxn3XLwGVffY c9FhkkFuBeA4KCK9tMcJPeibCRcN89bVtsfNjiwnwVGUn6hCFQL/0A3ttUwR5ZIGd8DH Fz3A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=iCbCpsmCk1JrMGtC7WXlGca/WRGsXe1fcIYMcMTVQMY=; b=K9FZif5KBYyLBqqwOzsIhC5nBAmxT2lry2WMqWs4SVi1g9uKfzDADM2ms5YD2kPVZl 3hKDnHx5qHPhrmgEOBEHBKDcymNoV2i8wW/nb4GJ2LqYVXyeMwBJ44rvc4JcKqqU7ldv 48gi5TAgvcFhttiOl37iQWlK7wqcEN9WuC1pwau07zaMCZ2Lkg1CJrqsmimCZpoDlP/0 /D+TjxBx+ogLVaACEQE3OP69yd6tSesPXMrT3e1leMbfOeGCDlordIdfJ419BVUmsDHX E7jTvr/GyaSzfP2yS0fUcgpM9D/8EdUhqjuzLFvwx6UmBCBmtlnNB8PHravAzMPaIsSD QbVQ== X-Gm-Message-State: AKaTC027Qv+d9u4QX1bQZGTpnM60CEQG9Mepe0lRmFTLi0XfhZtuwR1FGYG1Il3B6/jQNdVDV/f1sROgTbTIoQ== X-Received: by 10.237.61.176 with SMTP id i45mr563445qtf.285.1479489616787; Fri, 18 Nov 2016 09:20:16 -0800 (PST) MIME-Version: 1.0 Received: by 10.237.51.5 with HTTP; Fri, 18 Nov 2016 09:20:16 -0800 (PST) In-Reply-To: References: <943F698A-7CD9-4D65-B083-D2F7ED570A0E@gmail.com> From: Anoop John Date: Fri, 18 Nov 2016 22:50:16 +0530 Message-ID: Subject: Re: Use experience and performance data of offheap from Alibaba online cluster To: "user@hbase.apache.org" Cc: "dev@hbase.apache.org" Content-Type: multipart/alternative; boundary=001a11440aae0ebddb0541968559 archived-at: Fri, 18 Nov 2016 17:20:34 -0000 --001a11440aae0ebddb0541968559 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi Yu Li Good to see that the off heap work help you.. The perf numbers looks great. So this is a compare of on heap L1 cache vs off heap L2 cache(HBASE-11425 enabled). So for 2.0 we should make L2 off heap cache ON by default I believe. Will raise a jira for that we can discuss under that. Seems like L2 off heap cache for data blocks and L1 cache for index blocks seems a right choice. Thanks for the backport and the help in testing the feature.. You were able to find some corner case bugs and helped community to fix them.. Thanks goes to ur whole team. -Anoop- On Fri, Nov 18, 2016 at 10:14 PM, Yu Li wrote: > Sorry guys, let me retry the inline images: > > Performance w/o offheap: > > =E2=80=8B > Performance w/ offheap: > > =E2=80=8B > Peak Get QPS of one single RS during Singles' Day (11/11): > > =E2=80=8B > > And attach the files in case inline still not working: > =E2=80=8B=E2=80=8B=E2=80=8B > Performance_without_offheap.png > > =E2=80=8B=E2=80=8B > Performance_with_offheap.png > > =E2=80=8B=E2=80=8B > Peak_Get_QPS_of_Single_RS.png > > =E2=80=8B > > > Best Regards, > Yu > > On 18 November 2016 at 19:29, Ted Yu wrote: > >> Yu: >> With positive results, more hbase users would be asking for the backport >> of offheap read path patches. >> >> Do you think you or your coworker has the bandwidth to publish backport >> for branch-1 ? >> >> Thanks >> >> > On Nov 18, 2016, at 12:11 AM, Yu Li wrote: >> > >> > Dear all, >> > >> > We have backported read path offheap (HBASE-11425) to our customized >> hbase-1.1.2 (thanks @Anoop for the help/support) and run it online for m= ore >> than a month, and would like to share our experience, for what it's wort= h >> (smile). >> > >> > Generally speaking, we gained a better and more stable >> throughput/performance with offheap, and below are some details: >> > 1. QPS become more stable with offheap >> > >> > Performance w/o offheap: >> > >> > >> > >> > Performance w/ offheap: >> > >> > >> > >> > These data come from our online A/B test cluster (with 450 physical >> machines, and each with 256G memory + 64 core) with real world workloads= , >> it shows using offheap we could gain a more stable throughput as well as >> better performance >> > >> > Not showing fully online data here because for online we published the >> version with both offheap and NettyRpcServer together, so no standalone >> comparison data for offheap >> > >> > 2. Full GC frequency and cost >> > >> > Average Full GC STW time reduce from 11s to 7s with offheap. >> > >> > 3. Young GC frequency and cost >> > >> > No performance degradation observed with offheap. >> > >> > 4. Peak throughput of one single RS >> > >> > On Singles Day (11/11), peak throughput of one single RS reached 100K, >> among which 90K from Get. Plus internet in/out data we could know the >> average result size of get request is ~1KB >> > >> > >> > >> > Offheap are used on all online machines (more than 1600 nodes) instead >> of LruCache, so the above QPS is gained from offheap bucketcache, along >> with NettyRpcServer(HBASE-15756). >> > >> > Just let us know if any comments. Thanks. >> > >> > Best Regards, >> > Yu >> > >> > >> > >> > >> > >> > >> > >> > > --001a11440aae0ebddb0541968559--