Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 5843E200BD4 for ; Thu, 1 Dec 2016 17:04:33 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 56E1A160B0B; Thu, 1 Dec 2016 16:04:33 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 7AA64160B05 for ; Thu, 1 Dec 2016 17:04:32 +0100 (CET) Received: (qmail 81209 invoked by uid 500); 1 Dec 2016 16:04:31 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 81191 invoked by uid 99); 1 Dec 2016 16:04:30 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 01 Dec 2016 16:04:30 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 740EB1A9CC1 for ; Thu, 1 Dec 2016 16:04:30 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.08 X-Spam-Level: *** X-Spam-Status: No, score=3.08 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, KAM_LINEPADDING=1.2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id cR04KcEiKg-a for ; Thu, 1 Dec 2016 16:04:28 +0000 (UTC) Received: from mail-yw0-f169.google.com (mail-yw0-f169.google.com [209.85.161.169]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 2F42A5F1F0 for ; Thu, 1 Dec 2016 16:04:28 +0000 (UTC) Received: by mail-yw0-f169.google.com with SMTP id a10so194213375ywa.3 for ; Thu, 01 Dec 2016 08:04:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=kS2QiRN1IbvgRO+fNoYW4qG5RsrOKR0pQLryNYhdRNE=; b=cdbR9feG5GLidXU+YH1JEdFq/34ouf3h7oowPqQUnIqlRBL1f8qgD2TFZ8VGufu3WK 0qBtd3lhUOaT9hhXwux+xQwVVz6Wcs0HFdVy5HgctR9dYaoOci7ZTkNlYF4r9Fg0gdjV RQ9OJjZyEE4qdqG5vbp1kP3IHgUo8zbUF609Yej5Po5QL48A0ew51kuiSrurhTr5kdCM SU+Pz+jTWVInAHWihzd/sHKut2Oy5OzcHjDxaUwsq6UnrQ0zUnpY9tfET3sALSs0FSt7 8BHUFNR+B83FnPuAD0HzVJsGpzcK7luXz6J8CZS8632DO3+o4+hLmGUpElorxvFbNvWi Oh9w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=kS2QiRN1IbvgRO+fNoYW4qG5RsrOKR0pQLryNYhdRNE=; b=PQus676XMNxK0FLYHnLFygwuClB+/kREeEYgr6YqVabA5HPWHn9OH3s7SU0B1B7gu5 nPd5t4AblspZL5XcfwidPE9XRIB5ZsdG6ltUwQOv4+YJG4ZLMTIS7QUAq4o/LNtNjt9E 8+0jA39K0OXK5hQN8gAVzu68lZ7qHzvAMmGCSW99XKOErd3a2WyMbRKAqj8Z9ywaFuTl ikWPdHxrMhQbNyodyz9Nu7tGhcATARpKAPU1Y22LUV4FwY9AvfuRJIljOl88qS5ECnB1 jT1b0mFZ8bCXM3ci1s1rMVsOO7r8NCtnnevSiy4IlQXKuuFkJQBLdOiuj4GaswJIK/jr sf7Q== X-Gm-Message-State: AKaTC03Q6THVROM15yrGmig5sGQmOBHg/hFN4eJL+MIPqdOp5ytJ3gSXGYK/wnapYFIYHHfkQ5JcMmiEB+GQeA== X-Received: by 10.129.86.5 with SMTP id k5mr41750393ywb.24.1480608261955; Thu, 01 Dec 2016 08:04:21 -0800 (PST) MIME-Version: 1.0 Received: by 10.37.57.148 with HTTP; Thu, 1 Dec 2016 08:04:21 -0800 (PST) Received: by 10.37.57.148 with HTTP; Thu, 1 Dec 2016 08:04:21 -0800 (PST) In-Reply-To: <664629431.3697231.1480583398056@mail.yahoo.com> References: <1954769023.3376965.1480540502507@mail.yahoo.com> <664629431.3697231.1480583398056@mail.yahoo.com> From: Neelesh Date: Thu, 1 Dec 2016 08:04:21 -0800 Message-ID: Subject: Re: Hbase on HDFS versus Cassandra To: Ted Yu Cc: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a1143198481b4b605429af98f archived-at: Thu, 01 Dec 2016 16:04:33 -0000 --001a1143198481b4b605429af98f Content-Type: text/plain; charset=UTF-8 Yes, PHOENIX-1718 has about the same information On Dec 1, 2016 1:12 AM, "Ted Yu" wrote: > w.r.t. "Unable to find cached index metadata" error, have you seen this ? > > http://search-hadoop.com/m/Phoenix/9UY0h2YBSOhgbflB1? > subj=Re+Global+Secondary+Index+ERROR+2008+INT10+Unable+ > to+find+cached+index+metadata+PHOENIX+1718+ > > Cheers > > > On Wednesday, November 30, 2016 11:59 PM, Neelesh > wrote: > > > Ted, > we use HDP 2.3.4 (HBase 1.1.2, phoenix 4.4 - but with a lot of backports > from later versions) > > The key of the data table is but it really is a right padded long due to historic data> long> > > The two global indexes are long> and long> > Around 100B rows in the main table. The main issues we see are > > # Sudden spikes in queueSize - going all the way to 1G limit and staying > there, without any correlated client traffic > # Boatloads of these errors 2016-11-30 11:28:54,907 INFO > [RW.default.writeRpcServer.handler=43,queue=9,port=16020] > util.IndexManagementUtil: Rethrowing org.apache.hadoop.hbase.DoNotRetryIOException: > ERROR 2008 (INT10): ERROR 2008 (INT10): Unable to find cached index > metadata. key=120521194876100862 region=. Index update failed > > We have cross datacenter WAL replication enabled. > We saw PHOENIX-1718, and changed all recommended timeouts to 1 hour. Our > HBase version has HBase-11705. We also discovered that the queuesize is > global (across general/replication/priority queues) and if it reaches the > 1GB limit, calls to all queues will drop. That was interesting because even > though the replication handlers have a different queue, the size is > counted globally, affecting others. Please correct me on this. I hope I'm > wrong on this one :) > > Our challenge has been to understand what's HBase doing under various > scenarios. We monitor call queue lengths, sizes and latencies as the > primary alerting mechanism to tell us something is going on with HBase. > > Thanks! > -neelesh > > On Wed, Nov 30, 2016 at 1:15 PM, Ted Yu wrote: > > Neelesh:Can you share more details about the sluggish cluster performance > (such as version of hbase / phoenix, your schema, region server log > snippet, stack traces, etc) ? > As hbase / phoenix evolve, I hope the performance keeps getting better for > your use case. > Cheers > > On Wednesday, November 30, 2016 10:07 AM, Neelesh > wrote: > > > We use both, in different capacities. Cassandra is an x-DC archive store > with mostly batch writes and occasional key based reads. Hbase is for > real-time event ingestion. Our experience so far on hbase + phoenix is that > when it works, it is fast and scales like crazy. But if you ever hit a snag > around data patterns, you will have a VERY hard time figuring out what's > going on. A combination of global phoenix indexes and heavy writes leave an > entire cluster sluggish, if there is a hint of hotspotting. > > On the other hand, we had a big struggle getting Cassandra when a node > recovery was in progress. What with twice the amount of disk requirements > during recovery etc. Other than that, it is quiet. > But the access patterns are not the same. > > I think the old rule still stays. If you are already on hadoop , or > interested in using/analysing data in several different ways, go with hbase > . If you just need a big data store with a few predefined query patterns, > Cassandra is good > > Of course, I'm biased towards HBase. > > On Nov 30, 2016 7:02 AM, "Mich Talebzadeh" > wrote: > > > Hi Guys, > > > > Used Hbase on HDFS reasonably well. Happy to to stick with it and more > with > > Hive/Phoenix views and Phoenix indexes where I can. > > > > I have a bunch of users now vocal about the use case for Cassandra and > > whether it can do a better job than Hbase. > > > > Unfortunately I am no expert on Cassandra. However, some use case fit > would > > be very valuable. > > > > Thanks > > > > Dr Mich Talebzadeh > > > > > > > > LinkedIn * https://www.linkedin.com/ profile/view?id= > > > AAEAAAAWh2gBxianrbJd6zP6AcPCCd OABUrV8Pw > > AAEAAAAWh2gBxianrbJd6zP6AcPCCd > > > OABUrV8Pw>* > > > > > > > > http://talebzadehmich. wordpress.com > > > > > > > *Disclaimer:* Use it at your own risk. Any and all responsibility for any > > loss, damage or destruction of data or any other property which may arise > > from relying on this email's technical content is explicitly disclaimed. > > The author will in no case be liable for any monetary damages arising > from > > such loss, damage or destruction. > > > > > > > > > > --001a1143198481b4b605429af98f--