Date: Tue, 1 Sep 2009 15:56:01 -0700
Message-ID: <7c962aed0909011556x41ec3de7o9117ba10d161d3c6@mail.gmail.com>
Subject: Re: Cassandra vs HBase
From: stack
To: hbase-user@hadoop.apache.org

Hey Jonathan:

On Tue, Sep 1, 2009 at 3:12 PM, Jonathan Ellis wrote:

> The big win for Cassandra is that its p2p distribution model -- which
> drives the consistency model -- means there is no single point of
> failure. SPOF can be mitigated by failover but it's really, really
> hard to get all the corner cases right with that approach. Even
> Google with their 3 year head start and huge engineering resources
> still has trouble with that occasionally. (See e.g.
> http://groups.google.com/group/google-appengine/msg/ba95ded980c8c179.)

It's hard to answer the above -- the claim that "no SPOF > failover" because
some corner cases will be missed, as though P2P were without corners -- so
I'll pass on it.

> > + Cassandra does not have a natural sharding notion as there is in
> > HBase -- i.e. HBase Regions -- so hooking Cassandra to MapReduce is
> > awkward.
>
> Actually that's not a big deal -- the token ring is known, so you can
> break up at a coarse granularity there, and each node has a sampling
> of the keys stored on it thanks to the way the sstable indexing works,
> so generating hadoop input regions is pretty easy. Jeff Hodges wrote
> a proof of concept over at
> https://issues.apache.org/jira/browse/CASSANDRA-342.

Thanks. Yeah, I'd read that issue before making the comment. It was my
reading of the issue that provoked my 'awkward' comment.

> > + The Cassandra fellas talk of their app being one ball of code only
> > whereas with HBase there is HDFS, ZooKeeper and then HBase itself
> > (Apparently it has fewer lines of code too).
>
> Opinions may differ, but I still think this is a huge win for
> troubleshooting.

The parenthetical was to poke fun at what, IMO, is a silly gauge for
comparing very different projects.

Go easy,
St.Ack