Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 62844 invoked from network); 6 Dec 2009 05:50:05 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Dec 2009 05:50:05 -0000 Received: (qmail 11557 invoked by uid 500); 6 Dec 2009 05:50:05 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 11518 invoked by uid 500); 6 Dec 2009 05:50:04 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 11509 invoked by uid 99); 6 Dec 2009 05:50:04 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Dec 2009 05:50:04 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [69.25.50.200] (HELO outbound.mse8.exchange.ms) (69.25.50.200) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Dec 2009 05:49:53 +0000 Received: from 10.0.25.173 ([10.0.25.173]) by ms15.mse8.exchange.ms ([10.0.25.15]) with Microsoft Exchange Server HTTP-DAV ; Sun, 6 Dec 2009 05:49:31 +0000 Message-ID: <7C39B5C7-38A3-4379-BB71-00453D89C0CF@digitalreasoning.com> From: "Tim Estes" To: thread-topic: Cassandra vs HBase thread-index: Acp2N+FLjKzc60wXSN2/O13pEioX9A== In-Reply-To: <46C52A50-EC11-43B4-BBE4-4BAB2CD76B84@gmail.com> Content-Type: text/plain; format=flowed; delsp=yes; charset="us-ascii" Content-Transfer-Encoding: 7bit MIME-Version: 1.0 (iPhone Mail 7D11) Subject: Re: Cassandra vs HBase Date: Sun, 6 Dec 2009 00:49:21 -0500 References: <9afa75fe0912051841x745c03bag60c145a745a717ae@mail.gmail.com> <1737C2A7-714A-4E25-B6D7-E8A879D92A2E@gmail.com> <9BD6B75A-D0CA-4A3D-8060-29D8E1C6982C@digitalreasoning.com> <46C52A50-EC11-43B4-BBE4-4BAB2CD76B84@gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org Thanks. That is interesting and what I was looking for. I knew V.20 was closing the gap. Probably good to compare with V0.5B1 on the Cassandra side. I'd think that fast multi-get and batch insert/ update would be interesting to compare and benchmark. I know we are taxing Cassandra now and working on some auxillary means (outside if Thrift) to see what the per node limits really are... Sent from my iPhone On Dec 6, 2009, at 12:35 AM, "Matt Revelle" wrote: > Cassandra performance likely still beats HBase, but according to the > "Powered By" page on the HBase wiki it is being used to handle > realtime requests by StumbleUpon, Meetup, and Streamy (http://wiki.apache.org/hadoop/Hbase/PoweredBy > ). > > These two documents contain some performance numbers: > http://static.last.fm/johan/nosql-20090611/hbase_nosql.pdf (skip to > page 22) > http://www.slideshare.net/schubertzhang/hbase-0200-performance-evaluation > > Both Cassandra and HBase are useful tech, I just wanted to point out > that HBase performance has improved over the past year and it can > handle realtime requests. > > On Dec 5, 2009, at 11:08 PM, Tim Estes wrote: > >> Can you link/reference those? I haven't seen random read or write >> performance numbers published around V0.20 Hbase that are within 5x >> of Cassandra. I'm very curious about this... >> >> Sent from my iPhone >> >> On Dec 5, 2009, at 11:05 PM, "Matt Revelle" >> wrote: >> >>> On Dec 5, 2009, at 21:45, Joe Stump wrote: >>> >>>> >>>> On Dec 5, 2009, at 7:41 PM, Bill Hastings wrote: >>>> >>>>> [Is] HBase used for real timish applications and if so any ideas >>>>> what the largest deployment is. >>>> >>>> I don't know of anyone off the top of my head who's using >>>> anything built on top of Hadoop for a real-time environment. >>>> Hadoop just wasn't built for that. It was built, like MapReduce, >>>> for crunching absurd amounts of data across hundreds of nodes in >>>> a "reasonable" amount of time. >>>> >>>> Just my $0.02. >>>> >>>> --Joe >>>> >>> >>> While Hadoop MapReduce isn't meant for realtime use, HBase can >>> handle it. >>> >>> Over last summer there were some benchmarks included in HBase/ >>> Hadoop presentations that showed, IIRC, performance comparable to >>> Cassandra. >>> >