Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 57352 invoked from network); 6 Dec 2009 05:35:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 6 Dec 2009 05:35:27 -0000 Received: (qmail 4008 invoked by uid 500); 6 Dec 2009 05:35:26 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 3958 invoked by uid 500); 6 Dec 2009 05:35:25 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 3949 invoked by uid 99); 6 Dec 2009 05:35:25 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Dec 2009 05:35:25 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mrevelle@gmail.com designates 209.85.221.191 as permitted sender) Received: from [209.85.221.191] (HELO mail-qy0-f191.google.com) (209.85.221.191) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 06 Dec 2009 05:35:13 +0000 Received: by qyk29 with SMTP id 29so1468342qyk.32 for ; Sat, 05 Dec 2009 21:34:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:content-type:mime-version :subject:from:in-reply-to:date:content-transfer-encoding:message-id :references:to:x-mailer; bh=d6XvTQCnGndakhnEb9mjJ/rmNNvPK9iCMt1Dm2TquOc=; b=WFct30/c5dKiICUbiybVZORxVs60ZbEyRAFkUlZkOINtYsG6eQ1otA4ziLVwGbX5JR I0vyX4T8kdu4ltYNqlr7rEL0eOJq3FywHXMRmuRaPVCURSUGAvlrzfQmKBUYuJ+jeNHn Ke0W8WNRvCdnUr2bzsiReezs87Wgiuy56FbzI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; b=ec+a0mLW7Gb0Ybd2nXb7FWBLDA1qDaYCFMmnDy7r99ijaz5/ItVnwiR5UUDZb0MNnd BTYgJzQUhf0zqXcy9EaVBt/+syWbaMA+gS6F+EqyUGvva626Vp9Fb+yO3UJpjY7yPxRX akTaaKD2ukSEFSq+HofiCsWbvaLO4dU/Ss6xo= Received: by 10.224.96.207 with SMTP id i15mr2746433qan.179.1260077692913; Sat, 05 Dec 2009 21:34:52 -0800 (PST) Received: from mars.home (pool-71-163-162-204.washdc.fios.verizon.net [71.163.162.204]) by mx.google.com with ESMTPS id 22sm2802235qyk.14.2009.12.05.21.34.50 (version=TLSv1/SSLv3 cipher=RC4-MD5); Sat, 05 Dec 2009 21:34:51 -0800 (PST) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1077) Subject: Re: Cassandra vs HBase From: Matt Revelle In-Reply-To: <9BD6B75A-D0CA-4A3D-8060-29D8E1C6982C@digitalreasoning.com> Date: Sun, 6 Dec 2009 00:34:49 -0500 Content-Transfer-Encoding: quoted-printable Message-Id: <46C52A50-EC11-43B4-BBE4-4BAB2CD76B84@gmail.com> References: <9afa75fe0912051841x745c03bag60c145a745a717ae@mail.gmail.com> <1737C2A7-714A-4E25-B6D7-E8A879D92A2E@gmail.com> <9BD6B75A-D0CA-4A3D-8060-29D8E1C6982C@digitalreasoning.com> To: cassandra-user@incubator.apache.org X-Mailer: Apple Mail (2.1077) X-Virus-Checked: Checked by ClamAV on apache.org Cassandra performance likely still beats HBase, but according to the = "Powered By" page on the HBase wiki it is being used to handle realtime = requests by StumbleUpon, Meetup, and Streamy = (http://wiki.apache.org/hadoop/Hbase/PoweredBy). These two documents contain some performance numbers: http://static.last.fm/johan/nosql-20090611/hbase_nosql.pdf (skip to = page 22) = http://www.slideshare.net/schubertzhang/hbase-0200-performance-evaluation Both Cassandra and HBase are useful tech, I just wanted to point out = that HBase performance has improved over the past year and it can handle = realtime requests. On Dec 5, 2009, at 11:08 PM, Tim Estes wrote: > Can you link/reference those? I haven't seen random read or write = performance numbers published around V0.20 Hbase that are within 5x of = Cassandra. I'm very curious about this... >=20 > Sent from my iPhone >=20 > On Dec 5, 2009, at 11:05 PM, "Matt Revelle" = wrote: >=20 >> On Dec 5, 2009, at 21:45, Joe Stump wrote: >>=20 >>>=20 >>> On Dec 5, 2009, at 7:41 PM, Bill Hastings wrote: >>>=20 >>>> [Is] HBase used for real timish applications and if so any ideas = what the largest deployment is. >>>=20 >>> I don't know of anyone off the top of my head who's using anything = built on top of Hadoop for a real-time environment. Hadoop just wasn't = built for that. It was built, like MapReduce, for crunching absurd = amounts of data across hundreds of nodes in a "reasonable" amount of = time. >>>=20 >>> Just my $0.02. >>>=20 >>> --Joe >>>=20 >>=20 >> While Hadoop MapReduce isn't meant for realtime use, HBase can handle = it. >>=20 >> Over last summer there were some benchmarks included in HBase/Hadoop = presentations that showed, IIRC, performance comparable to Cassandra. >>=20