From: Ian Holsman
To: cassandra-user@incubator.apache.org
Subject: Re: Cassandra vs HBase
Date: Tue, 8 Dec 2009 17:07:33 +1100
Message-Id: <87D39CDC-D684-4A61-A039-D1BC30D2CC9E@holsman.net>
In-Reply-To: <9972cc9e0912052012o61b82d06h9f4fe25c0d1cd552@mail.gmail.com>

This is slightly off-topic.

There is a recent project called Hadoop Online (HOP) on Google Code that promises an online/continuous query ability on top of Hadoop, which should allow for near-real-time activities instead of the batch stuff that MapReduce does.

---
Sent from my phone
Ian Holsman - 703 879-3128

On 06/12/2009, at 3:12 PM, Joseph Bowman <bowman.joseph@gmail.com> wrote:

When I wrote my Why Cassandra article, I didn't get into why I didn't choose x platform because I didn't want to start a flame war by doing comparisons. For HBase, the primary reason I didn't choose it is that while there were benchmarks of what it could theoretically do, there weren't any real-world deployments proving it. My experience as a systems administrator is that it's best to go with a product that's been proven over time in real-world scenarios.

I'll add to this, though, that nothing NoSQL, even Cassandra, has reached the point where I feel it's a no-brainer to choose it over anything, including SQL-based solutions like MySQL and Oracle. It really comes down to your requirements.

On Sat, Dec 5, 2009 at 11:04 PM, Matt Revelle <mrevelle@gmail.com> wrote:
On Dec 5, 2009, at 21:45, Joe Stump <joe@joestump.net> wrote:


On Dec 5, 2009, at 7:41 PM, Bill Hastings wrote:

[Is] HBase used for real-timish applications, and if so, any ideas what the largest deployment is?

I don't know of anyone off the top of my head who's using anything built on top of Hadoop for a real-time environment. Hadoop just wasn't built for that. It was built, like MapReduce, for crunching absurd amounts of data across hundreds of nodes in a "reasonable" amount of time.

Just my $0.02.

--Joe


While Hadoop MapReduce isn't meant for realtime use, HBase can handle it.

Over the last summer there were some benchmarks included in HBase/Hadoop presentations that showed, IIRC, performance comparable to Cassandra.

