cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jonathan Ellis <>
Subject Re: The Difference Between Cassandra and HBase
Date Sun, 25 Apr 2010 04:09:02 GMT
On Sat, Apr 24, 2010 at 9:29 PM, dir dir <> wrote:
> I have already read Jonathan Ellis's Blog today
> (
> in this blog, Jonathan tried to explain the difference between Cassandra and
> Hbase.
> But I have several questions. In this blog Jonathan said:
> 1. Hbase Follows the bigtable model, so it's more complicated than it needs
> to be.
>     (300+kloc vs 50 for Cassandra; many more components). This means it's
> that
>     much harder for me to troubleshoot.
> But in same blog Jonathan also said that Cassandra is a sort of hybrid
> between dynamo
> and big table. Therefore I can conclude Cassandra also follows the Big Table
> model.

No, I said Cassandra provides a data model similar to the one
described in BigTable, i.e. ColumnFamilies with sparse rows.  The
architecture of Cassandra (a fully distributed cluster with each node
managing local storage) is completely different from BigTable (lock
server + master server + tablet servers on top of GFS).

> I also do not understand what is the meaning of  "so it's more complicated
> than it needs to be"??

Read the BigTable paper, then read the Dynamo paper, and this will
make more sense.

> To run Cassandra, should I check out the code from
> ( or as an User,
> I just enough download the binary software of Cassandra 0.6.1  from

You should be using 0.6.1.

View raw message