hbase-user mailing list archives

From Schubert Zhang <zson...@gmail.com>
Subject Re: Cassandra vs HBase
Date Fri, 04 Sep 2009 06:25:51 GMT


On Thu, Sep 3, 2009 at 3:15 PM, stack <stack@duboce.net> wrote:

> When does data expire?  Or, how many billions of rows should your cluster
> of 10-20 nodes carry at a time?

About 3 months.
Hundreds of billions of rows.

> Will the many rows of same timestamp all go into the one timestamp row or
> will the key have a further qualifier such as event type to distinguish
> amongst the updates that arrive at the same timestamp?

There will be many rows with the same timestamp.
We could distinguish rows using other info; if there is no such info in the raw
data, we may need to generate a unique id during the preprocessing phase.
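
As a rough illustration of the idea above (a timestamp plus a unique id generated during preprocessing), here is a minimal Python sketch. The key layout (two big-endian unsigned 64-bit integers), the function names, and the in-process counter are all assumptions for illustration, not the scheme actually used. A single counter like this is only unique within one process; several preprocessing workers would need some other id source.

```python
import itertools
import struct

# Hypothetical per-process counter standing in for the id that the
# preprocessing phase would assign when the raw data has no other
# distinguishing field.
_sequence = itertools.count()


def make_row_key(timestamp_ms: int, unique_id: int) -> bytes:
    """Pack timestamp and id as two big-endian unsigned 64-bit integers.

    Big-endian packing means byte-wise comparison of keys matches
    numeric comparison of (timestamp_ms, unique_id).
    """
    return struct.pack(">QQ", timestamp_ms, unique_id)


def next_row_key(timestamp_ms: int) -> bytes:
    """Build a key for a new row, drawing the id from the counter."""
    return make_row_key(timestamp_ms, next(_sequence))
```

Two rows arriving at the same timestamp get different keys, and keys still sort by timestamp first, then by id.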

> What do you see as the approximate write rate and what do you think its
> spread across timestamps will be?  E.g. 10000 updates a second and all of
> the updates fit within a ten second window?

On average, about 40000 new rows per second.
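
A quick back-of-the-envelope check shows how this write rate lines up with the retention figure given earlier. The sketch below assumes "about 3 months" means 90 days and that rows are spread evenly across nodes; both are simplifying assumptions, and the function name is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def rows_retained(rows_per_second: int, retention_days: int) -> int:
    """Rows held at steady state for a constant write rate and retention."""
    return rows_per_second * SECONDS_PER_DAY * retention_days


# 40000 rows/s from the message; 90 days is an assumed reading of "3 months".
total = rows_retained(40_000, 90)
print(total)  # 311040000000

# Even spread across the 10-20 node cluster mentioned in the thread.
for nodes in (10, 20):
    print(nodes, total // nodes)
# 10 31104000000
# 20 15552000000
```

That is roughly 311 billion rows in total, consistent with "hundreds of billions", or about 15 to 31 billion rows per node.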

> Sorry for all the questions.
> St.Ack
