cassandra-user mailing list archives

From matthew hawthorne
Subject Re: Cassandra users survey
Date Wed, 25 Nov 2009 02:17:03 GMT
I work for Comcast, and we have tons of data that we are migrating
into non-relational storage.

we recently evaluated cassandra, riak, voldemort, and hdfs.  I focused
on cassandra, which is why you may have seen me asking dumb questions
on IRC :-)

A few desirables for cassandra:

1) I'm not a huge fan of thrift.  it would be nice if the client jar
came packaged with cassandra  (I had to manually build it from the
thrift-generated java).

also, the lack of streaming support is troubling.  a lot of our
internal services are http, and I'd like to be able to connect a
column's input stream to the output stream of an http response,
instead of loading it all into memory.

2) a practical/situational view of managing a cassandra cluster
("deployment guide", maybe) would be nice.  for my evaluation, I was
seeking answers to questions like:

- how do I add capacity?

- how do I remove capacity? (I believe you're calling it "decommissioning")

- what files should I back up?

- how can I mitigate the risk of lost writes during a power failure?

- how can I ensure that my writes go to multiple data centers?

I think overall the docs are good (I found answers to most of my
questions), but since a lot of groups are evaluating cassandra in this
fashion and need to make a sales pitch to management, ops, etc.,
it would be nice to have a more comprehensive deployment guide.
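
to illustrate the streaming wish in item 1: the sketch below is plain
java.io and does not use any cassandra or thrift API. the class and
method names (StreamCopy, copy) are made up for illustration, and the
"column input stream" is only an assumption, since the thrift client
hands back whole values. it just shows the buffered copy I'd like to
be able to do, where memory use is bounded by the buffer size rather
than the size of the column value.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

public class StreamCopy {

    /**
     * Copies everything from in to out through a fixed-size buffer,
     * so at most bufferSize bytes are held in memory at a time.
     * Returns the number of bytes copied.
     */
    static long copy(InputStream in, OutputStream out, int bufferSize) throws IOException {
        if (bufferSize <= 0) {
            throw new IllegalArgumentException("bufferSize must be positive");
        }
        byte[] buffer = new byte[bufferSize];
        long total = 0;
        int n;
        // read() returns -1 at end of stream
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
            total += n;
        }
        out.flush();
        return total;
    }

    public static void main(String[] args) throws IOException {
        // Stand-ins: in a real service, "in" would be the (hypothetical)
        // column stream and "out" the http response's output stream.
        InputStream in = new ByteArrayInputStream(new byte[10000]);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        long copied = copy(in, out, 4096);
        System.out.println("copied " + copied + " bytes");
    }
}
```

in a servlet, the second argument would be the response's output
stream; the first is the part that doesn't exist today.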

you fellows at Rackspace should consider offering Cassandra support.
I know that the ability to have some paid professionals come in and
train our ops team on how to monitor + manage a cassandra cluster
would have made a huge difference for us.



On Fri, Nov 20, 2009 at 4:17 PM, Jonathan Ellis wrote:
> Hi all,
> I'd love to get a better feel for who is using Cassandra and what kind
> of applications it is seeing.  If you are using Cassandra, could you
> share what you're using it for and what stage you are at with it
> (evaluation / testing / production)? Also, what alternatives you
> evaluated/are evaluating would be useful.  Finally, feel free to throw
> in "I'd love to use Cassandra if only it did X" wishes. :)
> I can start: Rackspace is using Cassandra for stats collection
> (testing, almost production) and as a backend for the Mail & Apps
> division (early testing).  We evaluated HBase, Hypertable, dynomite,
> and Voldemort as well.
> Thanks,
> -Jonathan
> (If you're in stealth mode or don't want to say anything in public,
> feel free to reply to me privately and I will keep it off the record.)
