cassandra-user mailing list archives

From Erich Nachbar
Subject Re: Cassandra users survey
Date Fri, 20 Nov 2009 21:48:20 GMT

I'm using Cassandra 0.4.2 at my current client to persist URL graphs
for spam detection.
The crawling and page classification are done in Hadoop/Bixo/Cascading,
which persists the URL classification results into Cassandra.
The incoming production traffic uses Cassandra for the real-time
spam score lookup to determine the spamminess of a URL.
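
A minimal sketch of the write-then-lookup pattern described above, not the
actual production code. A plain Python dict stands in for the Cassandra store,
and every name here (spam_scores, normalize, persist_classification,
lookup_spam_score) is hypothetical, as is the assumption that scores fall
between 0 and 1.

```python
from urllib.parse import urlsplit

# Hypothetical stand-in for the Cassandra store: a plain dict keyed by URL.
spam_scores = {}


def normalize(url):
    """Lower-case the scheme and host so batch and lookup keys agree."""
    parts = urlsplit(url.strip())
    return parts._replace(
        scheme=parts.scheme.lower(), netloc=parts.netloc.lower()
    ).geturl()


def persist_classification(url, score):
    """Batch side: store a score produced by the classification job."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be between 0 and 1")
    spam_scores[normalize(url)] = score


def lookup_spam_score(url, default=0.0):
    """Serving side: a single key lookup; unknown URLs get the default."""
    return spam_scores.get(normalize(url), default)


persist_classification("HTTP://Example.com/buy-pills", 0.97)
print(lookup_spam_score("http://example.com/buy-pills"))
```

The point of the split is that the expensive classification happens offline,
while the serving path is reduced to one lookup by key.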

It started out as a prototype and has now been in production on 4
Cassandra nodes for more than 3 weeks.
Sometimes Cassandra is a little rough around the edges, but in general it works.

My wish list:
- data rebalancing
- proper MapReduce support (ideally supporting the same API HBase
uses, so one could use the same ecosystem)
- node decommissioning

On Fri, Nov 20, 2009 at 1:17 PM, Jonathan Ellis wrote:
> Hi all,
> I'd love to get a better feel for who is using Cassandra and what kind
> of applications it is seeing.  If you are using Cassandra, could you
> share what you're using it for and what stage you are at with it
> (evaluation / testing / production)? Also, what alternatives you
> evaluated/are evaluating would be useful.  Finally, feel free to throw
> in "I'd love to use Cassandra if only it did X" wishes. :)
> I can start: Rackspace is using Cassandra for stats collection
> (testing, almost production) and as a backend for the Mail & Apps
> division (early testing).  We evaluated HBase, Hypertable, dynomite,
> and Voldemort as well.
> Thanks,
> -Jonathan
> (If you're in stealth mode or don't want to say anything in public,
> feel free to reply to me privately and I will keep it off the record.)
