cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Torsten Curdt <>
Subject Re: CassandraBulkLoader
Date Tue, 13 Jul 2010 21:56:52 GMT
> look at contrib/bmt_example, with the caveat that it's usually
> premature optimization

I wish that was true for us :)

>> Fact: It has always been straightforward to send the output of Hadoop jobs
>> to Cassandra, and Facebook, Digg, and others have been using Hadoop like
>> this as a Cassandra bulk-loader for over a year.

That we've done as well. With a custom OutputFormat.

>> Does anyone from Facebook or Digg share details on how to use Cassandra
>> BulkLoader?

You just use the StorageProxy and create the RowMutations.

>> I could see some details from Arin's presentation on Cassandra @ Digg about
>> data load from MySQL -> Hadoop -> Cassandra.

Maybe you should just try with a simple bulk load first?

>> Can someone please help me?

You need to tell us how :)


View raw message