cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robbie Strickland (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-4208) ColumnFamilyOutputFormat should support writing to multiple column families
Date Tue, 01 May 2012 18:50:49 GMT


Robbie Strickland commented on CASSANDRA-4208:

We could use MultipleOutputs if you think that's better, though the implementation is certainly
less trivial than what I've done here. Upside is of course sticking with the convention. I'm
not really sure it gets us any more than that, and personally I think it adds unnecessary
complexity to an already convoluted API. Passing in a CF at the call level is more intuitive
and will be more familiar to Cassandra users, IMHO. But I'm happy to work on the MultipleOutputs
version if that's the consensus.
> ColumnFamilyOutputFormat should support writing to multiple column families
> ---------------------------------------------------------------------------
>                 Key: CASSANDRA-4208
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>    Affects Versions: 1.1.0
>            Reporter: Robbie Strickland
>         Attachments: trunk-4208.txt
> It is not currently possible to output records to more than one column family in a single
reducer.  Considering that writing values to Cassandra often involves multiple column families
(i.e. updating your index when you insert a new value), this seems overly restrictive.  I
am submitting a patch that moves the specification of column family from the job configuration
to the write() call in ColumnFamilyRecordWriter.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message