cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "T Jake Luciani (JIRA)" <j...@apache.org>
Subject [jira] [Issue Comment Edited] (CASSANDRA-4208) ColumnFamilyOutputFormat should support writing to multiple column families
Date Wed, 09 May 2012 20:21:48 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-4208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13271761#comment-13271761
] 

T Jake Luciani edited comment on CASSANDRA-4208 at 5/9/12 8:20 PM:
-------------------------------------------------------------------

I'm ok with this now that it works with MultipleOutputs (nice find), though I'm not sure if
it should be in 1.1 since it would break existing scripts.  Would you be able to make it backwards
compatible by adding the old  public static setOutputColumnFamily( public static void setOutputColumnFamily(Configuration
conf, String keyspace, String columnFamily)) back and using the new setColumnFamily() in there?

                
      was (Author: tjake):
    I'm ok with this now that it works with MultipleOutputs (nice find), though I'm not sure
if it should be in 1.1 since it would break existing scripts.  Would you be able to make it
backwards compatible by adding the old constructor back and using the setColumnFamily() in
there?


                  
> ColumnFamilyOutputFormat should support writing to multiple column families
> ---------------------------------------------------------------------------
>
>                 Key: CASSANDRA-4208
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4208
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>    Affects Versions: 1.1.0
>            Reporter: Robbie Strickland
>         Attachments: cassandra-1.1-4208.txt, trunk-4208-v2.txt, trunk-4208.txt
>
>
> It is not currently possible to output records to more than one column family in a single
reducer.  Considering that writing values to Cassandra often involves multiple column families
(i.e. updating your index when you insert a new value), this seems overly restrictive.  I
am submitting a patch that moves the specification of column family from the job configuration
to the write() call in ColumnFamilyRecordWriter.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message