cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Ellis (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-5251) Hadoop support should be able to work with multiple column families
Date Fri, 01 Mar 2013 14:55:12 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590587#comment-13590587
] 

Jonathan Ellis commented on CASSANDRA-5251:
-------------------------------------------

If you want to m/r data without random i/o, it needs to be in the same CF, which may not be
ideal for your other uses.  Pick your poison.
                
> Hadoop support should be able to work with multiple column families
> -------------------------------------------------------------------
>
>                 Key: CASSANDRA-5251
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5251
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>    Affects Versions: 1.1.0, 1.1.11, 1.2.0, 2.0
>            Reporter: Illarion Kovalchuk
>            Priority: Minor
>         Attachments: trunk-5251.txt
>
>
> This patch affects api, so I changed hadoop example in it. The main difference is that
now ColumnFamilyInput format generates splits for all input column families, and ColumnFamilyOutputFormat
works not with List<Mutation>, but with List<Pair<String,Mutation>>, where
Pair.left is for column family name.
> Thank you

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message