cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jeremiah Jordan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-2527) Add ability to snapshot data as input to hadoop jobs
Date Fri, 15 Nov 2013 15:17:21 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823722#comment-13823722
] 

Jeremiah Jordan commented on CASSANDRA-2527:
--------------------------------------------

Now that we have the cql hadoop input format, maybe the better way to do this would be to
add "USING SNAPSHOT xyz" to CQL, and let all selects be able to run against a snapshot.

> Add ability to snapshot data as input to hadoop jobs
> ----------------------------------------------------
>
>                 Key: CASSANDRA-2527
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2527
>             Project: Cassandra
>          Issue Type: New Feature
>            Reporter: Jeremy Hanna
>            Assignee: Brandon Williams
>              Labels: hadoop
>             Fix For: 2.1
>
>
> It is desirable to have immutable inputs to hadoop jobs for the duration of the job.
 That way re-execution of individual tasks do not alter the output.  One way to accomplish
this would be to snapshot the data that is used as input to a job.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message