cassandra-commits mailing list archives

From "Samarth Gahire (Commented) (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-3859) Add Progress Reporting to Cassandra OutputFormats
Date Mon, 27 Feb 2012 11:35:49 GMT


Samarth Gahire commented on CASSANDRA-3859:

I have tested the patch on CDH2, the latest CDH3, and hadoop-0.20.203, with the task
timeout set to 10 seconds. Since you confirmed that progress is reported every second, the
job should not time out with a 10-second timeout. But it still times out on all of the
versions mentioned above.
Please test and let me know if I am missing something.
> Add Progress Reporting to Cassandra OutputFormats
> -------------------------------------------------
>                 Key: CASSANDRA-3859
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop, Tools
>    Affects Versions: 1.1.0
>            Reporter: Samarth Gahire
>            Assignee: Brandon Williams
>            Priority: Minor
>              Labels: bulkloader, hadoop, mapreduce, sstableloader
>             Fix For: 1.1.0
>         Attachments: 0001-add-progress-reporting-to-BOF.txt, 0002-Add-progress-to-CFOF.txt
>   Original Estimate: 48h
>  Remaining Estimate: 48h
> When we use the BulkOutputFormat to load data into Cassandra, we should report progress
to the Hadoop job from within the sstable loader. If streaming for a particular task takes
a long time and no progress is reported to the job, Hadoop may kill the task with a
timeout exception.
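
The idea in the description can be sketched roughly as follows. This is only an illustration, not the code in the attached patches: the class name ProgressWait, the method waitWithProgress, and the nested Progressable interface are hypothetical stand-ins (the interface mirrors the single progress() method of Hadoop's org.apache.hadoop.util.Progressable so that the sketch runs without Hadoop on the classpath). It waits on a long-running task in short intervals and reports progress each time an interval elapses without a result.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicInteger;

class ProgressWait {
    // Hypothetical stand-in for Hadoop's Progressable (one method: progress()).
    interface Progressable {
        void progress();
    }

    // Block until the future completes, calling reporter.progress() every
    // intervalMs milliseconds so the framework keeps seeing the task as alive.
    static <T> T waitWithProgress(Future<T> future, Progressable reporter, long intervalMs)
            throws InterruptedException, ExecutionException {
        while (true) {
            try {
                return future.get(intervalMs, TimeUnit.MILLISECONDS);
            } catch (TimeoutException e) {
                // Not finished yet: report progress and keep waiting.
                reporter.progress();
            }
        }
    }

    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        AtomicInteger ticks = new AtomicInteger();

        // Simulated slow streaming step (assumption: real code would stream sstables here).
        Future<String> streaming = pool.submit(() -> {
            Thread.sleep(350);
            return "done";
        });

        String result = waitWithProgress(streaming, ticks::incrementAndGet, 100);
        pool.shutdown();
        System.out.println(result + ", progress calls: " + ticks.get());
    }
}
```

In a real job the reporter would be the task context that Hadoop hands to the output format, and the interval would need to stay well below the configured task timeout.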


