cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Samarth Gahire (Commented) (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-3859) Add Progress Reporting to Cassandra OutputFormats
Date Tue, 28 Feb 2012 14:01:48 GMT


Samarth Gahire commented on CASSANDRA-3859:

Ok ,So I tested patch for BulkOutputFormat with time out of 30 seconds.
1) First of all I tried it without applying a patch and it was throwing a timeout exception.
2) Then I applied a patch and it worked properly.
It means with patch progress reporting is working but it is not reporting progress after every
second while loading(can you explain this?). Because same patch throws timeout exception for
10 seconds of time out.
> Add Progress Reporting to Cassandra OutputFormats
> -------------------------------------------------
>                 Key: CASSANDRA-3859
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop, Tools
>    Affects Versions: 1.1.0
>            Reporter: Samarth Gahire
>            Assignee: Brandon Williams
>            Priority: Minor
>              Labels: bulkloader, hadoop, mapreduce, sstableloader
>             Fix For: 1.1.0
>         Attachments: 0001-add-progress-reporting-to-BOF.txt, 0002-Add-progress-to-CFOF.txt
>   Original Estimate: 48h
>  Remaining Estimate: 48h
> When we are using the BulkOutputFormat to load the data to cassandra. We should use the
progress reporting to Hadoop Job within Sstable loader because while loading the data for
particular task if streaming is taking more time and progress is not reported to Job it may
kill the task with timeout exception. 

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:!default.jspa
For more information on JIRA, see:


View raw message