cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Brandon Williams (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-2905) Add retry logic to ColumnFamilyRecordReader
Date Mon, 18 Jul 2011 21:28:58 GMT


Brandon Williams commented on CASSANDRA-2905:

Since we don't know how many times we should retry, or how long to wait between retries, we
should expose these tunables to the job configuration.

> Add retry logic to ColumnFamilyRecordReader
> -------------------------------------------
>                 Key: CASSANDRA-2905
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jeremy Hanna
>            Assignee: Jeremy Hanna
>            Priority: Minor
>              Labels: hadoop
> One thing that would improve the built-in ColumnFamilyRecordReader is some retry logic
if it times out on hasNext.  It could help in addition to setting the rpc_timeout_in_ms, so
that timeouts happen less frequently so there are fewer blacklisted task trackers (which are
the result of an error, including the timeout).
> {quote}
> java.lang.RuntimeException: TimedOutException() at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.maybeInit(
at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.computeNext(
at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.computeNext(
at at org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyValue(
at org.apache.cassandra.hadoop.pig.CassandraStorage.getNext(Unknown Source) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader.nextKeyValue(
at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(
at org.apache.hadoop.mapreduce.MapContext.nextKeyValue( at
at org.apache.hadoop.mapred.MapTask.runNewMapper( at
at org.apache.hadoop.mapred.Child$ at
Method) at at
at org.apache.hadoop.mapred.Child.main( Caused by: TimedOutException() at org.apache.cassandra.thrift.Cassandra$
at org.apache.cassandra.thrift.Cassandra$Client.recv_get_range_slices(
at org.apache.cassandra.thrift.Cassandra$Client.get_range_slices( at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.maybeInit(
... 17 more
> {quote}

This message is automatically generated by JIRA.
For more information on JIRA, see:


View raw message