Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 24582 invoked from network); 29 Apr 2010 14:26:06 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 29 Apr 2010 14:26:06 -0000 Received: (qmail 56506 invoked by uid 500); 29 Apr 2010 14:26:06 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 56444 invoked by uid 500); 29 Apr 2010 14:26:05 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 56436 invoked by uid 99); 29 Apr 2010 14:26:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Apr 2010 14:26:05 +0000 X-ASF-Spam-Status: No, hits=2.9 required=10.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [74.125.82.172] (HELO mail-wy0-f172.google.com) (74.125.82.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Apr 2010 14:25:58 +0000 Received: by wyb35 with SMTP id 35so5362016wyb.31 for ; Thu, 29 Apr 2010 07:25:35 -0700 (PDT) Received: by 10.216.89.209 with SMTP id c59mr7046958wef.87.1272551135298; Thu, 29 Apr 2010 07:25:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.186.78 with HTTP; Thu, 29 Apr 2010 07:25:14 -0700 (PDT) From: =?UTF-8?Q?Utku_Can_Top=C3=A7u?= Date: Thu, 29 Apr 2010 16:25:14 +0200 Message-ID: Subject: TimedOutException when using the ColumnFamilyInputFormat To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016e6dab05660c497048560e650 X-Virus-Checked: Checked by ClamAV on apache.org --0016e6dab05660c497048560e650 Content-Type: text/plain; charset=UTF-8 Hey All, I'm trying to run some tests on cassandra an Hadoop integration. I'm basically following the word count example at https://svn.apache.org/repos/asf/cassandra/trunk/contrib/word_count/src/WordCount.javausing the ColumnFamilyInputFormat. Currently I have one-node cassandra and hadoop setup on the same machine. I'm having problems if there are more than one map tasks running on the same node, please find the copy of the error message below. If I limit the map tasks per tasktracker to 1, the MapReduce works fine without anyproblems at all. Do you thinki it's a know issue or am I doing something wrong in implementation. ---------------error---------------- 10/04/29 13:47:37 INFO mapred.JobClient: Task Id : attempt_201004291109_0024_m_000000_1, Status : FAILED java.lang.RuntimeException: TimedOutException() at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.maybeInit(ColumnFamilyRecordReader.java:165) at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.computeNext(ColumnFamilyRecordReader.java:215) at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.computeNext(ColumnFamilyRecordReader.java:97) at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:135) at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:130) at org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyValue(ColumnFamilyRecordReader.java:91) at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:423) at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:621) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305) at org.apache.hadoop.mapred.Child.main(Child.java:170) Caused by: TimedOutException() at org.apache.cassandra.thrift.Cassandra$get_range_slices_result.read(Cassandra.java:11015) at org.apache.cassandra.thrift.Cassandra$Client.recv_get_range_slices(Cassandra.java:623) at org.apache.cassandra.thrift.Cassandra$Client.get_range_slices(Cassandra.java:597) at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.maybeInit(ColumnFamilyRecordReader.java:142) ... 11 more --------------------------------------- Best Regards, Utku --0016e6dab05660c497048560e650 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hey All,

I'm trying to run some tests on cassandra an Hadoop int= egration. I'm basically following the word count example at https://svn.apache.org/repos/asf/cassandra/trunk/contrib/word_= count/src/WordCount.java using the ColumnFamilyInputFormat.

Currently I have one-node cassandra and hadoop setup on the same machin= e.

I'm having problems if there are more than one map tasks runn= ing on the same node, please find the copy of the error message below.

If I limit the map tasks per tasktracker to 1, the MapReduce works fine= without anyproblems at all.

Do you thinki it's a know issue or = am I doing something wrong in implementation.

---------------error--= --------------
10/04/29 13:47:37 INFO mapred.JobClient: Task Id : attempt_201004291109_002= 4_m_000000_1, Status : FAILED
java.lang.RuntimeException: TimedOutExcept= ion()
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.hadoop.ColumnFamilyReco= rdReader$RowIterator.maybeInit(ColumnFamilyRecordReader.java:165)
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$= RowIterator.computeNext(ColumnFamilyRecordReader.java:215)
=C2=A0=C2=A0= =C2=A0 at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.= computeNext(ColumnFamilyRecordReader.java:97)
=C2=A0=C2=A0=C2=A0 at com.google.common.collect.AbstractIterator.tryToCompu= teNext(AbstractIterator.java:135)
=C2=A0=C2=A0=C2=A0 at com.google.commo= n.collect.AbstractIterator.hasNext(AbstractIterator.java:130)
=C2=A0=C2= =A0=C2=A0 at org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyVa= lue(ColumnFamilyRecordReader.java:91)
=C2=A0=C2=A0=C2=A0 at org.apache.hadoop.mapred.MapTask$NewTrackingRecordRea= der.nextKeyValue(MapTask.java:423)
=C2=A0=C2=A0=C2=A0 at org.apache.hado= op.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
=C2=A0=C2=A0=C2= =A0 at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143)
=C2=A0=C2=A0=C2=A0 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask= .java:621)
=C2=A0=C2=A0=C2=A0 at org.apache.hadoop.mapred.MapTask.run(Ma= pTask.java:305)
=C2=A0=C2=A0=C2=A0 at org.apache.hadoop.mapred.Child.mai= n(Child.java:170)
Caused by: TimedOutException()
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.thrift.Cassandra$get_range_slice= s_result.read(Cassandra.java:11015)
=C2=A0=C2=A0=C2=A0 at org.apache.cas= sandra.thrift.Cassandra$Client.recv_get_range_slices(Cassandra.java:623)=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.thrift.Cassandra$Client.get_ran= ge_slices(Cassandra.java:597)
=C2=A0=C2=A0=C2=A0 at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$= RowIterator.maybeInit(ColumnFamilyRecordReader.java:142)
=C2=A0=C2=A0=C2= =A0 ... 11 more
---------------------------------------


Best = Regards,
Utku
--0016e6dab05660c497048560e650--