From user-return-9913-apmail-cassandra-user-archive=cassandra.apache.org@cassandra.apache.org Fri Oct 15 00:14:26 2010 Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 54056 invoked from network); 15 Oct 2010 00:14:26 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 15 Oct 2010 00:14:26 -0000 Received: (qmail 61046 invoked by uid 500); 15 Oct 2010 00:14:24 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 60930 invoked by uid 500); 15 Oct 2010 00:14:24 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 60922 invoked by uid 99); 15 Oct 2010 00:14:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Oct 2010 00:14:24 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a46.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 15 Oct 2010 00:14:15 +0000 Received: from homiemail-a46.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a46.g.dreamhost.com (Postfix) with ESMTP id CCA4C3E4062 for ; Thu, 14 Oct 2010 17:13:52 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=to:from :subject:date:message-id:content-type:mime-version:in-reply-to; q=dns; s=thelastpickle.com; b=GeJe2W4J5ucTz3SEiAM5IIBV6TclyDjWn Bm59cjkyvIBqBin8hPIydrKDramBD4cDiV8NZkWov4yc0DJ0L/GHfFMZAUWCsMUx dCQbSqgVcC53By2K/GT2W0XZd45Shf9juW23z9/iLpIOAchU5VPx9hep/g8Sfdjg wPhLWWb7rc= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h=to :from:subject:date:message-id:content-type:mime-version: in-reply-to; s=thelastpickle.com; bh=G3IfEiYeflEfQyQnetyDOMr9dYM =; b=QtYQfJy5KtxUrMrNiGcSC9PWLDldONJpRgP34agXxqPgj5Jqnl6oj3M9XIM orp5OYLyLm6jWw9PFoeQgWeUkP6EKrhD5eR3wIXL2rgekCxVOau5isn/VftzfN6t qoc0ofJR6wRyDTBJP0blYPdENgHR1UGeyQUQPbgPjfXjCZdU= Received: from localhost (webms.mac.com [17.148.16.118]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a46.g.dreamhost.com (Postfix) with ESMTPSA id AD2833E405C for ; Thu, 14 Oct 2010 17:13:52 -0700 (PDT) To: user@cassandra.apache.org From: Aaron Morton Subject: Re: ColumnFamilyRecordReader issue Date: Fri, 15 Oct 2010 00:13:51 GMT X-Mailer: MobileMe Mail (1C3203) Message-id: <24a106ab-62a6-efad-d69e-ac98c8e054b2@me.com> Content-Type: multipart/alternative; boundary=Apple-Webmail-42--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f MIME-Version: 1.0 In-Reply-To: X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Webmail-42--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=ISO-8859-1; format=flowed Clock was present in beta 1 and then removed. The Beta2 thrift client does= not have this check in it.=A0=0A=0ADouble check your install and make sur= e it's all beta 2.=A0=0A=0AAaron=0A=0A=0AOn 15 Oct, 2010,at 11:49 AM, Mich= ael Moores wrote:=0A=0AMy Hadoop TaskTracker is using t= he Cassandra CplumnFamilyInputFormat, and appears to be finding records (t= he data is serialized below in the log output),=0Abut the cassandra Column= class is throwing a validation exception indicating "Required field 'cloc= k' was not present!".=0A=0AMy Cassandra cluster version is 0.7-beta2 and i= also loaded the same JAR in my hadoop configuration.=0A=0ASo does this po= int to my client side (hadoop task) possibly using the wrong cassandra ver= sion?=0A=0A=0AMapAttempt TASK_TYPE=3D"MAP" TASKID=3D"task_201010140951_002= 0_m_000006" TASK_ATTEMPT_ID=3D"attempt_201010140951_0020_m_000006_3" START= _TIME=3D"1287095149722" TRACKER_NAME=3D"tracker_kv-app06\.dev\.real\.com:l= ocalhost\.localdomain/127\.0\.0\.1:54957" HTTP_PORT=3D"50060" .=0AMapAttem= pt TASK_TYPE=3D"MAP" TASKID=3D"task_201010140951_0020_m_000006" TASK_ATTEM= PT_ID=3D"attempt_201010140951_0020_m_000006_3" TASK_STATUS=3D"FAILED" FINI= SH_TIME=3D"1287095154592" HOSTNAME=3D"kv-app06\.dev\.real\.com" ERROR=3D"j= ava\.lang\.RuntimeException: org\.apache\.thrift\.protocol\.TProtocolExcep= tion: Required field 'clock' was not present! Struct: Column(name:69 64, v= alue:6D 65 73 73 61 67 65 2D 70 72 65 66 65 72 65 6E 63 65 73, clock:null)= =0Aat org\.apache\.cassandra\.hadoop\.ColumnFamilyRecordReader$RowIterator= \.maybeInit(ColumnFamilyRecordReader\.java:266)=0Aat org\.apache\.cassandr= a\.hadoop\.ColumnFamilyRecordReader$RowIterator\.computeNext(ColumnFamilyR= ecordReader\.java:281)=0Aat org\.apache\.cassandra\.hadoop\.ColumnFamilyRe= cordReader$RowIterator\.computeNext(ColumnFamilyRecordReader\.java:178)=0A= at com\.google\.common\.collect\.AbstractIterator\.tryToComputeNext(Abstra= ctIterator\.java:136)=0Aat com\.google\.common\.collect\.AbstractIterator\= hasNext(AbstractIterator\.java:131)=0Aat org\.apache\.cassandra\.hadoop\.= ColumnFamilyRecordReader\.nextKeyValue(ColumnFamilyRecordReader\.java:137)= =0Aat org\.apache\.hadoop\.mapred\.MapTask$NewTrackingRecordReader\.nextKe= yValue(MapTask\.java:423)=0Aat org\.apache\.hadoop\.mapreduce\.MapContext\= nextKeyValue(MapContext\.java:67)=0Aat org\.apache\.hadoop\.mapreduce\.Ma= pper\.run(Mapper\.java:143)=0Aat org\.apache\.hadoop\.mapred\.MapTask\.run= NewMapper(MapTask\.java:621)=0Aat org\.apache\.hadoop\.mapred\.MapTask\.ru= n(MapTask\.java:305)=0Aat org\.apache\.hadoop\.mapred\.Child\.main(Child\.= java:170)=0ACaused by: org\.apache\.thrift\.protocol\.TProtocolException: = Required field 'clock' was not present! Struct: Column(name:69 64, value:6= D 65 73 73 61 67 65 2D 70 72 65 66 65 72 65 6E 63 65 73, clock:null)=0Aat = org\.apache\.cassandra\.thrift\.Column\.validate(Column\.java:615)=0Aat or= g\.apache\.cassandra\.thrift\.Column\.read(Column\.java:527)=0Aat org\.apa= che\.cassandra\.thrift\.ColumnOrSuperColumn\.read(ColumnOrSuperColumn\.jav= a:351)=0Aat org\.apache\.cassandra\.thrift\.KeySlice\.read(KeySlice\.java:= 391)=0Aat org\.apache\.cassandra\.thrift\.Cassandra$get_range_slices_resul= t\.read(Cassandra\.java:11859)=0Aat org\.apache\.cassandra\.thrift\.Cassan= dra$Client\.recv_get_range_slices(Cassandra\.java:743)=0Aat org\.apache\.c= assandra\.thrift\.Cassandra$Client\.get_range_slices(Cassandra\.java:715)=0A= at org\.apache\.cassandra\.hadoop\.ColumnFamilyRecordReader$RowIterator\.m= aybeInit(ColumnFamilyRecordReader\.java:244) --Apple-Webmail-42--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f Content-Type: multipart/related; type="text/html"; boundary=Apple-Webmail-86--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f --Apple-Webmail-86--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=ISO-8859-1;
Clock was present in beta 1 and then removed. The Beta2 thrift client= does not have this check in it. 

Double che= ck your install and make sure it's all beta 2. 

<= div>Aaron


On 15 Oct, 2010,at 11:49 AM, Michael Moores &l= t;mmoores@real.com> wrote:

=
My Hadoop TaskTracker is using the Cassandra = CplumnFamilyInputFormat, and appears to be finding records (the data is se= rialized below in the log output),
=0Abut the cassandra Column class is= throwing a validation exception indicating "Required field 'clock' was no= t present!".
=0A
=0AMy Cassandra cluster version is 0.7-beta2 and i = also loaded the same JAR in my hadoop configuration.
=0A
=0ASo does = this point to my client side (hadoop task) possibly using the wrong cassan= dra version?
=0A
=0A
=0AMapAttempt TASK_TYPE=3D"MAP" TASKID=3D"ta= sk_201010140951_0020_m_000006" TASK_ATTEMPT_ID=3D"attempt_201010140951_002= 0_m_000006_3" START_TIME=3D"1287095149722" TRACKER_NAME=3D"tracker_kv-app0= 6\.dev\.real\.com:localhost\.localdomain/127\.0\.0\.1:54957" HTTP_PORT=3D"= 50060" .
=0AMapAttempt TASK_TYPE=3D"MAP" TASKID=3D"task_201010140951_00= 20_m_000006" TASK_ATTEMPT_ID=3D"attempt_201010140951_0020_m_000006_3" TASK= _STATUS=3D"FAILED" FINISH_TIME=3D"1287095154592" HOSTNAME=3D"kv-app06\.dev= \.real\.com" ERROR=3D"java\.lang\.RuntimeException: org\.apache\.thrift\.p= rotocol\.TProtocolException: Required field 'clock' was not present! Struc= t: Column(name:69 64, value:6D 65 73 73 61 67 65 2D 70 72 65 66 65 72 65 6= E 63 65 73, clock:null)
=0A at org\.apache\.cassandra\.hadoop\.C= olumnFamilyRecordReader$RowIterator\.maybeInit(ColumnFamilyRecordReader\.j= ava:266)
=0A at org\.apache\.cassandra\.hadoop\.ColumnFamilyReco= rdReader$RowIterator\.computeNext(ColumnFamilyRecordReader\.java:281)
=0A= at org\.apache\.cassandra\.hadoop\.ColumnFamilyRecordReader$RowIte= rator\.computeNext(ColumnFamilyRecordReader\.java:178)
=0A at co= m\.google\.common\.collect\.AbstractIterator\.tryToComputeNext(AbstractIte= rator\.java:136)
=0A at com\.google\.common\.collect\.AbstractIt= erator\.hasNext(AbstractIterator\.java:131)
=0A at org\.apache\.= cassandra\.hadoop\.ColumnFamilyRecordReader\.nextKeyValue(ColumnFamilyReco= rdReader\.java:137)
=0A at org\.apache\.hadoop\.mapred\.MapTask$= NewTrackingRecordReader\.nextKeyValue(MapTask\.java:423)
=0A at = org\.apache\.hadoop\.mapreduce\.MapContext\.nextKeyValue(MapContext\.java:= 67)
=0A at org\.apache\.hadoop\.mapreduce\.Mapper\.run(Mapper\.j= ava:143)
=0A at org\.apache\.hadoop\.mapred\.MapTask\.runNewMapp= er(MapTask\.java:621)
=0A at org\.apache\.hadoop\.mapred\.MapTas= k\.run(MapTask\.java:305)
=0A at org\.apache\.hadoop\.mapred\.Ch= ild\.main(Child\.java:170)
=0ACaused by: org\.apache\.thrift\.protocol\= TProtocolException: Required field 'clock' was not present! Struct: Colum= n(name:69 64, value:6D 65 73 73 61 67 65 2D 70 72 65 66 65 72 65 6E 63 65 = 73, clock:null)
=0A at org\.apache\.cassandra\.thrift\.Column\.v= alidate(Column\.java:615)
=0A at org\.apache\.cassandra\.thrift\= Column\.read(Column\.java:527)
=0A at org\.apache\.cassandra\.t= hrift\.ColumnOrSuperColumn\.read(ColumnOrSuperColumn\.java:351)
=0A = at org\.apache\.cassandra\.thrift\.KeySlice\.read(KeySlice\.java:391)<= br>=0A at org\.apache\.cassandra\.thrift\.Cassandra$get_range_slice= s_result\.read(Cassandra\.java:11859)
=0A at org\.apache\.cassan= dra\.thrift\.Cassandra$Client\.recv_get_range_slices(Cassandra\.java:743)<= br>=0A at org\.apache\.cassandra\.thrift\.Cassandra$Client\.get_ran= ge_slices(Cassandra\.java:715)
=0A at org\.apache\.cassandra\.ha= doop\.ColumnFamilyRecordReader$RowIterator\.maybeInit(ColumnFamilyRecordRe= ader\.java:244)
--Apple-Webmail-86--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f-- --Apple-Webmail-42--1e7b6a67-7c12-e8fe-d5be-e5a9f802b20f--