Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2C503E128 for ; Wed, 28 Nov 2012 02:18:39 +0000 (UTC) Received: (qmail 42240 invoked by uid 500); 28 Nov 2012 02:18:36 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 42211 invoked by uid 500); 28 Nov 2012 02:18:36 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 42203 invoked by uid 99); 28 Nov 2012 02:18:36 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Nov 2012 02:18:36 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of okrammarko@gmail.com designates 209.85.160.44 as permitted sender) Received: from [209.85.160.44] (HELO mail-pb0-f44.google.com) (209.85.160.44) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Nov 2012 02:18:28 +0000 Received: by mail-pb0-f44.google.com with SMTP id uo1so9107687pbc.31 for ; Tue, 27 Nov 2012 18:18:07 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:mime-version:content-type:subject:date:in-reply-to:to :references:message-id:x-mailer; bh=bZFUFum3Y9diyh3uaCzZwCeEFlDWhgoqNlEO07F73go=; b=kZDyotWCVZ+wzktcH1f0GDkBIl0mqJ3VWRpWbYNhzuoiNZV0CKBmgU1fhhhRs3p+Bt VANu8jSv5Hpgj5iCXXbMLTclpvHUNASeJkA29swVNoiYR3coUVQPd/x9Q6hENgxuEeMY gCgO5FBBYFogMfgwLgX1rFygCL78zE78JRM/7S2Te4rP1usRbauCp6O6oeQx7T+ThFAg qWZL1kKuV9ZXk4XUzJf/vUczXxgCJ2uOVxBz2yNR8kTuhVsBh5BHxBNX09KV5aKG4DWB JQK7G/59wuYlTDmy2KsYCEl2B1jUYbae/XoFNod6uFvQLqaCWgBGyoVx3WljnLo9lNcR fRAg== Received: by 10.68.143.106 with SMTP id sd10mr53580853pbb.62.1354069087467; Tue, 27 Nov 2012 18:18:07 -0800 (PST) Received: from [192.168.1.100] ([101.115.171.170]) by mx.google.com with ESMTPS id g10sm11629277pav.9.2012.11.27.18.18.04 (version=TLSv1/SSLv3 cipher=OTHER); Tue, 27 Nov 2012 18:18:06 -0800 (PST) From: Marko Rodriguez Mime-Version: 1.0 (Apple Message framework v1085) Content-Type: multipart/alternative; boundary=Apple-Mail-5--47784290 Subject: Re: Frame size exceptions occurring with ColumnFamilyInputFormat for very large rows Date: Tue, 27 Nov 2012 19:17:33 -0700 In-Reply-To: <48334066-2C22-411D-B2F5-DCE1EBD71375@gmail.com> To: user@cassandra.apache.org References: <48334066-2C22-411D-B2F5-DCE1EBD71375@gmail.com> Message-Id: <0D139925-F4F6-4FD4-8224-C7E3894F541C@gmail.com> X-Mailer: Apple Mail (2.1085) X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail-5--47784290 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii Hello, I was wondering if anyone had an answer to my previous message below.=20 Seems another is having the same problem, but unfortunately with no = response as well. = http://mail-archives.apache.org/mod_mbox/cassandra-user/201211.mbox/%3C509= A4A1F.8070506@semantico.com%3E =09 Any help would be much appreciated. Thank you, Marko. http://markorodriguez.com On Nov 9, 2012, at 3:02 PM, Marko Rodriguez wrote: > Hello, >=20 > I am trying to run a Hadoop job that pulls data out of Cassandra via = ColumnFamilyInputFormat. I am getting a "frame size" exception. To = remedy that, I have set both the thrift_framed_transport_size_in_mb and = thrift_max_message_length_in_mb to an "infinite" amount at 100000mb on = all nodes. Moreover, I have restarted the cluster and the cassandra.yaml = files have been reloaded. >=20 > However, I am still getting: >=20 > 12/11/09 21:39:52 INFO mapred.JobClient: map 62% reduce 0% > 12/11/09 21:40:09 INFO mapred.JobClient: Task Id : = attempt_201211082011_0015_m_000479_2, Status : FAILED > java.lang.RuntimeException: = org.apache.thrift.transport.TTransportException: Frame size (30046945) = larger than max length (16384000)! > at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.may= beInit(ColumnFamilyRecordReader.java:400) > at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.com= puteNext(ColumnFamilyRecordReader.java:406) > at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.com= puteNext(ColumnFamilyRecordReader.java:324) > at = com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterat= or.java:143) > at = com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:1= 38) > at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyValue(ColumnFa= milyRecordReader.java:189) >=20 > Question: Why is 16384000 bytes (I assume) !=3D 100000mb? >=20 > Next, I made this parameter true as a last hail mary attempt: > cassandra.input.widerows=3Dtrue > ...still with no luck. >=20 > Does someone know what I might be missing? >=20 > Thank you very much for your time, > Marko. >=20 > http://markorodriguez.com --Apple-Mail-5--47784290 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=us-ascii = http://mail-archives.apache.org/= mod_mbox/cassandra-user/201211.mbox/%3C509A4A1F.8070506@semantico.com%3E
=
Any help would be much = appreciated.

Thank = you,
Marko.


On Nov 9, 2012, at 3:02 PM, Marko Rodriguez = wrote:

Hello,

I am = trying to run a Hadoop job that pulls data out of Cassandra via = ColumnFamilyInputFormat. I am getting a "frame size" exception. To = remedy that, I have set both the thrift_framed_transport_size_in_mb and = thrift_max_message_length_in_mb to an "infinite" amount at 100000mb on = all nodes. Moreover, I have restarted the cluster and the cassandra.yaml = files have been reloaded.

However, I am still = getting:

12/11/09 21:39:52 INFO mapred.JobClient:  map 62% = reduce 0%
12/11/09 21:40:09 INFO mapred.JobClient: Task Id : = attempt_201211082011_0015_m_000479_2, Status : = FAILED
java.lang.RuntimeException: = org.apache.thrift.transport.TTransportException: Frame size (30046945) = larger than max length (16384000)!
at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.may= beInit(ColumnFamilyRecordReader.java:400)
at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.com= puteNext(ColumnFamilyRecordReader.java:406)
at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.com= puteNext(ColumnFamilyRecordReader.java:324)
at = com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterat= or.java:143)
at = com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:1= 38)
= at = org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyValue(ColumnFa= milyRecordReader.java:189)

Question: Why is 16384000 bytes (I = assume) !=3D  100000mb?

Next, I made this parameter true as = a last hail mary attempt:
= cassandra.input.widerows=3Dtrue
...still with no = luck.

Does someone know what I might be missing?

Thank you = very much for your time,
Marko.

http://markorodriguez.com

= --Apple-Mail-5--47784290--