Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AF997E194 for ; Wed, 28 Nov 2012 03:27:15 +0000 (UTC) Received: (qmail 82824 invoked by uid 500); 28 Nov 2012 03:27:13 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 82619 invoked by uid 500); 28 Nov 2012 03:27:12 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 82587 invoked by uid 99); 28 Nov 2012 03:27:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Nov 2012 03:27:11 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of edlinuxguru@gmail.com designates 209.85.223.172 as permitted sender) Received: from [209.85.223.172] (HELO mail-ie0-f172.google.com) (209.85.223.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Nov 2012 03:27:05 +0000 Received: by mail-ie0-f172.google.com with SMTP id c13so13011815ieb.31 for ; Tue, 27 Nov 2012 19:26:45 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=YF3hOSoxmDBpNKvwJP82deEaCCzL6gUk7PH/6cPKACc=; b=RuTqz+Mzh/ZgRWEVB2aGGNa58YHaLUb6tj8XI3dXMAahpL7CYQN/UQ7axiZxYF7Vsz IG1a5RkGlhnZ4QEass7dQuiebAUe8FbEgJuv3PLyxFI035VkRsuuNVS86QE8MzBSUThB qMbcrhRDYNg/Vs9aNL/IlvtWmfpbEEgZf+YRCbpbyz2Vll1R8l2b/QP7/T2dnkHngP9f MLU4hGr1oubLvxT843A3Av3Ahc8uzQw+AZviyP4c4JjR1/IDNisoG9hWh8l0nnSUOAad qsQqj6LBPY5T8jlwz1pCyyYKBVE+7ZUB7B5PE4cUZjfBXdN9Zi2h1huNMkp8gCJYO2be 95hQ== MIME-Version: 1.0 Received: by 10.50.188.136 with SMTP id ga8mr17817167igc.24.1354073204862; Tue, 27 Nov 2012 19:26:44 -0800 (PST) Received: by 10.64.97.106 with HTTP; Tue, 27 Nov 2012 19:26:44 -0800 (PST) In-Reply-To: <0D139925-F4F6-4FD4-8224-C7E3894F541C@gmail.com> References: <48334066-2C22-411D-B2F5-DCE1EBD71375@gmail.com> <0D139925-F4F6-4FD4-8224-C7E3894F541C@gmail.com> Date: Tue, 27 Nov 2012 22:26:44 -0500 Message-ID: Subject: Re: Frame size exceptions occurring with ColumnFamilyInputFormat for very large rows From: Edward Capriolo To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=14dae93410f7602e9704cf85bcee X-Virus-Checked: Checked by ClamAV on apache.org --14dae93410f7602e9704cf85bcee Content-Type: text/plain; charset=ISO-8859-1 Thrift has to buffer the packet into memory so setting it to 1,000,000 brontobytes is a bad idea. On Tue, Nov 27, 2012 at 9:17 PM, Marko Rodriguez wrote: > Hello, > > I was wondering if anyone had an answer to my previous message below. > > Seems another is having the same problem, but unfortunately with no > response as well. > > http://mail-archives.apache.org/mod_mbox/cassandra-user/201211.mbox/%3C509A4A1F.8070506@semantico.com%3E > Any help would be much appreciated. > > Thank you, > Marko. > > http://markorodriguez.com > > On Nov 9, 2012, at 3:02 PM, Marko Rodriguez wrote: > > Hello, > > I am trying to run a Hadoop job that pulls data out of Cassandra via > ColumnFamilyInputFormat. I am getting a "frame size" exception. To remedy > that, I have set both the thrift_framed_transport_size_in_mb and > thrift_max_message_length_in_mb to an "infinite" amount at 100000mb on all > nodes. Moreover, I have restarted the cluster and the cassandra.yaml files > have been reloaded. > > However, I am still getting: > > 12/11/09 21:39:52 INFO mapred.JobClient: map 62% reduce 0% > 12/11/09 21:40:09 INFO mapred.JobClient: Task Id : > attempt_201211082011_0015_m_000479_2, Status : FAILED > java.lang.RuntimeException: > org.apache.thrift.transport.TTransportException: Frame size (30046945) > larger than max length (16384000)! > at > org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.maybeInit(ColumnFamilyRecordReader.java:400) > at > org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.computeNext(ColumnFamilyRecordReader.java:406) > at > org.apache.cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.computeNext(ColumnFamilyRecordReader.java:324) > at > com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:143) > at > com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:138) > at > org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyValue(ColumnFamilyRecordReader.java:189) > > Question: Why is 16384000 bytes (I assume) != 100000mb? > > Next, I made this parameter true as a last hail mary attempt: > cassandra.input.widerows=true > ...still with no luck. > > Does someone know what I might be missing? > > Thank you very much for your time, > Marko. > > http://markorodriguez.com > > > --14dae93410f7602e9704cf85bcee Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Thrift has to buffer the packet into memory so setting it to 1,000,000 bron= tobytes is a bad idea.

On Tue, Nov 27, 20= 12 at 9:17 PM, Marko Rodriguez <okrammarko@gmail.com> wro= te:
Hello,

I was wondering if anyone had an answe= r to my previous message below.=A0

Seems another is having the same problem, but unfortuna= tely with no response as well.
Any help would= be much appreciated.

Thank you,
Marko.<= /div>


On Nov 9, 2012, at 3:02 PM, Marko Rodriguez wrote:

<= blockquote type=3D"cite">
Hello,

= I am trying to run a Hadoop job that pulls data out of Cassandra via Column= FamilyInputFormat. I am getting a "frame size" exception. To reme= dy that, I have set both the thrift_framed_transport_size_in_mb and thrift_= max_message_length_in_mb to an "infinite" amount at 100000mb on a= ll nodes. Moreover, I have restarted the cluster and the cassandra.yaml fil= es have been reloaded.

However, I am still getting:

12/11/09 21:39:52 INFO mapred.JobCl= ient: =A0map 62% reduce 0%
12/11/09 21:40:09 INFO mapred.JobClient: Task= Id : attempt_201211082011_0015_m_000479_2, Status : FAILED
java.lang.Ru= ntimeException: org.apache.thrift.transport.TTransportException: Frame size= (30046945) larger than max length (16384000)!
at org.apache.cassandra.hadoop= .ColumnFamilyRecordReader$StaticRowIterator.maybeInit(ColumnFamilyRecordRea= der.java:400)
at org.apache= .cassandra.hadoop.ColumnFamilyRecordReader$StaticRowIterator.computeNext(Co= lumnFamilyRecordReader.java:406)
at org.apache.cassandra.hadoop= .ColumnFamilyRecordReader$StaticRowIterator.computeNext(ColumnFamilyRecordR= eader.java:324)
at com.goog= le.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:1= 43)
at com.google.common.collect.A= bstractIterator.hasNext(AbstractIterator.java:138)
at org.apache.cassandra.hadoop.ColumnFamilyRecordRe= ader.nextKeyValue(ColumnFamilyRecordReader.java:189)

Question: Why is 16384000 bytes (I assume) !=3D =A0100000mb?

Nex= t, I made this parameter true as a last hail mary attempt:
cassandra.input.widerows=3Dtrue
...sti= ll with no luck.

Does someone know what I might be missing?

Thank you very much f= or your time,
Marko.

http://markorodriguez.com


--14dae93410f7602e9704cf85bcee--