From: Ralph Romanos
Subject: Problem while streaming SSTables with BulkOutputFormat
Date: Mon, 8 Oct 2012 15:06:31 +0000

Hello,

I am using BulkOutputFormat to load data from a .csv file into Cassandra. I am using Cassandra 1.1.3 and Hadoop 0.20.2.
I have 7 Hadoop nodes: 1 namenode/jobtracker and 6 datanodes/tasktrackers. Cassandra is installed on 4 of these 6 datanodes/tasktrackers.
The issue happens when I have more than 1 reducer. SSTables are generated on each node; however, I get the following error in the tasktracker logs when the SSTables are streamed into the Cassandra cluster:

Exception in thread "Streaming to /172.16.110.79:1" java.lang.RuntimeException: java.io.EOFException
	at org.apache.cassandra.utils.FBUtilities.unchecked(FBUtilities.java:628)
	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
	at java.lang.Thread.run(Unknown Source)
Caused by: java.io.EOFException
	at java.io.DataInputStream.readInt(Unknown Source)
	at org.apache.cassandra.streaming.FileStreamTask.receiveReply(FileStreamTask.java:194)
	at org.apache.cassandra.streaming.FileStreamTask.stream(FileStreamTask.java:181)
	at org.apache.cassandra.streaming.FileStreamTask.runMayThrow(FileStreamTask.java:94)
	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
	... 3 more
Exception in thread "Streaming to /172.16.110.92:1" java.lang.RuntimeException: java.io.EOFException
	at org.apache.cassandra.utils.FBUtilities.unchecked(FBUtilities.java:628)
	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
	at java.lang.Thread.run(Unknown Source)
Caused by: java.io.EOFException
	at java.io.DataInputStream.readInt(Unknown Source)
	at org.apache.cassandra.streaming.FileStreamTask.receiveReply(FileStreamTask.java:194)
	at org.apache.cassandra.streaming.FileStreamTask.stream(FileStreamTask.java:181)
	at org.apache.cassandra.streaming.FileStreamTask.runMayThrow(FileStreamTask.java:94)
	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
	... 3 more
 ...
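
For context on the trace above: the innermost frames are FileStreamTask.receiveReply calling DataInputStream.readInt, and readInt throws EOFException whenever the stream ends before all four bytes of the int have arrived. One plausible reading, then, is that the receiving node closed the connection before sending its reply. The sketch below is a standalone, plain-JDK illustration of that behaviour only. It is not Cassandra code, and the names EofDemo and readReply are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;

// Standalone illustration; not Cassandra code.
// EofDemo and readReply are hypothetical names used only for this sketch.
class EofDemo {
    // Tries to read one 4-byte int "reply" from the bytes that were received.
    // Returns the decoded value as text, or "EOFException" if the stream
    // ends before four bytes are available.
    static String readReply(byte[] received) throws IOException {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(received))) {
            return Integer.toString(in.readInt());
        } catch (EOFException e) {
            return "EOFException";
        }
    }

    public static void main(String[] args) throws IOException {
        // A complete 4-byte reply decodes normally.
        System.out.println(readReply(new byte[] {0, 0, 0, 7}));
        // An empty stream models a peer that closed the socket without replying.
        System.out.println(readReply(new byte[0]));
    }
}
```

If that reading is right, the EOFException on the tasktracker side would be a symptom of the session being closed on the Cassandra side rather than the root cause.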

This is what I get in the logs of one of my Cassandra nodes:
ERROR 16:47:34,904 Sending retry message failed, closing session.
java.io.IOException: Broken pipe
        at sun.nio.ch.FileDispatcher.write0(Native Method)
        at sun.nio.ch.SocketDispatcher.write(Unknown Source)
        at sun.nio.ch.IOUtil.writeFromNativeBuffer(Unknown Source)
        at sun.nio.ch.IOUtil.write(Unknown Source)
        at sun.nio.ch.SocketChannelImpl.write(Unknown Source)
        at java.nio.channels.Channels.writeFullyImpl(Unknown Source)
        at java.nio.channels.Channels.writeFully(Unknown Source)
        at java.nio.channels.Channels.access$000(Unknown Source)
        at java.nio.channels.Channels$1.write(Unknown Source)
        at java.io.OutputStream.write(Unknown Source)
        at java.nio.channels.Channels$1.write(Unknown Source)
        at java.io.DataOutputStream.writeInt(Unknown Source)
        at org.apache.cassandra.net.OutboundTcpConnection.write(OutboundTcpConnection.java:196)
        at org.apache.cassandra.streaming.StreamInSession.sendMessage(StreamInSession.java:171)
        at org.apache.cassandra.streaming.StreamInSession.retry(StreamInSession.java:160)
        at org.apache.cassandra.streaming.IncomingStreamReader.retry(IncomingStreamReader.java:168)
        at org.apache.cassandra.streaming.IncomingStreamReader.read(IncomingStreamReader.java:98)
        at org.apache.cassandra.net.IncomingTcpConnection.stream(IncomingTcpConnection.java:182)
        at org.apache.cassandra.net.IncomingTcpConnection.run(IncomingTcpConnection.java:78)

Does anyone know what caused these errors?

Thank you for your help.
Regards,
Ralph