Return-Path: X-Original-To: apmail-cassandra-commits-archive@www.apache.org Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 16BC89CC5 for ; Tue, 24 Jan 2012 16:09:06 +0000 (UTC) Received: (qmail 1456 invoked by uid 500); 24 Jan 2012 16:09:05 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 1149 invoked by uid 500); 24 Jan 2012 16:09:05 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 1141 invoked by uid 99); 24 Jan 2012 16:09:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 24 Jan 2012 16:09:04 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 24 Jan 2012 16:09:03 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id AE86916096D for ; Tue, 24 Jan 2012 16:08:43 +0000 (UTC) Date: Tue, 24 Jan 2012 16:08:43 +0000 (UTC) From: "Jonathan Ellis (Updated) (JIRA)" To: commits@cassandra.apache.org Message-ID: <1102326847.72154.1327421323716.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <235640.71318.1327404400196.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (CASSANDRA-3776) Streaming task hangs forever during repair after unexpected connection reset by peer MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-3776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-3776: -------------------------------------- Component/s: Core Priority: Minor (was: Major) Fix Version/s: 1.0.8 Assignee: Yuki Morishita To be specific: none of the nodes involved went down? Were anything else unusual correlated with the reset? Can you reproduce this? > Streaming task hangs forever during repair after unexpected connection reset by peer > ------------------------------------------------------------------------------------ > > Key: CASSANDRA-3776 > URL: https://issues.apache.org/jira/browse/CASSANDRA-3776 > Project: Cassandra > Issue Type: Bug > Components: Core > Affects Versions: 1.0.7 > Environment: Windows Server 2008 R2 > Sun Java 7u2 64bit > Reporter: Viktor Jevdokimov > Assignee: Yuki Morishita > Priority: Minor > Fix For: 1.0.8 > > > During streaming (repair) a stream receiving node thrown an exceptions: > ERROR [Streaming:1] 2012-01-24 10:17:03,828 AbstractCassandraDaemon.java (line 139) Fatal exception in thread Thread[Streaming:1,1,main] > java.lang.RuntimeException: java.net.SocketException: Connection reset by peer: socket write error > at org.apache.cassandra.utils.FBUtilities.unchecked(FBUtilities.java:689) > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34) > at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) > at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) > at java.lang.Thread.run(Unknown Source) > Caused by: java.net.SocketException: Connection reset by peer: socket write error > at java.net.SocketOutputStream.socketWrite0(Native Method) > at java.net.SocketOutputStream.socketWrite(Unknown Source) > at java.net.SocketOutputStream.write(Unknown Source) > at com.ning.compress.lzf.LZFChunk.writeCompressedHeader(LZFChunk.java:77) > at com.ning.compress.lzf.ChunkEncoder.encodeAndWriteChunk(ChunkEncoder.java:132) > at com.ning.compress.lzf.LZFOutputStream.writeCompressedBlock(LZFOutputStream.java:203) > at com.ning.compress.lzf.LZFOutputStream.write(LZFOutputStream.java:97) > at org.apache.cassandra.streaming.FileStreamTask.write(FileStreamTask.java:181) > at org.apache.cassandra.streaming.FileStreamTask.stream(FileStreamTask.java:145) > at org.apache.cassandra.streaming.FileStreamTask.runMayThrow(FileStreamTask.java:91) > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30) > ... 3 more > ERROR [Streaming:1] 2012-01-24 10:17:03,891 AbstractCassandraDaemon.java (line 139) Fatal exception in thread Thread[Streaming:1,1,main] > java.lang.RuntimeException: java.net.SocketException: Connection reset by peer: socket write error > at org.apache.cassandra.utils.FBUtilities.unchecked(FBUtilities.java:689) > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34) > at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) > at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) > at java.lang.Thread.run(Unknown Source) > Caused by: java.net.SocketException: Connection reset by peer: socket write error > at java.net.SocketOutputStream.socketWrite0(Native Method) > at java.net.SocketOutputStream.socketWrite(Unknown Source) > at java.net.SocketOutputStream.write(Unknown Source) > at com.ning.compress.lzf.LZFChunk.writeCompressedHeader(LZFChunk.java:77) > at com.ning.compress.lzf.ChunkEncoder.encodeAndWriteChunk(ChunkEncoder.java:132) > at com.ning.compress.lzf.LZFOutputStream.writeCompressedBlock(LZFOutputStream.java:203) > at com.ning.compress.lzf.LZFOutputStream.write(LZFOutputStream.java:97) > at org.apache.cassandra.streaming.FileStreamTask.write(FileStreamTask.java:181) > at org.apache.cassandra.streaming.FileStreamTask.stream(FileStreamTask.java:145) > at org.apache.cassandra.streaming.FileStreamTask.runMayThrow(FileStreamTask.java:91) > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30) > ... 3 more > After which streaming hanged forever. > A few seconds later the sending node had an exception (may not be related): > ERROR [Thread-17224] 2012-01-24 10:17:07,817 AbstractCassandraDaemon.java (line 139) Fatal exception in thread Thread[Thread-17224,5,main] > java.lang.ArrayIndexOutOfBoundsException > Other than that, nodes behave normally, communicating each other. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira