flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ufuk Celebi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-1830) java.io.IOException: Network stream corrupted
Date Mon, 06 Apr 2015 18:14:12 GMT

    [ https://issues.apache.org/jira/browse/FLINK-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481532#comment-14481532

Ufuk Celebi commented on FLINK-1830:

Thanks for reporting this critical issue.

1) Can you provide the example program and data? If not publicly, privately would also work.
This will make fixing it way easier. If possible, either post it here or mail me (uce@a.o)

2) If you cannot share it, could you try the same program with the latest nightly version
(0.9-SNAPSHOT) and report if it is fixed there?

> java.io.IOException: Network stream corrupted
> ---------------------------------------------
>                 Key: FLINK-1830
>                 URL: https://issues.apache.org/jira/browse/FLINK-1830
>             Project: Flink
>          Issue Type: Bug
>    Affects Versions: 0.8.1
>            Reporter: Stefan Bunk
>              Labels: network
> When running my Flink job I get the following error:
> {quote}
> 04.Apr. 20:43:12 WARN  DefaultChannelPipeline - An exceptionCaught() event was fired,
and it reached at the tail of the pipeline. It usually means the last handler in the pipeline
did not handle the exception.
> java.io.IOException: Network stream corrupted: invalid magicnumber in current envelope
>         at org.apache.flink.runtime.io.network.netty.InboundEnvelopeDecoder.decodeEnvelope(InboundEnvelopeDecoder.java:239)
>         at org.apache.flink.runtime.io.network.netty.InboundEnvelopeDecoder.decodeBuffer(InboundEnvelopeDecoder.java:127)
>         at org.apache.flink.runtime.io.network.netty.InboundEnvelopeDecoder.channelRead(InboundEnvelopeDecoder.java:111)
>         at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
>         at io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:319)
>         at io.netty.channel.DefaultChannelPipeline.fireChannelRead(DefaultChannelPipeline.java:787)
>         at io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:125)
>         at io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:511)
>         at io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:468)
>         at io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:382)
>         at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:354)
>         at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:116)
>         at io.netty.util.concurrent.DefaultThreadFactory$DefaultRunnableDecorator.run(DefaultThreadFactory.java:137)
>         at java.lang.Thread.run(Thread.java:745)
> {quote}
> Sometimes the job works, sometimes it fails with the above error.
> When it fails, the job still appears as running, but nothing happens anymore until I
cancel it manually. In the logs I can then find the error, often repeated hundreds of times.

This message was sent by Atlassian JIRA

View raw message