Return-Path: X-Original-To: apmail-giraph-dev-archive@www.apache.org Delivered-To: apmail-giraph-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 45354114B3 for ; Tue, 8 Jul 2014 21:49:05 +0000 (UTC) Received: (qmail 6702 invoked by uid 500); 8 Jul 2014 21:49:05 -0000 Delivered-To: apmail-giraph-dev-archive@giraph.apache.org Received: (qmail 6657 invoked by uid 500); 8 Jul 2014 21:49:05 -0000 Mailing-List: contact dev-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@giraph.apache.org Delivered-To: mailing list dev@giraph.apache.org Received: (qmail 6637 invoked by uid 500); 8 Jul 2014 21:49:05 -0000 Delivered-To: apmail-incubator-giraph-dev@incubator.apache.org Received: (qmail 6634 invoked by uid 99); 8 Jul 2014 21:49:05 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Jul 2014 21:49:05 +0000 Date: Tue, 8 Jul 2014 21:49:04 +0000 (UTC) From: "Hudson (JIRA)" To: giraph-dev@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (GIRAPH-903) Detect crashes of Netty threads MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/GIRAPH-903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14055586#comment-14055586 ] Hudson commented on GIRAPH-903: ------------------------------- ABORTED: Integrated in Giraph-trunk-Commit #1460 (See [https://builds.apache.org/job/Giraph-trunk-Commit/1460/]) GIRAPH-903: Detect crashes of Netty threads (edunov via pavanka) (pavanka: http://git-wip-us.apache.org/repos/asf?p=giraph.git&a=commit&h=61cb37ecd50b0d9400873624e46692c3282e4cfc) * giraph-core/src/main/java/org/apache/giraph/comm/netty/handler/WorkerRequestServerHandler.java * giraph-core/src/test/java/org/apache/giraph/comm/ConnectionTest.java * giraph-core/src/test/java/org/apache/giraph/comm/SaslConnectionTest.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/NettyClient.java * giraph-core/src/main/java/org/apache/giraph/worker/BspServiceWorker.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/NettyServer.java * giraph-core/src/main/java/org/apache/giraph/graph/GraphMapper.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/NettyWorkerClient.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/NettyWorkerServer.java * findbugs-exclude.xml * giraph-core/src/main/java/org/apache/giraph/comm/netty/NettyMasterClient.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/handler/MasterRequestServerHandler.java * giraph-core/src/test/java/org/apache/giraph/comm/RequestTest.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/NettyMasterServer.java * CHANGELOG * giraph-core/src/test/java/org/apache/giraph/comm/RequestFailureTest.java * giraph-core/src/main/java/org/apache/giraph/graph/GraphTaskManager.java * giraph-core/src/main/java/org/apache/giraph/master/BspServiceMaster.java * giraph-core/src/main/java/org/apache/giraph/comm/netty/handler/RequestServerHandler.java * giraph-core/src/main/java/org/apache/giraph/utils/ThreadUtils.java * giraph-core/src/test/java/org/apache/giraph/comm/MockExceptionHandler.java * giraph-core/src/main/java/org/apache/giraph/yarn/GiraphYarnTask.java > Detect crashes of Netty threads > ------------------------------- > > Key: GIRAPH-903 > URL: https://issues.apache.org/jira/browse/GIRAPH-903 > Project: Giraph > Issue Type: Bug > Reporter: Sergey Edunov > Priority: Minor > Attachments: GIRAPH-903.patch, GIRAPH-903.patch > > Original Estimate: 24h > Remaining Estimate: 24h > > When some of the request processing threads fails, the worker gets stuck but the job doesn't fail and it has to be killed manually. We should detect netty thread crashes and fail the job automatically. > You can easily reproduce this if you add a mistake to deserialization of messages for example. -- This message was sent by Atlassian JIRA (v6.2#6252)