Date: Mon, 28 Nov 2016 18:47:58 +0000 (UTC)
From: "Mariia Mykhailova (JIRA)"
To: dev@reef.apache.org
Reply-To: dev@reef.apache.org
Subject: [jira] [Commented] (REEF-1244) Group Communication does not close down properly at the end if reej job

    [ https://issues.apache.org/jira/browse/REEF-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15702768#comment-15702768 ]

Mariia Mykhailova commented on REEF-1244:
-----------------------------------------

[~dkm2110] What's the status of this JIRA?
It looks fairly old, and we've made a lot of changes to how IMRU jobs shut down. If it's still valid, could you please provide more details?

> Group Communication does not close down properly at the end if reej job
> -----------------------------------------------------------------------
>
>                 Key: REEF-1244
>                 URL: https://issues.apache.org/jira/browse/REEF-1244
>             Project: REEF
>          Issue Type: Bug
>          Components: GroupCommunications
>   Affects Versions: 0.13
>         Environment: C#
>            Reporter: Dhruv Mahajan
>            Assignee: Dhruv Mahajan
>
> Currently, when we want to shut down an evaluator, the Dispose() function of group communications is called. However, this creates a race condition. For example, suppose evaluator e1 calls Dispose() and closes its stream with evaluator e2. If e2 is in the stream's ReadAsync() function at that moment, the read fails, because Dispose() has not yet been called on e2. Moreover, when Dispose() is later called on e2, it will try to close the already closed stream again.
> Some of these scenarios are handled by catching exceptions and ignoring them, but others are not caught and throw errors, which causes the driver and the REEF job to fail.
> The aim of this JIRA is to identify all of these closing scenarios and handle them appropriately.
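
For illustration only, the two scenarios in the description (a read that is in flight when the peer closes, and a second close of an already closed stream) could be handled roughly as sketched below. This is a Python analogue, not REEF's C# code: `PeerLink` and its methods are hypothetical names, and a local socket pair stands in for the stream between evaluators e1 and e2. It assumes that a peer-initiated close should be treated as a normal end of stream rather than as a failure.

```python
import socket
import threading


class PeerLink:
    """Hypothetical wrapper around one end of a stream between two evaluators."""

    def __init__(self, sock):
        self._sock = sock
        self._lock = threading.Lock()
        self._closed = False

    def dispose(self):
        """Close the stream once. Returns True on the first call, False afterwards."""
        with self._lock:
            if self._closed:
                return False  # already disposed: closing again is a no-op
            self._closed = True
        try:
            self._sock.shutdown(socket.SHUT_RDWR)
        except OSError:
            pass  # the peer may already have closed its end
        self._sock.close()
        return True

    def read(self, n=1024):
        """Return received bytes, or None once either side has closed the stream."""
        try:
            data = self._sock.recv(n)
        except OSError:
            return None  # closed locally or reset by the peer
        return data or None  # an empty read means the peer closed cleanly


# e1 disposes first; e2's read then reports end of stream instead of raising,
# and disposing e2 (even twice) does not fail.
a, b = socket.socketpair()
e1, e2 = PeerLink(a), PeerLink(b)
e1.dispose()
leftover = e2.read()
e2.dispose()
e2.dispose()
```

Whether a peer close should be reported as end of stream or surfaced to the driver depends on the shutdown protocol, so the return values here are a design assumption rather than a recommendation for REEF.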