Return-Path: X-Original-To: apmail-flink-issues-archive@minotaur.apache.org Delivered-To: apmail-flink-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 806D918A47 for ; Wed, 29 Apr 2015 12:34:32 +0000 (UTC) Received: (qmail 43155 invoked by uid 500); 29 Apr 2015 12:34:32 -0000 Delivered-To: apmail-flink-issues-archive@flink.apache.org Received: (qmail 43103 invoked by uid 500); 29 Apr 2015 12:34:32 -0000 Mailing-List: contact issues-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list issues@flink.apache.org Received: (qmail 43090 invoked by uid 99); 29 Apr 2015 12:34:32 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2015 12:34:32 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: message received from 54.191.145.13 which is an MX secondary for issues@flink.apache.org) Received: from [54.191.145.13] (HELO mx1-us-west.apache.org) (54.191.145.13) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2015 12:34:27 +0000 Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with SMTP id 08454283F8 for ; Wed, 29 Apr 2015 12:34:08 +0000 (UTC) Received: (qmail 42486 invoked by uid 99); 29 Apr 2015 12:34:07 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Apr 2015 12:34:07 +0000 Date: Wed, 29 Apr 2015 12:34:07 +0000 (UTC) From: "ASF GitHub Bot (JIRA)" To: issues@flink.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (FLINK-1927) [Py] Rework operator distribution MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/FLINK-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14519269#comment-14519269 ] ASF GitHub Bot commented on FLINK-1927: --------------------------------------- Github user aljoscha commented on the pull request: https://github.com/apache/flink/pull/638#issuecomment-97412959 But doesn't this mean that the lambdas now must be stateless, i.e. if a user refers to some variable outside the lambda this will not be serialised with the closure anymore (because there is no serialization of the closure anymore). > [Py] Rework operator distribution > --------------------------------- > > Key: FLINK-1927 > URL: https://issues.apache.org/jira/browse/FLINK-1927 > Project: Flink > Issue Type: Improvement > Components: Python API > Affects Versions: 0.9 > Reporter: Chesnay Schepler > Assignee: Chesnay Schepler > Priority: Minor > Fix For: 0.9 > > > Currently, the python operator is created when execution the python plan file, serialized using dill and saved as a byte[] in the java function. It is then deserialized at runtime on each node. > The current implementation is fairly hacky, and imposes certain limitations that make it hard to work with. Chaining, or generally saving other user-code, always requires a separate deserialization step after deserializing the operator. > These issues can be easily circumvented by rebuilding the (python) plan on each node, instead of serializing the operator. The plan creation is deterministic, and every operator is uniquely identified by an ID that is already known to the java function. > This change will allow us to easily support custom serializers. -- This message was sent by Atlassian JIRA (v6.3.4#6332)