Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id AC0B7200C1D for ; Thu, 2 Feb 2017 01:09:56 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id AAA6D160B5E; Thu, 2 Feb 2017 00:09:56 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 00E5B160B46 for ; Thu, 2 Feb 2017 01:09:55 +0100 (CET) Received: (qmail 53688 invoked by uid 500); 2 Feb 2017 00:09:55 -0000 Mailing-List: contact commits-help@beam.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@beam.apache.org Delivered-To: mailing list commits@beam.apache.org Received: (qmail 53671 invoked by uid 99); 2 Feb 2017 00:09:55 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 02 Feb 2017 00:09:55 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id AF1131A020D for ; Thu, 2 Feb 2017 00:09:54 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -1.998 X-Spam-Level: X-Spam-Status: No, score=-1.998 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RP_MATCHES_RCVD=-2.999, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id SbmqmXXfPxZZ for ; Thu, 2 Feb 2017 00:09:53 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 809135FB12 for ; Thu, 2 Feb 2017 00:09:53 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 729B2E0530 for ; Thu, 2 Feb 2017 00:09:52 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id CE46125291 for ; Thu, 2 Feb 2017 00:09:51 +0000 (UTC) Date: Thu, 2 Feb 2017 00:09:51 +0000 (UTC) From: "ASF GitHub Bot (JIRA)" To: commits@beam.incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (BEAM-65) SplittableDoFn MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 02 Feb 2017 00:09:56 -0000 [ https://issues.apache.org/jira/browse/BEAM-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15849152#comment-15849152 ] ASF GitHub Bot commented on BEAM-65: ------------------------------------ GitHub user jkff opened a pull request: https://github.com/apache/beam/pull/1895 [BEAM-65] ProcessFn: create invoker in setup() This makes more sense than creating it in constructor, since the fn can be serialized (and will be - by Dataflow runner). However, since direct runner creates the fn explicitly during evaluation, we need to also call setup() explicitly. R: @kennknowles You can merge this pull request into a Git repository by running: $ git pull https://github.com/jkff/incubator-beam sdf-setup Alternatively you can review and apply these changes as the patch at: https://github.com/apache/beam/pull/1895.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1895 ---- commit 5546ad68fd18315a251d059b7c8c512640172d24 Author: Eugene Kirpichov Date: 2017-02-02T00:07:44Z [BEAM-65] ProcessFn: create invoker in setup() This makes more sense than creating it in constructor, since the fn can be serialized (and will be - by Dataflow runner). However, since direct runner creates the fn explicitly during evaluation, we need to also call setup() explicitly. ---- > SplittableDoFn > -------------- > > Key: BEAM-65 > URL: https://issues.apache.org/jira/browse/BEAM-65 > Project: Beam > Issue Type: New Feature > Components: beam-model > Reporter: Daniel Halperin > Assignee: Eugene Kirpichov > Priority: Minor > > SplittableDoFn is a proposed enhancement for "dynamically splittable work" to the Beam model. > Among other things, it would allow a unified implementation of bounded/unbounded sources with dynamic work rebalancing and the ability to express multiple scalable steps (e.g., global expansion -> file sizing & parsing -> splitting files into independently-processable blocks) via composition rather than inheritance. > This would make it much easier to implement many types of sources, to modify and reuse existing sources. Also, it would improve scalability of the Beam model by moving things like splitting a source from the control plane (where it is today -- glob -> List sent over service APIs) into the data plane (PCollection -> PCollection -> ...). -- This message was sent by Atlassian JIRA (v6.3.15#6346)