Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 840BF200C4F for ; Sat, 1 Apr 2017 22:03:48 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 82B1C160B9D; Sat, 1 Apr 2017 20:03:48 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A39A6160B78 for ; Sat, 1 Apr 2017 22:03:47 +0200 (CEST) Received: (qmail 23423 invoked by uid 500); 1 Apr 2017 20:03:44 -0000 Mailing-List: contact dev-help@samza.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@samza.apache.org Delivered-To: mailing list dev@samza.apache.org Received: (qmail 23410 invoked by uid 99); 1 Apr 2017 20:03:44 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 01 Apr 2017 20:03:44 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 120B41A0581 for ; Sat, 1 Apr 2017 20:03:44 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.423 X-Spam-Level: X-Spam-Status: No, score=-0.423 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H4=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-0.001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (1024-bit key) header.d=linkedin.com header.b=1xYUYxDY; dkim=pass (1024-bit key) header.d=linkedin.com header.b=U3VtTbjA Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id BzmmUEdfPedn for ; Sat, 1 Apr 2017 20:03:41 +0000 (UTC) Received: from mail322.linkedin.com (mail322.linkedin.com [108.174.3.122]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 4B91C5F1A0 for ; Sat, 1 Apr 2017 20:03:41 +0000 (UTC) Authentication-Results: mail322.prod.linkedin.com x-tls.subject="/C=US/ST=California/L=Mountain View/O=Google Inc/CN=smtp.gmail.com"; auth=pass (cipher=ECDHE-RSA-AES128-GCM-SHA256) Authentication-Results: mail322.prod.linkedin.com; iprev=pass policy.iprev="2607:f8b0:4001:c0b::247"; spf=softfail smtp.mailfrom="kparamasivam@linkedin.com" smtp.helo="mail-it0-x247.google.com"; dkim=pass header.d=linkedin.com; tls=pass (verified) key.ciphersuite="TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256" key.length="128" tls.v="tlsv1.2" cert.client="C=US,ST=California,L=Mountain View,O=Google Inc,CN=smtp.gmail.com" cert.clientissuer="C=US,O=Google Inc,CN=Google Internet Authority G2" Received: from [2607:f8b0:4001:c0b::247] ([2607:f8b0:4001:c0b::247.35339] helo=mail-it0-x247.google.com) by mail322.prod.linkedin.com (envelope-from ) (ecelerity 3.6.21.53563 r(Core:3.6.21.0)) with ESMTPS (cipher=ECDHE-RSA-AES128-GCM-SHA256 subject="/C=US/ST=California/L=Mountain View/O=Google Inc/CN=smtp.gmail.com") id 83/1F-16912-79700E85; Sat, 01 Apr 2017 20:03:35 +0000 Received: by mail-it0-x247.google.com with SMTP id o81so21381216itg.2 for ; Sat, 01 Apr 2017 13:03:35 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=4T2IXPDmFtOXN+u2LHeOTtnoEvQwGy2toq7mkUUthFo=; b=NiELoGXGbDrL7u/CGmdnpUrj0UTSkIMKyqjG3htSz5K3dghAgS7r/BC5vJVvLnGrH5 NMUg+MA7i+Ez1HbjvmtZ8g2YcZzKGLIdhILNKlhIlofwIpA3Nve+TzZAi+65OfLvCARe qzV+WtheqFo154Kf2IVfe1wjQWR99WSl1JvNG7Z+KYYX7hoT1f/+6+xckb7udEuWgDkD NCnOo2MftrrTJVKHBvTF+AB2kfx2xaZ63YAWiGrfClIgbU+bZXtwxjJ+TWWYC1Qr54kJ JllF5XOEGXoQEjFn4z2whh7b7jCxMyYU1gvgoq/hJnMvG4TT/svr5+4OlPGrJXDo9Z8t +9iA== X-Gm-Message-State: AFeK/H3Eckfuw72YSbQ4Ly1f6S5yE3+s3PeiwMmuHuAF2UxrkuB+E+OGiETciObR3NvBxs9JzFJ0ImoxH35NqXTG2uhIZGP3li04t1C309vrB4zuh4FKQMCLMRa0vzuwunfr65/y15z3PBe4fyFXk62vI/TOHh5jYsCw X-Received: by 10.107.143.215 with SMTP id r206mr9430419iod.121.1491077014723; Sat, 01 Apr 2017 13:03:34 -0700 (PDT) X-Received: by 10.107.143.215 with SMTP id r206mr9430413iod.121.1491077014434; Sat, 01 Apr 2017 13:03:34 -0700 (PDT) MIME-Version: 1.0 Received: by 10.107.168.38 with HTTP; Sat, 1 Apr 2017 13:03:33 -0700 (PDT) In-Reply-To: References: From: Kartik Paramasivam Date: Sat, 1 Apr 2017 13:03:33 -0700 Message-ID: Subject: Re: [VOTE] SEP-1: Semantics of ProcessorId in Samza To: dev@samza.apache.org Content-Type: multipart/alternative; boundary=94eb2c05aaf0c767e7054c206bfb archived-at: Sat, 01 Apr 2017 20:03:48 -0000 --94eb2c05aaf0c767e7054c206bfb Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable +1 (non binding) Great to see the SEP process being followed. cheers Kartik On Thu, Mar 30, 2017 at 1:48 PM, Renato Marroqu=C3=ADn Mogrovejo < renatoj.marroquin@gmail.com> wrote: > Thanks for the answers Navina! > > +1 (non-binding) > > 2017-03-30 22:32 GMT+02:00 Navina Ramesh : > > > Hi Renato, > > > > > Having the big proposals documented on SEPs is really great to have a > > good understanding on the system! > > I agree. Our previous design process was not being strictly enforced. W= e > > hope to enforce it going forward as there are major changes coming into > the > > next release. > > > > > So this means that inside a container there will be a single processo= r? > > StreamProcessor is nothing more than a Samza container, along with an > > instance of JobCoordinator in it. Think about it as a thin-wrapper arou= nd > > SamzaContainer and JobCoordinator instance. You can find more details o= n > > this idea here - https://issues.apache.org/jira/browse/SAMZA-1063 > > Going forward, we want a Samza job to consist of one or more > > StreamProcessors, instead of N SamzaContainers and 1 AppMaster. > > > > > is this related to SAMZA-1080 somehow? > > Yep. SAMZA-1080 introduces StreamProcessor with an almost pass-through > > JobCoordinator. In fact, at LinkedIn, one of the teams is already using > > this API with the StandaloneJobCoordinator and delegating partition > > distribution to kafka high-level consumer (since systemconsumer is > > pluggable in Samza, we have some internal wrappers around high-level > > consumer). It has been working really well for stateless applications, = I > > believe. > > > > Cheers! > > Navina > > > > On Thu, Mar 30, 2017 at 1:23 PM, Renato Marroqu=C3=ADn Mogrovejo < > > renatoj.marroquin@gmail.com> wrote: > > > > > Hi Navina, > > > > > > Thanks for the great proposal! Having the big proposals documented on > > SEPs > > > is really great to have a good understanding on the system! > > > I have only a clarification question, the proposal states that every > > > containerId is the same as the processorId. So this means that inside= a > > > container there will be a single processor? is this related to > SAMZA-1080 > > > somehow? > > > > > > > > > Best, > > > > > > Renato M. > > > > > > 2017-03-30 20:45 GMT+02:00 Navina Ramesh >: > > > > > > > Hi Yi, > > > > Good question. Three reasons: > > > > > > > > 1. In SAMZA-881, we came up with a set of responsibilities for the > > > > JobCoordinator. One of them was to generate/assign processorId. So, > it > > > > makes sense to keep getProcessorId() within JobCoordinator interfac= e. > > > > 2. StreamProcessor was initially introduced as a user-facing API > > > > SAMZA-1080. ProcessorId was an argument in StreamProcessor > constructor. > > > It > > > > was pushing the burden of guaranteeing unique among the processors > of a > > > job > > > > to the user. This was not favorable. > > > > 3. In general, I think we have consensus that the > processorIdGenerator > > is > > > > going to specific to a runtime environment. Hence, it seems more > > > > appropriate to move it to a lower abstraction layer that deals with > the > > > > underlying execution environment. > > > > > > > > Let me know if you have a different perspective on this. > > > > > > > > Cheers! > > > > Navina > > > > > > > > On Thu, Mar 30, 2017 at 9:42 AM, Yi Pan wrote= : > > > > > > > > > @Navina, > > > > > > > > > > Sorry to chime in late. One question: > > > > > 1. Why is it in JobCoordinator, and why not in StreamProcessor > class? > > > > > Because JobCoordinator provides coordination service across many > > > > > processors, an interface getProcessorId() in JobCoordinator is > > > confusing > > > > > regarding to which processorId we are getting. > > > > > > > > > > Otherwise, the proposal looks good. > > > > > > > > > > -Yi > > > > > > > > > > On Wed, Mar 29, 2017 at 7:57 PM, Navina Ramesh > > > > > > > > > > wrote: > > > > > > > > > > > Good to hear from you, Yan. Thanks! :) > > > > > > > > > > > > On Wed, Mar 29, 2017 at 7:48 PM, Yan Fang > > > > wrote: > > > > > > > > > > > > > +1 . Thanks for the proposal, Navina. :) > > > > > > > > > > > > > > Fang, Yan > > > > > > > yanfang724@gmail.com > > > > > > > > > > > > > > On Thu, Mar 30, 2017 at 4:24 AM, Prateek Maheshwari < > > > > > > > pmaheshwari@linkedin.com.invalid> wrote: > > > > > > > > > > > > > > > +1 (non binding) from me. > > > > > > > > > > > > > > > > - Prateek > > > > > > > > > > > > > > > > On Tue, Mar 28, 2017 at 2:17 PM, Boris S > > > wrote: > > > > > > > > > > > > > > > > > +1 Looks good to me. > > > > > > > > > > > > > > > > > > On Tue, Mar 28, 2017 at 2:00 PM, xinyu liu < > > > > xinyuliu.us@gmail.com> > > > > > > > > wrote: > > > > > > > > > > > > > > > > > > > +1 on my side. Very happy to see this proposal. This is= a > > > > blocker > > > > > > for > > > > > > > > > > integrating fluent API with StreamProcessor, and > hopefully > > we > > > > can > > > > > > get > > > > > > > > it > > > > > > > > > > resolved soon :). > > > > > > > > > > > > > > > > > > > > Thanks, > > > > > > > > > > Xinyu > > > > > > > > > > > > > > > > > > > > On Tue, Mar 28, 2017 at 11:28 AM, Navina Ramesh (Apache= ) > < > > > > > > > > > > navina@apache.org> > > > > > > > > > > wrote: > > > > > > > > > > > > > > > > > > > > > Hi everyone, > > > > > > > > > > > > > > > > > > > > > > This is a voting thread for SEP-1: Semantics of > > ProcessorId > > > > in > > > > > > > Samza. > > > > > > > > > > > For reference, here is the wiki link: > > > > > > > > > > > https://cwiki.apache.org/confluence/display/SAMZA/SEP= - > > > > > > > > > > > 1%3A+Semantics+of+ProcessorId+in+Samza > > > > > > > > > > > > > > > > > > > > > > Link to discussion mail thread: > > > > > > > > > > > http://mail-archives.apache. > > org/mod_mbox/samza-dev/201703. > > > > > > > > > > > mbox/%3CCANazzuuHiO%3DvZQyFbTiYU-0Sfh3riK%3Dz4j_ > > > > > > > > > > AdCicQ8rBO%3DXuYQ%40mail. > > > > > > > > > > > gmail.com%3E > > > > > > > > > > > > > > > > > > > > > > Please vote on this SEP asap. :) > > > > > > > > > > > > > > > > > > > > > > Thanks! > > > > > > > > > > > Navina > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > Navina R. > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > Navina R. > > > > > > > > > > > > > > > -- > > Navina R. > > > --=20 We are hiring in Streams Infra (Kafka/Samza/Datastream) !! --94eb2c05aaf0c767e7054c206bfb--