Return-Path: X-Original-To: apmail-flink-dev-archive@www.apache.org Delivered-To: apmail-flink-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CBDB0113B6 for ; Fri, 4 Jul 2014 18:29:01 +0000 (UTC) Received: (qmail 70805 invoked by uid 500); 4 Jul 2014 18:29:01 -0000 Delivered-To: apmail-flink-dev-archive@flink.apache.org Received: (qmail 70751 invoked by uid 500); 4 Jul 2014 18:29:01 -0000 Mailing-List: contact dev-help@flink.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.incubator.apache.org Delivered-To: mailing list dev@flink.incubator.apache.org Received: (qmail 70729 invoked by uid 99); 4 Jul 2014 18:29:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Jul 2014 18:29:01 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of gyula.fora@gmail.com designates 209.85.214.169 as permitted sender) Received: from [209.85.214.169] (HELO mail-ob0-f169.google.com) (209.85.214.169) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 04 Jul 2014 18:28:59 +0000 Received: by mail-ob0-f169.google.com with SMTP id wp18so2182934obc.0 for ; Fri, 04 Jul 2014 11:28:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=hRvtkltf3DLcVfypOvp+mR5A1ilZ1sFovW+9meqMZg8=; b=CV/sbH4Xikx0XYMyWk+mhvZxS7Jc5Q6cYz4Rzwll5qpnUhGkFKk+GjmjjqANhORKub 0TtUGHbn3lAIZUbXqVRH4QqCqF7Dgil8S0VjVcFGiq5spqRbWPR1ac/R0NPozEtVtgc7 MKx59XO7Ia3aMPm0ZN5eyfGz8pzZRn6Yxih2dt/QkgQ9c/qMp+YMZhI5ZuTACDOr1Ub7 DGkGodoB+3vF8Ein+fnR6AzRu4yEJuyCXdCy97J9iEnwCC8/w3FDw+OsLCb1ma7V84jX LAt1DkOJkHQhZ3RP4VOtOvSUNFXpKXBU0ns6cSy7qyctEeXAAv/KzliFHxJMRXgAuuts hisw== MIME-Version: 1.0 X-Received: by 10.60.79.104 with SMTP id i8mr13933450oex.67.1404498514405; Fri, 04 Jul 2014 11:28:34 -0700 (PDT) Received: by 10.182.144.166 with HTTP; Fri, 4 Jul 2014 11:28:34 -0700 (PDT) In-Reply-To: References: Date: Fri, 4 Jul 2014 20:28:34 +0200 Message-ID: Subject: Re: Adding the streaming project to the main repository From: =?UTF-8?Q?Gyula_F=C3=B3ra?= To: dev@flink.incubator.apache.org Content-Type: multipart/alternative; boundary=089e0118289a09f17d04fd624b17 X-Virus-Checked: Checked by ClamAV on apache.org --089e0118289a09f17d04fd624b17 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hey, I would like to give a quick update on the status of the flink streaming project; all of our dependencies are now updated to the current 0.6-snapshot in our main branch, and the project is now decomposed into 3 subprojects: core, examples, and addons. We have created a separate branch for our 0.2 release with dependencies to 0.5, however from now on we will focus our development efforts to be able to merge our main branch with the Main Flink project. Regards, Gyula, M=C3=A1rton & G=C3=A1bor On Wed, Jul 2, 2014 at 2:05 PM, M=C3=A1rton Balassi wrote: > To extend the functionality of Flink a separate branch of development was > dedicated for low latency, distributed stream processing support. The > development started during March of 2014 and is approaching a state where > it might be considered a candidate for becoming part of the main > repository. > > As of today a WordCount > < > https://github.com/stratosphere/stratosphere-streaming/blob/master/src/ma= in/java/eu/stratosphere/streaming/examples/wordcount/WordCountLocal.java#L3= 0-41 > > > example streaming program would fairly similar to the one that the batch > API provides: > > StreamExecutionEnvironment env =3D new StreamExecutionEnvironment(); > > DataStream> dataStream =3D > > env.readTextFile("src/test/resources/testdata/hamlet.txt") > .flatMap(new > WordCountSplitter()) > .partitionBy(0) > .map(new > WordCountCounter()); > > dataStream.print(); > > env.execute(); > > The user defined functions are extending the same classes as in the batch > case (e.g. a FlatMapFunction for a flatmap, see WordCountSplitter > < > https://github.com/stratosphere/stratosphere-streaming/blob/master/src/ma= in/java/eu/stratosphere/streaming/examples/wordcount/WordCountSplitter.java > >) > thus providing code interusability between the two approaches. > > As for performance the 0.1 version > > released in the beginning of June was slightly better on a single core th= en > Apache Storm, one of the major players of the field. Cluster performance > needs further optimization. This version provided a lower level API, fair= ly > similar to the one Storm has. For a deeper dive on this state of the > development and the challenges faced please refer to the slides > > > of a talk form the early days of June. > > The 0.2 release is coming soon with the the above demonstrated new API an= d > improved single core performance. To complete the release the cluster > performance is being measured, and the code is being decomposed into thre= e > subprojects separating core, example and addon functionality. > > As for the future fault tolerance is an unresolved issue and as a part of > the Google Summer of Code project an intern is working on iterative strea= m > processing. > > The project is mainly developed at Budapest by three members employed by > Hungarian Academy of Sciences and E=C3=B6tv=C3=B6s Lor=C3=A1nd University= and Frank Wu, > our Google Summer of Code student from Singapore. This summer the Hungari= an > Academy of Sciences also dedicated 4 interns to the project. > > The proposed 0.2 release is still dependant on the 0.5 release of > Stratosphere, however on branch snapshot-0.6 > > the > dependencies are updated to 0.6-snapshot, thus the codebase is ready for > becoming part of the main project - preferably a part of addons until it > becomes stable. > > Looking forward to your suggestions. > > Cheers, > > M=C3=A1rton, Gyula & G=C3=A1bor > --089e0118289a09f17d04fd624b17--