Return-Path: X-Original-To: apmail-flink-user-archive@minotaur.apache.org Delivered-To: apmail-flink-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 314BE17929 for ; Wed, 5 Nov 2014 14:28:55 +0000 (UTC) Received: (qmail 80471 invoked by uid 500); 5 Nov 2014 14:28:55 -0000 Delivered-To: apmail-flink-user-archive@flink.apache.org Received: (qmail 80433 invoked by uid 500); 5 Nov 2014 14:28:54 -0000 Mailing-List: contact user-help@flink.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flink.incubator.apache.org Delivered-To: mailing list user@flink.incubator.apache.org Received: (qmail 80423 invoked by uid 99); 5 Nov 2014 14:28:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Nov 2014 14:28:54 +0000 X-ASF-Spam-Status: No, hits=-2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_HI,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [192.134.164.83] (HELO mail2-relais-roc.national.inria.fr) (192.134.164.83) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Nov 2014 14:28:15 +0000 X-IronPort-AV: E=Sophos;i="5.07,320,1413237600"; d="scan'208,217";a="104963283" Received: from zmbs5.inria.fr ([128.93.142.18]) by mail2-relais-roc.national.inria.fr with ESMTP; 05 Nov 2014 15:26:13 +0100 Date: Wed, 5 Nov 2014 15:26:13 +0100 (CET) From: Anirvan BASU To: user@flink.incubator.apache.org Cc: Fabian Hueske Message-ID: <1472232068.16777675.1415197573763.JavaMail.zimbra@inria.fr> In-Reply-To: References: <1415115035.57403.ezmlm@flink.incubator.apache.org> Subject: Re: Looking for instructions & source for flink-java-examples-0.7.0-incubating-WebLogAnalysis.jar MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_16777674_2001909831.1415197573758" X-Originating-IP: [131.254.12.131] X-Mailer: Zimbra 8.0.6_GA_5922 (ZimbraWebClient - FF33 (Win)/8.0.6_GA_5922) Thread-Topic: Looking for instructions & source for flink-java-examples-0.7.0-incubating-WebLogAnalysis.jar Thread-Index: KFNYhovTf8I6P2wb2IDzE/CXvuTFaA== X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_16777674_2001909831.1415197573758 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hello Fabien and everyone,=20 In my previous post, I missed some of your questions from your last email.= =20 Here are my replies:=20 Have you checked the local file systems on all workers for output?=20 Yes, I did (in the case of using "file:///address/to/local/file" using NFS)= . They were the same empty files.=20 Did the job process any data at all? The jobs finishes within 1 second (whi= ch is still possible for very small input data).=20 The data that was used was provided to me by robert Metzger. Please see the= link here: https://github.com/rmetzger/scratch/tree/weblogdataexample/webl= og=20 Actually, the first lines in the "rank" file had some problem with the sepa= rators '|' It may be due to difference in coding between Linux machines ...= the programme would end up with some error always.=20 So I deleted the top few lines and then the programme finished with code FI= NISHED but empty files :-((=20 You can change the example program to write its output to the stdout by rep= lacing the writeAsCSV() by print(). The stdout of all workers is redirected= to the ./log/*.out files.=20 Question to you: What is the location of this stdtout ./log/* ? I could not= find it anywhere - neither in my local directories nor in the system root.= =20 Question to you: Is it possible to change the location of the stdout by cha= nging the conf file flink-conf.yaml ? Which exact parameter should I change= ?=20 Thanks in advance for all your help,=20 Anirvan=20 ----- Original Message ----- > From: "Fabian Hueske" > To: user@flink.incubator.apache.org > Sent: Tuesday, November 4, 2014 4:28:40 PM > Subject: Re: Looking for instructions & source for > flink-java-examples-0.7.0-incubating-WebLogAnalysis.jar > Hi Anirvan, > you specify input and output as files in the local file system (file:///)= . > Each worker needs access to the all input files, which means that each > worker needs (a copy of) these files in its local file system. > The common setup to use Flink in a distributed cluster is to use a > distributed data store such as HDFS (or a data store that can be accessed= by > each node). > Using a shared file system (like NFS) that is mounted into each worker wo= uld > work, but remember, that all nodes will concurrently read and write to th= e > shared system. > Have you checked the local file systems on all workers for output? > Did the job process any data at all? The jobs finishes within 1 second (w= hich > is still possible for very small input data). > You can change the example program to write its output to the stdout by > replacing the writeAsCSV() by print(). The stdout of all workers is > redirected to the ./log/*.out files. > Best, Fabian > 2014-11-04 16:08 GMT+01:00 Anirvan BASU < anirvan.basu@inria.fr > : > > Hello Robert, Stephan et al, >=20 > > Hope you are doing fine in Berlin. >=20 > > I am getting back to you on my previous problem on the WebLogAnalysis > > example, after a long time. >=20 > > We are currently using Flink 0.7.0 over a 10-node cluster in Manager-Wo= rker > > configuration. >=20 > > We ran the following command: >=20 > > $ ./flink/bin/flink run > > flink/examples/flink-java-examples-0.7.0-incubating-WebLogAnalysis.jar > > file:///home/abasu/examples/Weblogs/documents > > file:///home/abasu/examples/Weblogs/ranks > > file:///home/abasu/examples/Weblogs/visits > > file:///home/abasu/examples/Weblogs/result >=20 > > For the documents, rank and visits files, we used the data generated by= you > > from this link: >=20 > > https://github.com/rmetzger/scratch/tree/weblogdataexample/weblog >=20 > > The program executed with the following output: >=20 > > 11/04/2014 14:58:12: Job execution switched to status RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (1/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (1/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (2/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (2/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (3/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (3/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (4/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (4/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (5/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (5/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (6/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (6/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (7/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (7/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (8/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (8/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (9/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (9/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (1/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (1/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (2/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (2/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (3/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (3/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (4/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (4/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (5/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (5/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (6/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (6/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (7/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (7/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (8/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (8/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (9/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (9/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (1/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (1/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (2/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (2/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (3/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (3/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (4/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (4/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (5/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (5/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (6/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (1/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (6/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (7/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (7/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (8/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (8/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (9/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (9/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (2/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (5/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (6/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (7/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (8/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (2/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (9/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (5/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (6/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (8/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (7/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (3/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (9/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (1/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (1/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (2/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (2/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (8/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (2/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (3/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (2/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (3/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (3/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (1/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (1/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (1/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (3/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (1/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (4/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (1/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (4/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (5/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (6/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (4/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (7/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (9/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (3/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (1/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (4/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (4/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (6/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (6/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (5/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (5/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (6/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (5/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (2/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (2/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (4/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (2/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (7/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (7/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (8/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (8/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (7/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (3/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (3/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (8/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (9/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (9/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (3/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (9/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (4/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (4/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (4/9) switched to RUNNING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (5/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (5/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:12: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (5/9) switched to RUNNING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (6/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (6/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (6/9) switched to RUNNING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (7/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (7/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (7/9) switched to RUNNING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (8/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (8/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (8/9) switched to RUNNING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (9/9) switched to SCHEDULED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (9/9) switched to DEPLOYING >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (9/9) switched to RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (7/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (7/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (7/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (8/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (8/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (8/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (5/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (5/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (6/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (6/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (5/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (4/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (6/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (4/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (7/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (4/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (3/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (3/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (3/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (2/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (2/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (2/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (5/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (4/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (3/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (2/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (8/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (6/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (1/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (4/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (2/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (5/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (9/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (4/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (8/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (6/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (2/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (5/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (7/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (9/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (6/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (7/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (8/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: > > Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFu= nction) > > (9/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (9/9) switched= to > > SCHEDULED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (9/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (9/9) switched= to > > DEPLOYING >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (4/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (2/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (7/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (6/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (5/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (8/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (1/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/documents) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) > > -> Map (Projection [0]) (3/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (1/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (1/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/visits) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsB= yDate) > > -> Map (Projection [0]) (3/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CHAIN DataSource (CSV Input (|) > > file:/home/abasu/examples/Weblogs/ranks) -> Filter > > (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) > > (3/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (9/9) switched= to > > RUNNING >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (7/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (7/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (8/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (5/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (8/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (6/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (5/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (6/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (4/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (4/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (1/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (3/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (3/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (2/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (2/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: DataSink(CsvOutputFormat (path: > > file:/home/abasu/examples/Weblogs/result, delimiter: |)) (9/9) switched= to > > FINISHED >=20 > > 11/04/2014 14:58:13: CoGroup > > (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisit= s) > > (9/9) switched to FINISHED >=20 > > 11/04/2014 14:58:13: Job execution switched to status FINISHED >=20 > > The following directory was created: >=20 > > /home/abasu/examples/Weblogs/result >=20 > > with 9 files (named 1 to 9) >=20 > > All these files are empty! >=20 > > Hence my naive question: Is this the expected output ? Or what should b= e > > the > > expected output for an error-free run ? >=20 > > Please let me know where we are going wrong? >=20 > > If possible do you have other data generated to try the WebLogAnalysis > > example ? >=20 > > Thanks in advance for your advice and help, >=20 > > Anirvan >=20 > > > > > > > > > > > Le 23/09/2014 17:22, rmetzger0 [via Apache Flink > > > > > > > > > > > (Incubator) > > > > > > > > > > > User > > > > > > > > > > > Mailing > > > > > > > > > > > List archive.] a =E9crit : > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > Hi, > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > you have to use the "WebLogDataGenerator" found her= e: > > > > > > > > > > > > https://github.com/apache/incubator-flink/blob/mast= er/flink-examples/flink-java-examples/src/main/java/org/apache/flink/exampl= es/java/relational/util/WebLogDataGenerator.java > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > It accepts two arguments, the number of documents a= nd > > > > > > > > > > > > visits. > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > The generated files are located in /tmp/documents > > > > > > > > > > > > /tmp/ranks > > > > > > > > > > > > and > > > > > > > > > > > > /tmp/visits. > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > I've generated some sample data for you, located he= re: > > > > > > > > > > > > https://github.com/rmetzger/scratch/tree/weblogdata= example/weblog > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > Best, > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > Robert > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > On Tue, Sep 23, 2014 at 4:05 PM, nirvanesque [via > > > > > > > > > > > > Apache > > > > > > > > > > > > Flink > > > > > > > > > > > > (Incubator) > > > > > > > > > > > > User Mailing List archive.] < [hidden email] > wrot= e: > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > Hello Robert, > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > Thanks as usual for all your help with the > > > > > > > > > > > > > information. > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > I'm trying in vain to create the different input > > > > > > > > > > > > > files > > > > > > > > > > > > > from > > > > > > > > > > > > > the > > > > > > > > > > > > > program > > > > > > > > > > > > > source code but running into difficulties. > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > Could you (or anyone else) please post here sampl= es > > > > > > > > > > > > > of > > > > > > > > > > > > > the > > > > > > > > > > > > > 4 > > > > > > > > > > > > > inputs > > > > > > > > > > > > > that > > > > > > > > > > > > > are > > > > > > > > > > > > > required to run this program ? > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > Thanks in advance, > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > Anirvan > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > Le 09/09/2014 23:54, rmetzger0 [via Apache Flink > > > > > > > > > > > > > (Incubator) > > > > > > > > > > > > > User > > > > > > > > > > > > > Mailing > > > > > > > > > > > > > List archive.] a =E9crit : > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > Hi Anirvan, > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > sorry for the late response. You've posted the > > > > > > > > > > > > > > question > > > > > > > > > > > > > > to > > > > > > > > > > > > > > Nabble, > > > > > > > > > > > > > > which > > > > > > > > > > > > > > is > > > > > > > > > > > > > > only a mirror of our actual mailing list at [hi= dden > > > > > > > > > > > > > > email] > > > > > > > > > > > > > > . > > > > > > > > > > > > > > Sadly, > > > > > > > > > > > > > > the > > > > > > > > > > > > > > message is not automatically posted to the apac= he > > > > > > > > > > > > > > list > > > > > > > > > > > > > > because > > > > > > > > > > > > > > the > > > > > > > > > > > > > > apache > > > > > > > > > > > > > > server is rejecting the mails from nabble. > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > I've already asked and there is no way to chang= e > > > > > > > > > > > > > > this > > > > > > > > > > > > > > behavior. > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > So I actually saw the two messages you posted h= ere > > > > > > > > > > > > > > by > > > > > > > > > > > > > > accident. > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > Regarding your actual question: > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > - The command line arguments for the WebLogAnal= ysis > > > > > > > > > > > > > > example > > > > > > > > > > > > > > are: > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > "WebLogAnalysis > > > > > > > > > > > > > > > > > > > > > > > > > > > path> > > > > > > > > > > > > > > > > > > > > > > > > > > > path>" > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > - Regarding the "info -d" command. I think its = an > > > > > > > > > > > > > > artifact > > > > > > > > > > > > > > from > > > > > > > > > > > > > > our > > > > > > > > > > > > > > old > > > > > > > > > > > > > > java > > > > > > > > > > > > > > API. I've filed an issue in JIRA: > > > > > > > > > > > > > > https://issues.apache.org/jira/browse/FLINK-109= 5 > > > > > > > > > > > > > > Lets > > > > > > > > > > > > > > see > > > > > > > > > > > > > > how > > > > > > > > > > > > > > we > > > > > > > > > > > > > > resolve > > > > > > > > > > > > > > it. > > > > > > > > > > > > > > Thanks for reporting this! > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > You can find the source code of all of our exam= ples > > > > > > > > > > > > > > in > > > > > > > > > > > > > > the > > > > > > > > > > > > > > source > > > > > > > > > > > > > > release > > > > > > > > > > > > > > of > > > > > > > > > > > > > > Flink (in the flink-examples/flink-java-example= s > > > > > > > > > > > > > > project. > > > > > > > > > > > > > > You > > > > > > > > > > > > > > can > > > > > > > > > > > > > > also > > > > > > > > > > > > > > access the source (and hence the examples) thro= ugh > > > > > > > > > > > > > > GitHub: > > > > > > > > > > > > > > https://github.com/apache/incubator-flink/blob/= master/flink-examples/flink-java-examples/src/main/java/org/apache/flink/ex= ample/java/relational/WebLogAnalysis.java > > > > > > > > > > > > > > . > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > To build the examples, you can run: "mvn clean > > > > > > > > > > > > > > package > > > > > > > > > > > > > > -DskipTests" > > > > > > > > > > > > > > in > > > > > > > > > > > > > > the > > > > > > > > > > > > > > "flink-examples/flink-java-examples" directory. > > > > > > > > > > > > > > This > > > > > > > > > > > > > > will > > > > > > > > > > > > > > re-build > > > > > > > > > > > > > > them. > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > If you don't want to import the whole Flink pro= ject > > > > > > > > > > > > > > just > > > > > > > > > > > > > > for > > > > > > > > > > > > > > playing > > > > > > > > > > > > > > around > > > > > > > > > > > > > > with the examples, you can also create an empty > > > > > > > > > > > > > > maven > > > > > > > > > > > > > > project. > > > > > > > > > > > > > > This > > > > > > > > > > > > > > script: > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > curl > > > > > > > > > > > > > > https://raw.githubusercontent.com/apache/incuba= tor-flink/master/flink-quickstart/quickstart.sh > > > > > > > > > > > > > > | bash > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > will automatically set everything up for you. J= ust > > > > > > > > > > > > > > import > > > > > > > > > > > > > > the > > > > > > > > > > > > > > "quickstart" > > > > > > > > > > > > > > project into Eclipse or IntelliJ. It will downl= oad > > > > > > > > > > > > > > all > > > > > > > > > > > > > > dependencies > > > > > > > > > > > > > > and > > > > > > > > > > > > > > package everything correctly. If you want to us= e an > > > > > > > > > > > > > > example > > > > > > > > > > > > > > there, > > > > > > > > > > > > > > just > > > > > > > > > > > > > > copy > > > > > > > > > > > > > > the Java file into the "quickstart" project. > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > The examples are indeed a very good way to lear= n > > > > > > > > > > > > > > how > > > > > > > > > > > > > > to > > > > > > > > > > > > > > write > > > > > > > > > > > > > > Flink > > > > > > > > > > > > > > jobs. > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > Please continue asking if you have further > > > > > > > > > > > > > > questions! > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > Best, > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 >=20 > > > > > > > > > > > > > > Robert > > > > > > > > > > > > >=20 > > > > > > > > > > > >=20 > > > > > > > > > > >=20 > > > > > > > > > >=20 > > > > > > > > >=20 > > > > > > > >=20 > > > > > > >=20 > > > > > >=20 > > > > >=20 > > > >=20 > > >=20 > >=20 > ------=_Part_16777674_2001909831.1415197573758 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hello Fabien and everyone,
<= br>
In my previous post, I missed some of your questions from you= r last email.

Here are my replies:
Have you checked the local file systems on all workers for output?<= /div>
Yes, I did (in the case of using "file:///address/to/local/file" = using NFS). They were the same empty files.

Di= d the job process any data at all? The jobs finishes within 1 second (which= is still possible for very small input data).
The data that was = used was provided to me by robert Metzger. Please see the link here: https://github.com/rmetzger/scratch/tree/weblogdata= example/weblog
Actually, the first lines in the "rank" fi= le had some problem with the separators '|' It may be due to difference in = coding between Linux machines ... the programme would end up with some erro= r always.
So I deleted the top few lines and then the program= me finished with code FINISHED but empty files :-((

You can change the example program to write its output to the stdout = by replacing the writeAsCSV() by print(). The stdout of all workers is redi= rected to the ./log/*.out files.
Question to you: What is the loc= ation of this stdtout ./log/*  ? I could not find it anywhere - neithe= r in my local directories nor in the system root.
Question to= you: Is it possible to change the location of the stdout by changing the c= onf file flink-conf.yaml ? Which exact parameter should I change ?

Thanks in advance for all your help,
Ani= rvan



From: "Fabian Huesk= e" <fhueske@apache.org>
To: user@flink.incubator.apache.org=
Sent: Tuesday, November 4, 2014 4:28:40 PM
Subject: Re= : Looking for instructions & source for flink-java-examples-0.7.0-incub= ating-WebLogAnalysis.jar

Hi Anirvan= ,

you specify input and output as files in th= e local file system (file:///). Each worker needs access to the all input f= iles, which means that each worker needs (a copy of) these files in its loc= al file system.
The common setup to use Flink in a distributed cl= uster is to use a distributed data store such as HDFS (or a data store that= can be accessed by each node).
Using a shared file system (like = NFS) that is mounted into each worker would work, but remember, that all no= des will concurrently read and write to the shared system.

Have you checked the local file systems on all workers for output?=
Did the job process any data at all? The jobs finishes within 1 = second (which is still possible for very small input data).

<= /div>
You can change the example program to write its output to the std= out by replacing the writeAsCSV() by print(). The stdout of all workers is = redirected to the ./log/*.out files.

Best, Fabian<= /div>

2014-1= 1-04 16:08 GMT+01:00 Anirvan BASU <anirvan.basu@inria.fr>:
Hello Rob= ert, Stephan et al,

Hope you are doing fine in= Berlin.

I am getting back to you on my previo= us problem on the WebLogAnalysis example, after a long time.
=
We are currently using Flink 0.7.0 over a 10-node cluster in= Manager-Worker configuration.

We ran the foll= owing command:
$ ./flink/bin/flink run flink/examples/flink-j= ava-examples-0.7.0-incubating-WebLogAnalysis.jar file:///home/abasu/example= s/Weblogs/documents file:///home/abasu/examples/Weblogs/ranks file:///home/= abasu/examples/Weblogs/visits file:///home/abasu/examples/Weblogs/result=

For the documents, r= ank and visits files, we used the data generated by you from this link:
=

=
The program executed with the following output:
11= /04/2014 14:58:12:    Job execution switched to status RUNNI= NG
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|= ) file:/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.= examples.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Proj= ection [0]) (1/9) switched to SCHEDULED
11/04/2014 14:58:12:  =   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/vi= sits) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysi= s$FilterVisitsByDate) -> Map (Projection [0]) (1/9) switched to DEPLOYIN= G
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|)= file:/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.e= xamples.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Proje= ction [0]) (2/9) switched to SCHEDULED
11/04/2014 14:58:12:  &= nbsp; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/vis= its) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis= $FilterVisitsByDate) -> Map (Projection [0]) (2/9) switched to DEPLOYING=
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) = file:/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.ex= amples.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projec= tion [0]) (3/9) switched to SCHEDULED
11/04/2014 14:58:12:  &n= bsp; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visi= ts) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$= FilterVisitsByDate) -> Map (Projection [0]) (3/9) switched to DEPLOYING<= br>11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) f= ile:/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.exa= mples.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Project= ion [0]) (4/9) switched to SCHEDULED
11/04/2014 14:58:12:  &nb= sp; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visit= s) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$F= ilterVisitsByDate) -> Map (Projection [0]) (4/9) switched to DEPLOYING11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) fi= le:/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.exam= ples.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projecti= on [0]) (5/9) switched to SCHEDULED
11/04/2014 14:58:12:  &nbs= p; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits= ) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Fi= lterVisitsByDate) -> Map (Projection [0]) (5/9) switched to DEPLOYING11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) fil= e:/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.examp= les.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projectio= n [0]) (6/9) switched to SCHEDULED
11/04/2014 14:58:12:   = ; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits)= -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Fil= terVisitsByDate) -> Map (Projection [0]) (6/9) switched to DEPLOYING
= 11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file= :/home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.exampl= es.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection= [0]) (7/9) switched to SCHEDULED
11/04/2014 14:58:12:   = CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) = -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Filt= erVisitsByDate) -> Map (Projection [0]) (7/9) switched to DEPLOYING
1= 1/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:= /home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.example= s.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection = [0]) (8/9) switched to SCHEDULED
11/04/2014 14:58:12:    = CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -= > Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Filte= rVisitsByDate) -> Map (Projection [0]) (8/9) switched to DEPLOYING
11= /04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/= home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.examples= .java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [= 0]) (9/9) switched to SCHEDULED
11/04/2014 14:58:12:    C= HAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -&= gt; Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Filter= VisitsByDate) -> Map (Projection [0]) (9/9) switched to DEPLOYING
11/= 04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/h= ome/abasu/examples/Weblogs/documents) -> Filter (org.apache.flink.exampl= es.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projectio= n [0]) (1/9) switched to SCHEDULED
11/04/2014 14:58:12:   = ; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/documen= ts) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$= FilterDocByKeyWords) -> Map (Projection [0]) (1/9) switched to DEPLOYING=
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) = file:/home/abasu/examples/Weblogs/documents) -> Filter (org.apache.flink= .examples.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Pr= ojection [0]) (2/9) switched to SCHEDULED
11/04/2014 14:58:12: &nbs= p;  CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/= documents) -> Filter (org.apache.flink.examples.java.relational.WebLogAn= alysis$FilterDocByKeyWords) -> Map (Projection [0]) (2/9) switched to DE= PLOYING
11/04/2014 14:58:12:    CHAIN DataSource (CSV Inp= ut (|) file:/home/abasu/examples/Weblogs/documents) -> Filter (org.apach= e.flink.examples.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> = Map (Projection [0]) (3/9) switched to SCHEDULED
11/04/2014 14:58:12:&nb= sp;   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/W= eblogs/documents) -> Filter (org.apache.flink.examples.java.relational.W= ebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (3/9) switche= d to DEPLOYING
11/04/2014 14:58:12:    CHAIN DataSource (= CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Filter (or= g.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKeyWords)= -> Map (Projection [0]) (4/9) switched to SCHEDULED
11/04/2014 14:58= :12:    CHAIN DataSource (CSV Input (|) file:/home/abasu/exa= mples/Weblogs/documents) -> Filter (org.apache.flink.examples.java.relat= ional.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (4/9) = switched to DEPLOYING
11/04/2014 14:58:12:    CHAIN DataS= ource (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Fil= ter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKe= yWords) -> Map (Projection [0]) (5/9) switched to SCHEDULED
11/04/201= 4 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/ab= asu/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.jav= a.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0])= (5/9) switched to DEPLOYING
11/04/2014 14:58:12:    CHAI= N DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -&= gt; Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Filter= DocByKeyWords) -> Map (Projection [0]) (6/9) switched to SCHEDULED
11= /04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/= home/abasu/examples/Weblogs/documents) -> Filter (org.apache.flink.examp= les.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projecti= on [0]) (6/9) switched to DEPLOYING
11/04/2014 14:58:12:  &nbs= p; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/docume= nts) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis= $FilterDocByKeyWords) -> Map (Projection [0]) (7/9) switched to SCHEDULE= D
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|)= file:/home/abasu/examples/Weblogs/documents) -> Filter (org.apache.flin= k.examples.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (P= rojection [0]) (7/9) switched to DEPLOYING
11/04/2014 14:58:12: &nb= sp;  CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs= /documents) -> Filter (org.apache.flink.examples.java.relational.WebLogA= nalysis$FilterDocByKeyWords) -> Map (Projection [0]) (8/9) switched to S= CHEDULED
11/04/2014 14:58:12:    CHAIN DataSource (CSV In= put (|) file:/home/abasu/examples/Weblogs/documents) -> Filter (org.apac= he.flink.examples.java.relational.WebLogAnalysis$FilterDocByKeyWords) ->= Map (Projection [0]) (8/9) switched to DEPLOYING
11/04/2014 14:58:12:&n= bsp;   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/= Weblogs/documents) -> Filter (org.apache.flink.examples.java.relational.= WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (9/9) switch= ed to SCHEDULED
11/04/2014 14:58:12:    CHAIN DataSource = (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Filter (o= rg.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKeyWords= ) -> Map (Projection [0]) (9/9) switched to DEPLOYING
11/04/2014 14:5= 8:12:    CHAIN DataSource (CSV Input (|) file:/home/abasu/ex= amples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relation= al.WebLogAnalysis$FilterByRank) (1/9) switched to SCHEDULED
11/04/2014 1= 4:58:12:    CHAIN DataSource (CSV Input (|) file:/home/abasu= /examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relat= ional.WebLogAnalysis$FilterByRank) (1/9) switched to DEPLOYING
11/04/201= 4 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/ab= asu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.re= lational.WebLogAnalysis$FilterByRank) (2/9) switched to SCHEDULED
11/04/= 2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home= /abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java= .relational.WebLogAnalysis$FilterByRank) (2/9) switched to DEPLOYING
11/= 04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/h= ome/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.j= ava.relational.WebLogAnalysis$FilterByRank) (3/9) switched to SCHEDULED
= 11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file= :/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.example= s.java.relational.WebLogAnalysis$FilterByRank) (3/9) switched to DEPLOYING<= br>11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) f= ile:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.exam= ples.java.relational.WebLogAnalysis$FilterByRank) (4/9) switched to SCHEDUL= ED
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|= ) file:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.e= xamples.java.relational.WebLogAnalysis$FilterByRank) (4/9) switched to DEPL= OYING
11/04/2014 14:58:12:    CHAIN DataSource (CSV Input= (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flin= k.examples.java.relational.WebLogAnalysis$FilterByRank) (5/9) switched to S= CHEDULED
11/04/2014 14:58:12:    CHAIN DataSource (CSV In= put (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.f= link.examples.java.relational.WebLogAnalysis$FilterByRank) (5/9) switched t= o DEPLOYING
11/04/2014 14:58:12:    CHAIN DataSource (CSV= Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apach= e.flink.examples.java.relational.WebLogAnalysis$FilterByRank) (6/9) switche= d to SCHEDULED
11/04/2014 14:58:12:    CHAIN DataSource (= CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -> Filter (org.a= pache.flink.examples.java.relational.WebLogAnalysis$FilterVisitsByDate) -&g= t; Map (Projection [0]) (1/9) switched to RUNNING
11/04/2014 14:58:12:&n= bsp;   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/= Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relational.WebL= ogAnalysis$FilterByRank) (6/9) switched to DEPLOYING
11/04/2014 14:58:12= :    CHAIN DataSource (CSV Input (|) file:/home/abasu/exampl= es/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relational.W= ebLogAnalysis$FilterByRank) (7/9) switched to SCHEDULED
11/04/2014 14:58= :12:    CHAIN DataSource (CSV Input (|) file:/home/abasu/exa= mples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relationa= l.WebLogAnalysis$FilterByRank) (7/9) switched to DEPLOYING
11/04/2014 14= :58:12:    CHAIN DataSource (CSV Input (|) file:/home/abasu/= examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relati= onal.WebLogAnalysis$FilterByRank) (8/9) switched to SCHEDULED
11/04/2014= 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/aba= su/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.rel= ational.WebLogAnalysis$FilterByRank) (8/9) switched to DEPLOYING
11/04/2= 014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/= abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.= relational.WebLogAnalysis$FilterByRank) (9/9) switched to SCHEDULED
11/0= 4/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/ho= me/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.ja= va.relational.WebLogAnalysis$FilterByRank) (9/9) switched to DEPLOYING
1= 1/04/2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:= /home/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.example= s.java.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection = [0]) (2/9) switched to RUNNING
11/04/2014 14:58:12:    CH= AIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -&g= t; Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterV= isitsByDate) -> Map (Projection [0]) (5/9) switched to RUNNING
11/04/= 2014 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home= /abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.jav= a.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) = (6/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN D= ataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisits= ByDate) -> Map (Projection [0]) (7/9) switched to RUNNING
11/04/2014 = 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/abas= u/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.java.rel= ational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) (8/9)= switched to RUNNING
11/04/2014 14:58:12:    CHAIN DataSo= urce (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Filt= er (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKey= Words) -> Map (Projection [0]) (2/9) switched to RUNNING
11/04/2014 1= 4:58:12:    CHAIN DataSource (CSV Input (|) file:/home/abasu= /examples/Weblogs/visits) -> Filter (org.apache.flink.examples.java.rela= tional.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) (9/9) = switched to RUNNING
11/04/2014 14:58:12:    CHAIN DataSou= rce (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Filte= r (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByKeyW= ords) -> Map (Projection [0]) (5/9) switched to RUNNING
11/04/2014 14= :58:12:    CHAIN DataSource (CSV Input (|) file:/home/abasu/= examples/Weblogs/documents) -> Filter (org.apache.flink.examples.java.re= lational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (6/= 9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN Data= Source (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByK= eyWords) -> Map (Projection [0]) (8/9) switched to RUNNING
11/04/2014= 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/aba= su/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.java= .relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) = (7/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN D= ataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisits= ByDate) -> Map (Projection [0]) (3/9) switched to RUNNING
11/04/2014 = 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/abas= u/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.java.= relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (= 9/9) switched to RUNNING
11/04/2014 14:58:12:    Join(org= .apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (1/9= ) switched to SCHEDULED
11/04/2014 14:58:12:    Join(org.= apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (1/9)= switched to DEPLOYING
11/04/2014 14:58:12:    Join(org.a= pache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (2/9) = switched to SCHEDULED
11/04/2014 14:58:12:    Join(org.ap= ache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (2/9) s= witched to DEPLOYING
11/04/2014 14:58:12:    CHAIN DataSo= urce (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter (= org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) (8/9= ) switched to RUNNING
11/04/2014 14:58:12:    CHAIN DataS= ource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter = (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) (2/= 9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN Data= Source (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByK= eyWords) -> Map (Projection [0]) (3/9) switched to RUNNING
11/04/2014= 14:58:12:    Join(org.apache.flink.api.java.operators.JoinO= perator$ProjectFlatJoinFunction) (2/9) switched to RUNNING
11/04/2014 14= :58:12:    Join(org.apache.flink.api.java.operators.JoinOper= ator$ProjectFlatJoinFunction) (3/9) switched to SCHEDULED
11/04/2014 14:= 58:12:    Join(org.apache.flink.api.java.operators.JoinOpera= tor$ProjectFlatJoinFunction) (3/9) switched to DEPLOYING
11/04/2014 14:5= 8:12:    CoGroup (org.apache.flink.examples.java.relational.= WebLogAnalysis$AntiJoinVisits) (1/9) switched to SCHEDULED
11/04/2014 14= :58:12:    CoGroup (org.apache.flink.examples.java.relationa= l.WebLogAnalysis$AntiJoinVisits) (1/9) switched to DEPLOYING
11/04/2014 = 14:58:12:    CHAIN DataSource (CSV Input (|) file:/home/abas= u/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.java.= relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (= 1/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN Da= taSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filt= er (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) = (3/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN D= ataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Fil= ter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank)= (1/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN = DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -> F= ilter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisit= sByDate) -> Map (Projection [0]) (4/9) switched to RUNNING
11/04/2014= 14:58:12:    Join(org.apache.flink.api.java.operators.JoinO= perator$ProjectFlatJoinFunction) (1/9) switched to RUNNING
11/04/2014 14= :58:12:    CHAIN DataSource (CSV Input (|) file:/home/abasu/= examples/Weblogs/documents) -> Filter (org.apache.flink.examples.java.re= lational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (4/= 9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN Data= Source (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter= (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) (5= /9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN Dat= aSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filte= r (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) (= 6/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN Da= taSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Filt= er (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank) = (4/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN D= ataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Fil= ter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank)= (7/9) switched to RUNNING
11/04/2014 14:58:12:    CHAIN = DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank= ) (9/9) switched to RUNNING
11/04/2014 14:58:12:    Join(= org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (= 3/9) switched to RUNNING
11/04/2014 14:58:12:    CoGroup = (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (= 1/9) switched to RUNNING
11/04/2014 14:58:12:    Join(org= .apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (4/9= ) switched to SCHEDULED
11/04/2014 14:58:12:    Join(org.= apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (4/9)= switched to DEPLOYING
11/04/2014 14:58:12:    Join(org.a= pache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (6/9) = switched to SCHEDULED
11/04/2014 14:58:12:    Join(org.ap= ache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (6/9) s= witched to DEPLOYING
11/04/2014 14:58:12:    Join(org.apa= che.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (5/9) sw= itched to SCHEDULED
11/04/2014 14:58:12:    Join(org.apac= he.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (5/9) swi= tched to DEPLOYING
11/04/2014 14:58:12:    Join(org.apach= e.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (6/9) swit= ched to RUNNING
11/04/2014 14:58:12:    Join(org.apache.f= link.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (5/9) switche= d to RUNNING
11/04/2014 14:58:12:    CoGroup (org.apache.= flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (2/9) switche= d to SCHEDULED
11/04/2014 14:58:12:    CoGroup (org.apach= e.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (2/9) switc= hed to DEPLOYING
11/04/2014 14:58:12:    Join(org.apache.= flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (4/9) switch= ed to RUNNING
11/04/2014 14:58:12:    CoGroup (org.apache= .flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (2/9) switch= ed to RUNNING
11/04/2014 14:58:12:    Join(org.apache.fli= nk.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (7/9) switched = to SCHEDULED
11/04/2014 14:58:12:    Join(org.apache.flin= k.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (7/9) switched t= o DEPLOYING
11/04/2014 14:58:12:    Join(org.apache.flink= .api.java.operators.JoinOperator$ProjectFlatJoinFunction) (8/9) switched to= SCHEDULED
11/04/2014 14:58:12:    Join(org.apache.flink.= api.java.operators.JoinOperator$ProjectFlatJoinFunction) (8/9) switched to = DEPLOYING
11/04/2014 14:58:12:    Join(org.apache.flink.a= pi.java.operators.JoinOperator$ProjectFlatJoinFunction) (7/9) switched to R= UNNING
11/04/2014 14:58:12:    CoGroup (org.apache.flink.= examples.java.relational.WebLogAnalysis$AntiJoinVisits) (3/9) switched to S= CHEDULED
11/04/2014 14:58:12:    CoGroup (org.apache.flin= k.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (3/9) switched to= DEPLOYING
11/04/2014 14:58:12:    Join(org.apache.flink.= api.java.operators.JoinOperator$ProjectFlatJoinFunction) (8/9) switched to = RUNNING
11/04/2014 14:58:12:    Join(org.apache.flink.api= .java.operators.JoinOperator$ProjectFlatJoinFunction) (9/9) switched to SCH= EDULED
11/04/2014 14:58:12:    Join(org.apache.flink.api.= java.operators.JoinOperator$ProjectFlatJoinFunction) (9/9) switched to DEPL= OYING
11/04/2014 14:58:12:    CoGroup (org.apache.flink.e= xamples.java.relational.WebLogAnalysis$AntiJoinVisits) (3/9) switched to RU= NNING
11/04/2014 14:58:12:    Join(org.apache.flink.api.j= ava.operators.JoinOperator$ProjectFlatJoinFunction) (9/9) switched to RUNNI= NG
11/04/2014 14:58:12:    CoGroup (org.apache.flink.exam= ples.java.relational.WebLogAnalysis$AntiJoinVisits) (4/9) switched to SCHED= ULED
11/04/2014 14:58:12:    CoGroup (org.apache.flink.ex= amples.java.relational.WebLogAnalysis$AntiJoinVisits) (4/9) switched to DEP= LOYING
11/04/2014 14:58:12:    CoGroup (org.apache.flink.= examples.java.relational.WebLogAnalysis$AntiJoinVisits) (4/9) switched to R= UNNING
11/04/2014 14:58:12:    CoGroup (org.apache.flink.= examples.java.relational.WebLogAnalysis$AntiJoinVisits) (5/9) switched to S= CHEDULED
11/04/2014 14:58:12:    CoGroup (org.apache.flin= k.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (5/9) switched to= DEPLOYING
11/04/2014 14:58:12:    CoGroup (org.apache.fl= ink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (5/9) switched = to RUNNING
11/04/2014 14:58:13:    CoGroup (org.apache.fl= ink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (6/9) switched = to SCHEDULED
11/04/2014 14:58:13:    CoGroup (org.apache.= flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (6/9) switche= d to DEPLOYING
11/04/2014 14:58:13:    CoGroup (org.apach= e.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (6/9) switc= hed to RUNNING
11/04/2014 14:58:13:    CoGroup (org.apach= e.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (7/9) switc= hed to SCHEDULED
11/04/2014 14:58:13:    CoGroup (org.apa= che.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (7/9) swi= tched to DEPLOYING
11/04/2014 14:58:13:    CoGroup (org.a= pache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (7/9) s= witched to RUNNING
11/04/2014 14:58:13:    CoGroup (org.a= pache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (8/9) s= witched to SCHEDULED
11/04/2014 14:58:13:    CoGroup (org= .apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (8/9)= switched to DEPLOYING
11/04/2014 14:58:13:    CoGroup (o= rg.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (8/= 9) switched to RUNNING
11/04/2014 14:58:13:    CoGroup (o= rg.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (9/= 9) switched to SCHEDULED
11/04/2014 14:58:13:    CoGroup = (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (= 9/9) switched to DEPLOYING
11/04/2014 14:58:13:    CoGrou= p (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits)= (9/9) switched to RUNNING
11/04/2014 14:58:13:    DataSi= nk(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, delimit= er: |)) (7/9) switched to SCHEDULED
11/04/2014 14:58:13:  &nbs= p; Join(org.apache.flink.api.java.operators.JoinOperator$ProjectFlatJoinFun= ction) (7/9) switched to FINISHED
11/04/2014 14:58:13:   = DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, = delimiter: |)) (7/9) switched to DEPLOYING
11/04/2014 14:58:13: &nb= sp;  DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs= /result, delimiter: |)) (8/9) switched to SCHEDULED
11/04/2014 14:58:13:=     DataSink(CsvOutputFormat (path: file:/home/abasu/example= s/Weblogs/result, delimiter: |)) (8/9) switched to DEPLOYING
11/04/2014 = 14:58:13:    Join(org.apache.flink.api.java.operators.JoinOp= erator$ProjectFlatJoinFunction) (8/9) switched to FINISHED
11/04/2014 14= :58:13:    DataSink(CsvOutputFormat (path: file:/home/abasu/= examples/Weblogs/result, delimiter: |)) (5/9) switched to SCHEDULED
11/0= 4/2014 14:58:13:    DataSink(CsvOutputFormat (path: file:/ho= me/abasu/examples/Weblogs/result, delimiter: |)) (5/9) switched to DEPLOYIN= G
11/04/2014 14:58:13:    DataSink(CsvOutputFormat (path:= file:/home/abasu/examples/Weblogs/result, delimiter: |)) (6/9) switched to= SCHEDULED
11/04/2014 14:58:13:    DataSink(CsvOutputForm= at (path: file:/home/abasu/examples/Weblogs/result, delimiter: |)) (6/9) sw= itched to DEPLOYING
11/04/2014 14:58:13:    Join(org.apac= he.flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (5/9) swi= tched to FINISHED
11/04/2014 14:58:13:    Join(org.apache= .flink.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (4/9) switc= hed to FINISHED
11/04/2014 14:58:13:    Join(org.apache.f= link.api.java.operators.JoinOperator$ProjectFlatJoinFunction) (6/9) switche= d to FINISHED
11/04/2014 14:58:13:    DataSink(CsvOutputF= ormat (path: file:/home/abasu/examples/Weblogs/result, delimiter: |)) (4/9)= switched to SCHEDULED
11/04/2014 14:58:13:    DataSink(C= svOutputFormat (path: file:/home/abasu/examples/Weblogs/result, delimiter: = |)) (7/9) switched to RUNNING
11/04/2014 14:58:13:    Dat= aSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, deli= miter: |)) (4/9) switched to DEPLOYING
11/04/2014 14:58:13:  &= nbsp; DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/res= ult, delimiter: |)) (3/9) switched to SCHEDULED
11/04/2014 14:58:13:&nbs= p;   DataSink(CsvOutputFormat (path: file:/home/abasu/examples/We= blogs/result, delimiter: |)) (3/9) switched to DEPLOYING
11/04/2014 14:5= 8:13:    Join(org.apache.flink.api.java.operators.JoinOperat= or$ProjectFlatJoinFunction) (3/9) switched to FINISHED
11/04/2014 14:58:= 13:    Join(org.apache.flink.api.java.operators.JoinOperator= $ProjectFlatJoinFunction) (2/9) switched to FINISHED
11/04/2014 14:58:13= :    DataSink(CsvOutputFormat (path: file:/home/abasu/exampl= es/Weblogs/result, delimiter: |)) (2/9) switched to SCHEDULED
11/04/2014= 14:58:13:    DataSink(CsvOutputFormat (path: file:/home/aba= su/examples/Weblogs/result, delimiter: |)) (2/9) switched to DEPLOYING
1= 1/04/2014 14:58:13:    DataSink(CsvOutputFormat (path: file:= /home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switched to SCHED= ULED
11/04/2014 14:58:13:    DataSink(CsvOutputFormat (pa= th: file:/home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switched= to DEPLOYING
11/04/2014 14:58:13:    DataSink(CsvOutputF= ormat (path: file:/home/abasu/examples/Weblogs/result, delimiter: |)) (5/9)= switched to RUNNING
11/04/2014 14:58:13:    DataSink(Csv= OutputFormat (path: file:/home/abasu/examples/Weblogs/result, delimiter: |)= ) (4/9) switched to RUNNING
11/04/2014 14:58:13:    DataS= ink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, delimi= ter: |)) (3/9) switched to RUNNING
11/04/2014 14:58:13:   = ; DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result,= delimiter: |)) (2/9) switched to RUNNING
11/04/2014 14:58:13: &nbs= p;  DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/= result, delimiter: |)) (8/9) switched to RUNNING
11/04/2014 14:58:13:&nb= sp;   DataSink(CsvOutputFormat (path: file:/home/abasu/examples/W= eblogs/result, delimiter: |)) (6/9) switched to RUNNING
11/04/2014 14:58= :13:    Join(org.apache.flink.api.java.operators.JoinOperato= r$ProjectFlatJoinFunction) (1/9) switched to FINISHED
11/04/2014 14:58:1= 3:    DataSink(CsvOutputFormat (path: file:/home/abasu/examp= les/Weblogs/result, delimiter: |)) (1/9) switched to RUNNING
11/04/2014 = 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home/abas= u/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.java.= relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0]) (= 4/9) switched to FINISHED
11/04/2014 14:58:13:    CHAIN D= ataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) ->= Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDoc= ByKeyWords) -> Map (Projection [0]) (2/9) switched to FINISHED
11/04/= 2014 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home= /abasu/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.= java.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [= 0]) (5/9) switched to FINISHED
11/04/2014 14:58:13:    CH= AIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) = -> Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Filt= erDocByKeyWords) -> Map (Projection [0]) (9/9) switched to FINISHED
1= 1/04/2014 14:58:13:    CHAIN DataSource (CSV Input (|) file:= /home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples= .java.relational.WebLogAnalysis$FilterByRank) (4/9) switched to FINISHED11/04/2014 14:58:13:    CHAIN DataSource (CSV Input (|) fil= e:/home/abasu/examples/Weblogs/documents) -> Filter (org.apache.flink.ex= amples.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Proje= ction [0]) (8/9) switched to FINISHED
11/04/2014 14:58:13:  &n= bsp; CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/docu= ments) -> Filter (org.apache.flink.examples.java.relational.WebLogAnalys= is$FilterDocByKeyWords) -> Map (Projection [0]) (6/9) switched to FINISH= ED
11/04/2014 14:58:13:    CHAIN DataSource (CSV Input (|= ) file:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink.e= xamples.java.relational.WebLogAnalysis$FilterByRank) (2/9) switched to FINI= SHED
11/04/2014 14:58:13:    CHAIN DataSource (CSV Input = (|) file:/home/abasu/examples/Weblogs/ranks) -> Filter (org.apache.flink= .examples.java.relational.WebLogAnalysis$FilterByRank) (5/9) switched to FI= NISHED
11/04/2014 14:58:13:    CHAIN DataSource (CSV Inpu= t (|) file:/home/abasu/examples/Weblogs/documents) -> Filter (org.apache= .flink.examples.java.relational.WebLogAnalysis$FilterDocByKeyWords) -> M= ap (Projection [0]) (7/9) switched to FINISHED
11/04/2014 14:58:13: = ;   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/Web= logs/ranks) -> Filter (org.apache.flink.examples.java.relational.WebLogA= nalysis$FilterByRank) (9/9) switched to FINISHED
11/04/2014 14:58:13:&nb= sp;   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples/W= eblogs/ranks) -> Filter (org.apache.flink.examples.java.relational.WebLo= gAnalysis$FilterByRank) (6/9) switched to FINISHED
11/04/2014 14:58:13:&= nbsp;   CHAIN DataSource (CSV Input (|) file:/home/abasu/examples= /Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relational.Web= LogAnalysis$FilterByRank) (7/9) switched to FINISHED
11/04/2014 14:58:13= :    CHAIN DataSource (CSV Input (|) file:/home/abasu/exampl= es/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.relational.W= ebLogAnalysis$FilterByRank) (8/9) switched to FINISHED
11/04/2014 14:58:= 13:    Join(org.apache.flink.api.java.operators.JoinOperator= $ProjectFlatJoinFunction) (9/9) switched to FINISHED
11/04/2014 14:58:13= :    DataSink(CsvOutputFormat (path: file:/home/abasu/exampl= es/Weblogs/result, delimiter: |)) (9/9) switched to SCHEDULED
11/04/2014= 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home/aba= su/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.java.re= lational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) (9/9= ) switched to FINISHED
11/04/2014 14:58:13:    DataSink(C= svOutputFormat (path: file:/home/abasu/examples/Weblogs/result, delimiter: = |)) (9/9) switched to DEPLOYING
11/04/2014 14:58:13:    C= HAIN DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -&= gt; Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$Filter= VisitsByDate) -> Map (Projection [0]) (4/9) switched to FINISHED
11/0= 4/2014 14:58:13:    CHAIN DataSource (CSV Input (|) file:/ho= me/abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.j= ava.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]= ) (2/9) switched to FINISHED
11/04/2014 14:58:13:    CHAI= N DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) ->= Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVis= itsByDate) -> Map (Projection [0]) (7/9) switched to FINISHED
11/04/2= 014 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home/= abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.java= .relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) (= 6/9) switched to FINISHED
11/04/2014 14:58:13:    CHAIN D= ataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisits= ByDate) -> Map (Projection [0]) (5/9) switched to FINISHED
11/04/2014= 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home/aba= su/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.java.re= lational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) (8/9= ) switched to FINISHED
11/04/2014 14:58:13:    CHAIN Data= Source (CSV Input (|) file:/home/abasu/examples/Weblogs/documents) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterDocByK= eyWords) -> Map (Projection [0]) (1/9) switched to FINISHED
11/04/201= 4 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home/ab= asu/examples/Weblogs/documents) -> Filter (org.apache.flink.examples.jav= a.relational.WebLogAnalysis$FilterDocByKeyWords) -> Map (Projection [0])= (3/9) switched to FINISHED
11/04/2014 14:58:13:    CHAIN= DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/visits) -> = Filter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterVisi= tsByDate) -> Map (Projection [0]) (1/9) switched to FINISHED
11/04/20= 14 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home/a= basu/examples/Weblogs/ranks) -> Filter (org.apache.flink.examples.java.r= elational.WebLogAnalysis$FilterByRank) (1/9) switched to FINISHED
11/04/= 2014 14:58:13:    CHAIN DataSource (CSV Input (|) file:/home= /abasu/examples/Weblogs/visits) -> Filter (org.apache.flink.examples.jav= a.relational.WebLogAnalysis$FilterVisitsByDate) -> Map (Projection [0]) = (3/9) switched to FINISHED
11/04/2014 14:58:13:    CHAIN = DataSource (CSV Input (|) file:/home/abasu/examples/Weblogs/ranks) -> Fi= lter (org.apache.flink.examples.java.relational.WebLogAnalysis$FilterByRank= ) (3/9) switched to FINISHED
11/04/2014 14:58:13:    Data= Sink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, delim= iter: |)) (9/9) switched to RUNNING
11/04/2014 14:58:13:  &nbs= p; DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result= , delimiter: |)) (7/9) switched to FINISHED
11/04/2014 14:58:13: &n= bsp;  CoGroup (org.apache.flink.examples.java.relational.WebLogAnalysi= s$AntiJoinVisits) (7/9) switched to FINISHED
11/04/2014 14:58:13: &= nbsp;  DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblo= gs/result, delimiter: |)) (8/9) switched to FINISHED
11/04/2014 14:58:13= :    DataSink(CsvOutputFormat (path: file:/home/abasu/exampl= es/Weblogs/result, delimiter: |)) (5/9) switched to FINISHED
11/04/2014 = 14:58:13:    CoGroup (org.apache.flink.examples.java.relatio= nal.WebLogAnalysis$AntiJoinVisits) (8/9) switched to FINISHED
11/04/2014= 14:58:13:    DataSink(CsvOutputFormat (path: file:/home/aba= su/examples/Weblogs/result, delimiter: |)) (6/9) switched to FINISHED
11= /04/2014 14:58:13:    CoGroup (org.apache.flink.examples.jav= a.relational.WebLogAnalysis$AntiJoinVisits) (5/9) switched to FINISHED
1= 1/04/2014 14:58:13:    CoGroup (org.apache.flink.examples.ja= va.relational.WebLogAnalysis$AntiJoinVisits) (6/9) switched to FINISHED
= 11/04/2014 14:58:13:    CoGroup (org.apache.flink.examples.j= ava.relational.WebLogAnalysis$AntiJoinVisits) (4/9) switched to FINISHED11/04/2014 14:58:13:    DataSink(CsvOutputFormat (path: fil= e:/home/abasu/examples/Weblogs/result, delimiter: |)) (4/9) switched to FIN= ISHED
11/04/2014 14:58:13:    DataSink(CsvOutputFormat (p= ath: file:/home/abasu/examples/Weblogs/result, delimiter: |)) (1/9) switche= d to FINISHED
11/04/2014 14:58:13:    CoGroup (org.apache= .flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (1/9) switch= ed to FINISHED
11/04/2014 14:58:13:    DataSink(CsvOutput= Format (path: file:/home/abasu/examples/Weblogs/result, delimiter: |)) (3/9= ) switched to FINISHED
11/04/2014 14:58:13:    CoGroup (o= rg.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVisits) (3/= 9) switched to FINISHED
11/04/2014 14:58:13:    DataSink(= CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, delimiter:= |)) (2/9) switched to FINISHED
11/04/2014 14:58:13:    C= oGroup (org.apache.flink.examples.java.relational.WebLogAnalysis$AntiJoinVi= sits) (2/9) switched to FINISHED
11/04/2014 14:58:13:    = DataSink(CsvOutputFormat (path: file:/home/abasu/examples/Weblogs/result, d= elimiter: |)) (9/9) switched to FINISHED
11/04/2014 14:58:13:  = ;  CoGroup (org.apache.flink.examples.java.relational.WebLogAnalysis$A= ntiJoinVisits) (9/9) switched to FINISHED
11/04/2014 14:58:13: &nbs= p;  Job execution switched to status FINISHED

=
The following directory was created:
with 9 files (named 1 to 9)
=
All these files are empty!

Hence my naive= question: Is this the expected output ? Or what should be the expected out= put for an error-free run ?

Please let me know= where we are going wrong?
If possible do you have other data= generated to try the WebLogAnalysis example ?

Thanks in advance for your advice and help,
Anirvan





Le 23/09/2014 17:22, rmetzger0 [via Apache Flink= (Incubator) User Mailing List archive.] a =E9crit :
Hi,


It accepts two arguments, t= he number of documents and visits. 
The generated files are = located in /tmp/documents /tmp/ranks and /tmp/visits.
I've genera= ted some sample data for you, located here: https://github.com/rmetzger/scratch/tree/weblogdatae= xample/weblog


Best,
Robert


On Tue, Sep 23, 2014 at 4:05 PM, nir= vanesque [via Apache Flink (Incubator) User Mailing List archive.] <[hidden email]&g= t; wrote:
Hello Robert,

T= hanks as usual for all your help with the information.

I'm trying = in vain to create the different input files from the program source code bu= t running into difficulties.

Could you (or anyone else) please pos= t here samples of the 4 inputs that are required to run this program ?
=
Thanks in advance,
Anirvan

Le 09/09/2014 23:54, rmetz= ger0 [via Apache Flink (Incubator) User Mailing List archive.] a =E9crit&nb= sp;:
Hi Anirvan,

sorry for the late response. Y= ou've posted the question to Nabble, which is only a mirror of our actual m= ailing list at [hidden email]. S= adly, the message is not automatically posted to the apache list because th= e apache server is rejecting the mails from nabble.
I've alread= y asked and there is no way to change this behavior.
So I actually saw= the two messages you posted here by accident.

Regarding your act= ual question:
- The command line arguments for the WebLogAnalysis exam= ple are:
   "WebLogAnalysis <documents path> <= ranks path> <visits path> <result path>"

= - Regarding the "info -d" command. I think its an artifact from our old ja= va API. I've filed an issue in JIRA: https://issues.apache.org= /jira/browse/FLINK-1095 Lets see how we resolve it. Thanks for rep= orting this!

You can find the source code of all of our examples = in the source release of Flink (in the flink-examples/flink-java-examples p= roject. You can also access the source (and hence the examples) through Git= Hub: https://github.com/apache/incubator-flink/blob/maste= r/flink-examples/flink-java-examples/src/main/java/org/apache/flink/example= /java/relational/WebLogAnalysis.java.

To build the examples, = you can run: "mvn clean package -DskipTests" in the "flink-examples/flink-j= ava-examples" directory. This will re-build them.

If you don't wa= nt to import the whole Flink project just for playing around with the examp= les, you can also create an empty maven project. This script:
curl https://raw.githubusercontent.com/apache/incubat= or-flink/master/flink-quickstart/quickstart.sh | bash

wi= ll automatically set everything up for you. Just import the "quickstart" pr= oject into Eclipse or IntelliJ. It will download all dependencies and packa= ge everything correctly. If you want to use an example there, just copy the= Java file into the "quickstart" project.

The examples are indeed= a very good way to learn how to write Flink jobs.

Please continu= e asking if you have further questions!

Best,
Robert
<= /blockquote>
<= /blockquote>
<= /blockquote>
<= /div>




------=_Part_16777674_2001909831.1415197573758--