Date: Wed, 18 Feb 2015 10:03:58 -0800 (PST)
From: HungChang
To: user@flink.incubator.apache.org
Subject: Using Spargel's FilterOnVerices gets stuck.

Hi,

I have a question about generating the sub-graph using Spargel API. We use filterOnVertices to generate it.
With 30MB of edges, the code gets stuck at Join(Join at filterOnVertices). With 2MB of edges, the code doesn't have this issue.

Log
------------------------------------------------------------------------------------------------------------------
02/18/2015 10:34:23: Join(Join at filterOnVertices(Graph.java:615)) (7/20) switched to FINISHED
02/18/2015 10:34:23: Join(Join at filterOnVertices(Graph.java:615)) (12/20) switched to FINISHED
02/18/2015 10:34:23: Join(Join at filterOnVertices(Graph.java:615)) (14/20) switched to FINISHED
02/18/2015 10:34:23: Join(Join at filterOnVertices(Graph.java:615)) (17/20) switched to FINISHED
02/18/2015 10:34:23: Join(Join at filterOnVertices(Graph.java:615)) (20/20) switched to FINISHED
02/18/2015 10:34:23: Join(Join at filterOnVertices(Graph.java:615)) (13/20) switched to FINISHED
02/18/2015 10:34:24: Join(Join at filterOnVertices(Graph.java:615)) (8/20) switched to FINISHED
02/18/2015 10:34:24: Join(Join at filterOnVertices(Graph.java:615)) (2/20) switched to FINISHED
02/18/2015 10:34:24: Join(Join at filterOnVertices(Graph.java:615)) (3/20) switched to FINISHED
02/18/2015 10:34:24: Join(Join at filterOnVertices(Graph.java:615)) (19/20) switched to FINISHED
02/18/2015 10:34:24: Join(Join at filterOnVertices(Graph.java:615)) (16/20) switched to FINISHED

It takes more than 10 minutes to continue, while the other operators complete in seconds. From the log, it looks like some workers finish and some don't: in the excerpt above, only 11 of the 20 parallel subtasks report FINISHED. The Spargel API shows that it uses join twice, so this operator looks a bit expensive. Could this be the reason the job gets stuck?

Our goal in using filterOnVertices is to use the sub-graph as the input for the next iteration.
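To make the question concrete: as far as I understand it, the two joins together keep an edge only when both its source and its target survive the vertex filter (assuming vertex IDs are unique). Below is a minimal plain-Java sketch of that semantics on in-memory collections. It does not use Flink at all; `SubgraphSketch`, its `Edge` class, `filterEdges`, and the sample data are hypothetical names made up for illustration, not part of the Spargel API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class SubgraphSketch {

    // Hypothetical edge type for illustration only; this is not Flink's Edge class.
    static final class Edge {
        final long source;
        final long target;

        Edge(long source, long target) {
            this.source = source;
            this.target = target;
        }

        @Override
        public String toString() {
            return source + "->" + target;
        }
    }

    // Keeps an edge only if both endpoints are in keptVertices.
    // Assumption: this matches what joining on the source ID and then
    // on the target ID yields when vertex IDs are unique.
    static List<Edge> filterEdges(List<Edge> edges, Set<Long> keptVertices) {
        List<Edge> remaining = new ArrayList<>();
        for (Edge e : edges) {
            if (keptVertices.contains(e.source) && keptVertices.contains(e.target)) {
                remaining.add(e);
            }
        }
        return remaining;
    }

    public static void main(String[] args) {
        List<Edge> edges = Arrays.asList(new Edge(1, 2), new Edge(2, 3), new Edge(3, 4));
        // Vertex 4 is filtered out, so the edge 3->4 must be dropped.
        Set<Long> kept = new HashSet<>(Arrays.asList(1L, 2L, 3L));
        System.out.println(filterEdges(edges, kept)); // prints [1->2, 2->3]
    }
}
```

In the distributed job each of these two lookups is a join over the full edge set, which is why I suspect the operator is expensive for the larger input.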
------------------------------------------------------------------------------------------------------------------
    public Graph<K, VV, EV> filterOnVertices(FilterFunction<Vertex<K, VV>> vertexFilter) {

        // Keep only the vertices that pass the user-supplied filter.
        DataSet<Vertex<K, VV>> filteredVertices = this.vertices.filter(vertexFilter);

        // First join: keep edges whose source ID (field 0) is a remaining vertex.
        // Second join: keep edges whose target ID (field 1) is a remaining vertex.
        DataSet<Edge<K, EV>> remainingEdges = this.edges.join(filteredVertices)
                .where(0).equalTo(0)
                .with(new ProjectEdge<K, VV, EV>())
                .join(filteredVertices).where(1).equalTo(0)
                .with(new ProjectEdge<K, VV, EV>());

        return new Graph<K, VV, EV>(filteredVertices, remainingEdges, this.context);
    }

Best regards,

Hung

--
View this message in context: http://apache-flink-incubator-user-mailing-list-archive.2336050.n4.nabble.com/Using-Spargel-s-FilterOnVerices-gets-stuck-tp743.html
Sent from the Apache Flink (Incubator) User Mailing List archive at Nabble.com.