Return-Path: X-Original-To: apmail-giraph-dev-archive@www.apache.org Delivered-To: apmail-giraph-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D4ADBD0C0 for ; Wed, 8 Aug 2012 22:55:21 +0000 (UTC) Received: (qmail 40799 invoked by uid 500); 8 Aug 2012 22:55:21 -0000 Delivered-To: apmail-giraph-dev-archive@giraph.apache.org Received: (qmail 40758 invoked by uid 500); 8 Aug 2012 22:55:21 -0000 Mailing-List: contact dev-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@giraph.apache.org Delivered-To: mailing list dev@giraph.apache.org Received: (qmail 40748 invoked by uid 500); 8 Aug 2012 22:55:21 -0000 Delivered-To: apmail-incubator-giraph-dev@incubator.apache.org Received: (qmail 40745 invoked by uid 99); 8 Aug 2012 22:55:21 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Aug 2012 22:55:21 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 5F163141E28 for ; Wed, 8 Aug 2012 22:55:21 +0000 (UTC) Date: Wed, 8 Aug 2012 22:55:21 +0000 (UTC) From: "Jakob Homan (JIRA)" To: giraph-dev@incubator.apache.org Message-ID: <164641644.2204.1344466521394.JavaMail.jiratomcat@issues-vm> In-Reply-To: <2011624212.53296.1342297414457.JavaMail.jiratomcat@issues-vm> Subject: [jira] [Commented] (GIRAPH-256) Partitioning outgoing graph data during INPUT_SUPERSTEP by # of vertices results in wide variance in RPC message sizes MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/GIRAPH-256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13431481#comment-13431481 ] Jakob Homan commented on GIRAPH-256: ------------------------------------ +1 > Partitioning outgoing graph data during INPUT_SUPERSTEP by # of vertices results in wide variance in RPC message sizes > ---------------------------------------------------------------------------------------------------------------------- > > Key: GIRAPH-256 > URL: https://issues.apache.org/jira/browse/GIRAPH-256 > Project: Giraph > Issue Type: Improvement > Components: bsp, graph > Affects Versions: 0.2.0 > Reporter: Eli Reisman > Assignee: Eli Reisman > Labels: patch > Fix For: 0.2.0 > > Attachments: GIRAPH-256-1.patch, GIRAPH-256-2.patch, GIRAPH-256-3.patch, GIRAPH-256-4.patch, GIRAPH-256-5.patch, GIRAPH-256-6.patch, GIRAPH-256-7.patch > > > This relates to GIRAPH-247. The unfortunately named "MAX_VERTICES_PER_PARTITION" fooled me into thinking this value was regulating the size of initial Partition objects as they were composed during INPUT_SUPERSTEP from InputSplits each worker reads. > In fact this configuration option only regulates the size of the outgoing RPC messages, stored locally in Partition objects but decomposed into Collections of BasicVertex for transfer to their eventual homes on another (or this) worker. There they are combined into the actual Partitions they will exist in for the job run. > By partitioning these outgoing messages by # of vertices, metrics load tests have shown the size of the average message is not well regulated and can create overloads on either side of these transfers. This is important because: > 1. Throughput and memory are at a premium during INPUT_SUPERSTEP. > 2. Only one crashed worker in a Giraph job causes cascading job failure, even in an otherwise healthy workflow. > This JIRA renames the offending variables/config options and further regulates outgoing graph data in INPUT_SUPERSTEP by the # of edges and THEN the # of vertices in a candidate for transfer. This much more effectively regulates message size for typical social graph data and has been show in testing to greatly improve the amount of load-in data Giraph can handle without failure given fixed memory and worker limits. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira