Return-Path: X-Original-To: apmail-giraph-dev-archive@www.apache.org Delivered-To: apmail-giraph-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 74172D525 for ; Fri, 8 Feb 2013 00:31:14 +0000 (UTC) Received: (qmail 1432 invoked by uid 500); 8 Feb 2013 00:31:14 -0000 Delivered-To: apmail-giraph-dev-archive@giraph.apache.org Received: (qmail 1383 invoked by uid 500); 8 Feb 2013 00:31:14 -0000 Mailing-List: contact dev-help@giraph.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@giraph.apache.org Delivered-To: mailing list dev@giraph.apache.org Received: (qmail 1375 invoked by uid 500); 8 Feb 2013 00:31:14 -0000 Delivered-To: apmail-incubator-giraph-dev@incubator.apache.org Received: (qmail 1371 invoked by uid 99); 8 Feb 2013 00:31:14 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 08 Feb 2013 00:31:14 +0000 Date: Fri, 8 Feb 2013 00:31:14 +0000 (UTC) From: "Maja Kabiljo (JIRA)" To: giraph-dev@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (GIRAPH-508) Increase the limit on the number of partitions MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/GIRAPH-508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13574106#comment-13574106 ] Maja Kabiljo commented on GIRAPH-508: ------------------------------------- Sure, https://reviews.apache.org/r/9370 > Increase the limit on the number of partitions > ---------------------------------------------- > > Key: GIRAPH-508 > URL: https://issues.apache.org/jira/browse/GIRAPH-508 > Project: Giraph > Issue Type: Improvement > Reporter: Maja Kabiljo > Assignee: Maja Kabiljo > Attachments: GIRAPH-508.diff > > > We have the limit for total number of partitions of 2995. This is because of Zookeeper znode limit of 1MB, and from the assumption that partition owner description can take 300 bytes. > In the simplest case, when checkpointing is not used and partitions don't move around, we have 5 ints and hostname written per partition. If partitions move around we have one more hostname and 2 ints. And when checkpointing is used we also have the path to checkpoint file written. > For now, we can get rid of whole WorkerInfo description per partition, and just use taskIds, since all WorkerInfos are written in the beginning. This will lead to having just 4 ints per partition in the case when checkpointing is not used, and allow us to have much more partitions. > When checkpointing is used, we can keep the limit (still up it a bit), or have all workers read partition metadata when restarting from checkpoint. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira