Return-Path: X-Original-To: apmail-incubator-giraph-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-giraph-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9D1B49225 for ; Tue, 24 Jan 2012 21:55:14 +0000 (UTC) Received: (qmail 9783 invoked by uid 500); 24 Jan 2012 21:55:14 -0000 Delivered-To: apmail-incubator-giraph-dev-archive@incubator.apache.org Received: (qmail 9490 invoked by uid 500); 24 Jan 2012 21:55:14 -0000 Mailing-List: contact giraph-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: giraph-dev@incubator.apache.org Delivered-To: mailing list giraph-dev@incubator.apache.org Received: (qmail 9481 invoked by uid 99); 24 Jan 2012 21:55:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 24 Jan 2012 21:55:13 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 24 Jan 2012 21:55:11 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 8E9C416048C for ; Tue, 24 Jan 2012 21:54:50 +0000 (UTC) Date: Tue, 24 Jan 2012 21:54:50 +0000 (UTC) From: "Avery Ching (Updated) (JIRA)" To: giraph-dev@incubator.apache.org Message-ID: <95757505.74242.1327442090585.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <980639928.64335.1327220498402.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (GIRAPH-128) RPC port from BasicRPCCommunications should be only a starting port, and retried MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/GIRAPH-128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Avery Ching updated GIRAPH-128: ------------------------------- Attachment: GIRAPH-128.2.patch Updated after GIRAPH-124 was committed. > RPC port from BasicRPCCommunications should be only a starting port, and retried > -------------------------------------------------------------------------------- > > Key: GIRAPH-128 > URL: https://issues.apache.org/jira/browse/GIRAPH-128 > Project: Giraph > Issue Type: Improvement > Affects Versions: 0.1.0 > Reporter: Avery Ching > Assignee: Avery Ching > Attachments: GIRAPH-128.2.patch > > > Currently Giraph uses a basic port + the task partition to get the RPC port. This doesn't work well for when there are multiple Giraph jobs running simultaneously in the same Hadoop cluster (port conflict). At the same time, it is nice to use this simple algorithm because it makes it very easy to debug problems (you can find the troublesome mapper from the RPC port name). I will be proposing a simple scheme to retry with another port. I will round the total number of mappers up to the nearest power of 10 (let's that that number Z). Then I will increment the port number by Z, retrying up to 20 tries. If you have enough ports, this scheme would guarantee that up to 20 mappers / node would be supported. It should be sufficient for most clusters. At the same time, we still maintain the easy debugging method since you it's still easy to figure out the mapper partition from the port (port % Z = map partition). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira