From: "Paul Sutter"
Reply-To: hadoop-dev@lucene.apache.org
Subject: RE: [jira] Commented: (HADOOP-195) transfer map output transfer with http instead of rpc
Date: Thu, 11 May 2006 09:11:21 -0700

I assumed it was on one big switch. Sounds like an easy theory to test, and even easier to fix.

How hard is it to monitor traffic levels on the switch cross-connects?

-----Original Message-----
From: Doug Cutting (JIRA) [mailto:jira@apache.org]
Sent: Thursday, May 11, 2006 9:03 AM
To: hadoop-dev@lucene.apache.org
Subject: [jira] Commented: (HADOOP-195) transfer map output transfer with http instead of rpc

    [ http://issues.apache.org/jira/browse/HADOOP-195?page=comments#action_12379092 ]

Doug Cutting commented on HADOOP-195:
-------------------------------------

Everything we're now seeing is consistent with the inter-rack switches being the primary bottleneck. With 188 nodes sharing a 1Gb/s backbone, there's only 600KB/s per node. We're seeing ten 80kB files transferred per second, or 800kB/second, which is slightly higher, since some files are already on the same rack.

Rather than caching temp files in RAM, we can try to transfer files soon after they are generated and to process them on the remote end soon after they are received. That way we can benefit from the kernel's cache, getting performance similar to what we'd see if we cached them ourselves.

> transfer map output transfer with http instead of rpc
> -----------------------------------------------------
>
>          Key: HADOOP-195
>          URL: http://issues.apache.org/jira/browse/HADOOP-195
>      Project: Hadoop
>         Type: Improvement
>   Components: mapred
>     Versions: 0.2
>     Reporter: Owen O'Malley
>     Assignee: Owen O'Malley
>      Fix For: 0.3
>  Attachments: data-transfer-chart.pdf, netstat.log, netstat.xls
>
> The data transfer of the map output should be transferred via http instead of rpc, because rpc is very slow for this application and the timeout behavior is suboptimal. (The server sends data and the client ignores it because it took more than 10 seconds to be received.)

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
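
The per-node bandwidth estimate in Doug Cutting's comment can be checked with a quick back-of-the-envelope calculation. The sketch below assumes that 1Gb/s means 10^9 bits per second, that 1 kB is 1000 bytes, and that all 188 nodes share the backbone evenly; the function name `per_node_kb_per_s` is hypothetical and used only for illustration.

```python
def per_node_kb_per_s(backbone_bits_per_s: float, nodes: int) -> float:
    """Even share of a shared backbone link per node, in kB/s (1 kB = 1000 bytes)."""
    bytes_per_s = backbone_bits_per_s / 8  # bits -> bytes
    return bytes_per_s / nodes / 1000      # split evenly, bytes -> kB


# Figures taken from the comment: 188 nodes, 1Gb/s backbone,
# ten 80kB files observed per second.
share = per_node_kb_per_s(1e9, 188)
observed = 10 * 80  # kB/s

print(f"even share: {share:.0f} kB/s, observed: {observed} kB/s")
```

Under these assumptions the even share comes out in the same range as the roughly 600KB/s quoted in the comment, and the observed 800kB/s sits somewhat above it, which fits the explanation that some transfers stay within a rack and never cross the backbone.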