From: Иван
To: Miles Osborne, core-user@hadoop.apache.org
Subject: Re: Re: Timeouts at reduce stage
Date: Fri, 29 Aug 2008 14:53:36 +0400

Thanks for the fast reply, but in fact it sometimes fails even on stock MR jobs, for example the rowcounter job from the HBase 0.2.0 distribution. Hardware problems are theoretically possible, but they don't seem to be the cause, because everything else is operating fine on the same set of servers. All major components of each server appear to be fine, and even the disk arrays are regularly checked by the datacenter staff.

Ivan Blinkov