Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 95102 invoked from network); 7 May 2008 20:29:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 7 May 2008 20:29:49 -0000 Received: (qmail 49826 invoked by uid 500); 7 May 2008 20:29:47 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 49795 invoked by uid 500); 7 May 2008 20:29:47 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 49784 invoked by uid 500); 7 May 2008 20:29:47 -0000 Delivered-To: apmail-lucene-hadoop-user@lucene.apache.org Received: (qmail 49781 invoked by uid 99); 7 May 2008 20:29:47 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 May 2008 13:29:47 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jamesthepiper@gmail.com designates 209.85.200.174 as permitted sender) Received: from [209.85.200.174] (HELO wf-out-1314.google.com) (209.85.200.174) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 May 2008 20:28:59 +0000 Received: by wf-out-1314.google.com with SMTP id 28so394638wfc.20 for ; Wed, 07 May 2008 13:29:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; bh=UtyNOdAZ9Nn9ncXu1VCn/scshMrrrJOwWUfFMei7uc0=; b=XarVMFmrbR4zWtgoB95vvSMIhC5XxLEOft92xTcO/fyXPRvHmWNVvDjsEWKGRZiSmY4+gIMovtNnovvlyYZ99LRxwbGwz02lJ2kAXhCfaWXzwWgme9PonuLEdQmYoJofPutbeq2PSxLO2+xg53gNxq3+xzz1YLsIRvCPscUsnt0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; b=Q2zziNm2V8VGcAxyNmbBmXJyt2yUf7Na/R+ItX5S9Vg95hUZGxTxyS8aNCiTRrHzrqO4yTs5JLcam1OqIiWONzib9S2HoaYnQ5E2RwvdW05A2Ay2FwOlcM+wIUtfbnpA1p1qR+liqc2zDZMikxoimRPAyV+8JMIJYxAYNaQUWcw= Received: by 10.142.57.18 with SMTP id f18mr1034793wfa.308.1210192154175; Wed, 07 May 2008 13:29:14 -0700 (PDT) Received: by 10.143.188.10 with HTTP; Wed, 7 May 2008 13:29:13 -0700 (PDT) Message-ID: <7cfed280805071329v79dd16b2s27a9f931f10ea975@mail.gmail.com> Date: Wed, 7 May 2008 13:29:13 -0700 From: "James Moore" To: hadoop-user@lucene.apache.org Subject: Read timed out, Abandoning block blk_-5476242061384228962 MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Virus-Checked: Checked by ClamAV on apache.org What is this bit of the log trying to tell me, and what sorts of things should I be looking at to make sure it doesn't happen? I don't think the network has any basic configuration issues - I can telnet from the machine creating this log to the destination - telnet 10.252.222.239 50010 works fine when I ssh in to the box with this error. 2008-05-07 13:20:31,194 INFO org.apache.hadoop.dfs.DFSClient: Exception in createBlockOutputStream java.net.SocketTimeoutException: Read timed out 2008-05-07 13:20:31,194 INFO org.apache.hadoop.dfs.DFSClient: Abandoning block blk_-5476242061384228962 2008-05-07 13:20:31,196 INFO org.apache.hadoop.dfs.DFSClient: Waiting to find target node: 10.252.222.239:50010 I'm seeing a fair number of these. My reduces finally complete, but there are usually a couple at the end that take longer than I think they should, and they frequently have these sorts of errors. I'm running 20 machines on ec2 right now, with hadoop version 0.16.4. -- James Moore | james@restphone.com blog.restphone.com