Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 7987 invoked from network); 26 Oct 2009 16:06:33 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Oct 2009 16:06:33 -0000 Received: (qmail 2115 invoked by uid 500); 26 Oct 2009 16:06:30 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 2023 invoked by uid 500); 26 Oct 2009 16:06:30 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 2013 invoked by uid 99); 26 Oct 2009 16:06:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Oct 2009 16:06:30 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of uclamathguy@gmail.com designates 209.85.160.50 as permitted sender) Received: from [209.85.160.50] (HELO mail-pw0-f50.google.com) (209.85.160.50) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Oct 2009 16:06:22 +0000 Received: by pwi4 with SMTP id 4so3460654pwi.29 for ; Mon, 26 Oct 2009 09:06:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type; bh=VZ0KrN8HEk30HLUY58POsqhc0SnrpqrCFd8Ay0GDjrs=; b=bfdHC79om0HSRFyJvkeNBbMk7k/94sSI85la1025GjsXlQQNrQVMmZSELpv+EXepxT ulA9MllRv/JR2YLX9AGYOPQ+HFOxd2k/ltzWlajqOyxIlpdOO4JvZN484pojnftfFAID Ffnx5orXxL9o7PNi08yS6XSJ16wfh3o4ZcyL8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=gaOYG5nJ5lis3Ov/B3TsrVUwsCFr7s9EmbDyRCt1qNeMFVhTK6iD/wEx+FATheZ3r9 Aj2eX4RVBqJkgV63ukXeWP4MbLErA4nVpR2PPLWCueOSGBCsnOUKmfEYKn4v1e12ulyM 4Eo0Tzjpy893jLDQnhOQecskqRk/i7tiz8iwE= MIME-Version: 1.0 Received: by 10.143.154.17 with SMTP id g17mr1181814wfo.247.1256573161638; Mon, 26 Oct 2009 09:06:01 -0700 (PDT) In-Reply-To: References: <72fc09510910251200n799b3482hdde7aaf99213ba51@mail.gmail.com> Date: Mon, 26 Oct 2009 09:06:01 -0700 Message-ID: <72fc09510910260906r37dd3baxcb517d62cd1f1582@mail.gmail.com> Subject: Re: Streaming ignoring stderr output From: Ryan Rosario To: common-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Thanks. I think that I may have tripped on some sort of bug. Unfortunately, I do not know how to reproduce it and am a bit scared to try to reproduce it. I got this to work. I changed the following things, and now my job completes successfully with stderr written to the logs as output occurs. What was happening before was that no output was being written to the stderr logs until a map task completely finished. -- used print >> sys.stderr, "blah blah" instead of sys.stderr.write("blah blah") -- used the reporter: print >> sys.stderr, "reporter:status:My status message" -- used only one large input file, rather than splitting the file into n files. I thought that was required so I could force n mappers, but apparently not. I am not sure which one of the above solved the problem. Using sys.stderr.write() without the reporting format worked for some time. I don't know why. - Ryan On Mon, Oct 26, 2009 at 8:03 AM, Koji Noguchi wrote: > This doesn't solve your stderr/stdout problem, but you can always set the > timeout to be a bigger value if necessary. > > -Dmapred.task.timeout=______ (in milliseconds) > > Koji > > > On 10/25/09 12:00 PM, "Ryan Rosario" wrote: > >> I am using a Python script as a mapper for a Hadoop Streaming (hadoop >> 0.20.0) job, with reducer NONE. My jobs keep getting killed with "task >> failed to respond after 600 seconds." I tried sending a heartbeat >> every minute to stderr using sys.stderr.write in my mapper, but >> nothing is being output to stderr either on disk (in >> logs/userlogs/...) or in the web UI. stdout is not even recorded. >> >> This also means I have no way of knowing what my tasks are doing at >> any given moment except to look at the counts produced in syslog. >> >> I got it to work once, but have not had any luck since. Any >> suggestions of things to look at as to why I am not able to get any >> output? Help is greatly appreciated. >> >> - Ryan > > -- RRR