Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 89283 invoked from network); 14 Nov 2009 19:23:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 14 Nov 2009 19:23:07 -0000 Received: (qmail 15344 invoked by uid 500); 14 Nov 2009 19:23:05 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 15254 invoked by uid 500); 14 Nov 2009 19:23:05 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 15244 invoked by uid 99); 14 Nov 2009 19:23:05 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Nov 2009 19:23:05 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mkendall@justin.tv designates 209.85.218.223 as permitted sender) Received: from [209.85.218.223] (HELO mail-bw0-f223.google.com) (209.85.218.223) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Nov 2009 19:22:59 +0000 Received: by bwz23 with SMTP id 23so4535175bwz.29 for ; Sat, 14 Nov 2009 11:22:37 -0800 (PST) MIME-Version: 1.0 Received: by 10.216.89.149 with SMTP id c21mr1881568wef.224.1258226557640; Sat, 14 Nov 2009 11:22:37 -0800 (PST) Date: Sat, 14 Nov 2009 11:22:37 -0800 Message-ID: <2d9b8f430911141122r731e5e1axd473fb22c4fb11e6@mail.gmail.com> Subject: common reasons a map task would fail on a distributed cluster but not locally? From: Mike Kendall To: common-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=0016e6d7eefa03df9f047859b330 X-Virus-Checked: Checked by ClamAV on apache.org --0016e6d7eefa03df9f047859b330 Content-Type: text/plain; charset=ISO-8859-1 so if i run my task as: cat input | ./map.py | ./sum.py > output it works just fine. however, running it on my cluster as: hadoop jar /usr/local/hadoop/contrib/streaming/hadoop-*-streaming.jar -file map.py -mapper map.py -file cat.py -reducer cat.py -input input -output output it fails. i'm really confused as to why this script would fail while my others that were written with the same methodology would work. is there a "common reasons map tasks fail" list somewhere? any ideas? --0016e6d7eefa03df9f047859b330--