Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 87F789951 for ; Thu, 30 Aug 2012 03:53:12 +0000 (UTC) Received: (qmail 30527 invoked by uid 500); 30 Aug 2012 03:53:08 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 30302 invoked by uid 500); 30 Aug 2012 03:53:07 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 30256 invoked by uid 99); 30 Aug 2012 03:53:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Aug 2012 03:53:05 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of periya.data@gmail.com designates 209.85.214.176 as permitted sender) Received: from [209.85.214.176] (HELO mail-ob0-f176.google.com) (209.85.214.176) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Aug 2012 03:53:00 +0000 Received: by obbtb18 with SMTP id tb18so3224530obb.35 for ; Wed, 29 Aug 2012 20:52:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=RmakOpr8dZ638ZaTMbv+2dWmJwg+73O6EZUP7I6C6Qg=; b=F0MXRdZ9xl2A9HdxEoTI15XZtAMOWDl19/+/Ke+wBD698tRsgoTFQNyBTsZzThlr/L tuifjepC2etvY/T7nL6IBb0iHsKPR/R571JSNGC8VzK6ma7qXYobhW5/6rQxHGe++yFZ /7RIuRaVVNMRYoBKYbfiNAm12OYfPUnlC1bRDCi2zNEjqVeq8PPL7LKf1zjUA4A/ygT5 L9Le2GxvLNWVeJ4Ew0kE/6SbDARdWKBxh96RXqWaHHcT6N4FkNmCLUgPoClNnVPJxoEf x222SUYFfOa58KB+9ZhwJ/OXBwJpBsdbULJhEq73z7TdW4xMlX0V9c/1gAUAKzIY4yFV dMxg== MIME-Version: 1.0 Received: by 10.182.118.2 with SMTP id ki2mr3100564obb.101.1346298759912; Wed, 29 Aug 2012 20:52:39 -0700 (PDT) Received: by 10.60.0.51 with HTTP; Wed, 29 Aug 2012 20:52:39 -0700 (PDT) Date: Wed, 29 Aug 2012 20:52:39 -0700 Message-ID: Subject: no output written to HDFS From: "Periya.Data" To: common-user@hadoop.apache.org, cdh-user@cloudera.org Content-Type: multipart/alternative; boundary=f46d0447f37c589b7204c8739bd6 X-Virus-Checked: Checked by ClamAV on apache.org --f46d0447f37c589b7204c8739bd6 Content-Type: text/plain; charset=ISO-8859-1 Hi All, My Hadoop streaming job (in Python) runs to "completion" (both map and reduce says 100% complete). But, when I look at the output directory in HDFS, the part files are empty. I do not know what might be causing this behavior. I understand that the percentages represent the records that have been read in (not processed). The following are some of the logs. The detailed logs from Cloudera Manager says that there were no Map Outputs...which is interesting. Any suggestions? 12/08/30 03:27:14 INFO streaming.StreamJob: To kill this job, run: 12/08/30 03:27:14 INFO streaming.StreamJob: /usr/lib/hadoop-0.20/bin/hadoop job -Dmapred.job.tracker=xxxxx.yyy.com:8021 -kill job_201208232245_3182 12/08/30 03:27:14 INFO streaming.StreamJob: Tracking URL: http://xxxxxx.yyyy.com:60030/jobdetails.jsp?jobid=job_201208232245_3182 12/08/30 03:27:15 INFO streaming.StreamJob: map 0% reduce 0% 12/08/30 03:27:20 INFO streaming.StreamJob: map 33% reduce 0% 12/08/30 03:27:23 INFO streaming.StreamJob: map 67% reduce 0% 12/08/30 03:27:29 INFO streaming.StreamJob: map 100% reduce 0% 12/08/30 03:27:33 INFO streaming.StreamJob: map 100% reduce 100% 12/08/30 03:27:35 INFO streaming.StreamJob: Job complete: job_201208232245_3182 12/08/30 03:27:35 INFO streaming.StreamJob: Output: /user/GHU Thu Aug 30 03:27:24 GMT 2012 *** END bash-3.2$ bash-3.2$ hadoop fs -ls /user/ghu/ Found 5 items -rw-r--r-- 3 ghu hadoop 0 2012-08-30 03:27 /user/GHU/_SUCCESS drwxrwxrwx - ghu hadoop 0 2012-08-30 03:27 /user/GHU/_logs -rw-r--r-- 3 ghu hadoop 0 2012-08-30 03:27 /user/GHU/part-00000 -rw-r--r-- 3 ghu hadoop 0 2012-08-30 03:27 /user/GHU/part-00001 -rw-r--r-- 3 ghu hadoop 0 2012-08-30 03:27 /user/GHU/part-00002 bash-3.2$ -------------------------------------------------------------------------------------------------------------------- Metadata Status Succeeded Type MapReduce Id job_201208232245_3182 Name CaidMatch User srisrini Mapper class PipeMapper Reducer class Scheduler pool name default Job input directory hdfs://xxxxx.yyy.txt,hdfs://xxxx.yyyy.com/user/GHUcaidlist.txt Job output directory hdfs://xxxx.yyyy.com/user/GHU/ Timing Duration 20.977s Submit time Wed, 29 Aug 2012 08:27 PM Start time Wed, 29 Aug 2012 08:27 PM Finish time Wed, 29 Aug 2012 08:27 PM Progress and Scheduling Map Progress 100.0% Reduce Progress 100.0% Launched maps 4 Data-local maps 3 Rack-local maps 1 Other local maps Desired maps 3 Launched reducers Desired reducers 0 Fairscheduler running tasks Fairscheduler minimum share Fairscheduler demand Current Resource Usage Current User CPUs 0 Current System CPUs 0 Resident memory 0 B Running maps 0 Running reducers 0 Aggregate Resource Usage and Counters User CPU 0s System CPU 0s Map Slot Time 12.135s Reduce slot time 0s Cumulative disk reads Cumulative disk writes 155.0 KiB Cumulative HDFS reads 3.6 KiB Cumulative HDFS writes Map input bytes 2.5 KiB Map input records 45 Map output records 0 Reducer input groups Reducer input records Reducer output records Reducer shuffle bytes Spilled records --f46d0447f37c589b7204c8739bd6--