Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 893CFFBEF for ; Fri, 22 Mar 2013 12:03:20 +0000 (UTC) Received: (qmail 20718 invoked by uid 500); 22 Mar 2013 12:03:15 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 20401 invoked by uid 500); 22 Mar 2013 12:03:15 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 20374 invoked by uid 99); 22 Mar 2013 12:03:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 22 Mar 2013 12:03:14 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of cienlux@gmail.com designates 209.85.219.46 as permitted sender) Received: from [209.85.219.46] (HELO mail-oa0-f46.google.com) (209.85.219.46) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 22 Mar 2013 12:03:09 +0000 Received: by mail-oa0-f46.google.com with SMTP id k1so4198691oag.19 for ; Fri, 22 Mar 2013 05:02:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:date:message-id:subject:from:to :content-type; bh=UZRUb0KbRDY9CJ6GJ3TGjVZq8yJmQ1qVNYOVty54Qa0=; b=HUXhVp9FLnsFjyeYs7yhniMljAhKzr1oecK7sqKy3BYd7C56NlMar2BoDOz8msaSyV RX+aZEmmVT9nisyDReZ8nlHPSj/fKOvvDV1vWZoa2PhdD/XYhivS/ZVfOIzx5vSrvZ/M KvMnYrAmF+VsQ74sFGCwl4nlRYagkdgeqo5evwGI5hbU7BYImDu7a8N6iDboFGyDKk34 4T40YEXWLFk4akhRKtdEi8GleD/qL0RuHmvdpt+ijeOS8g796J91By6js8JHdOFDqUKp asmm1M45sPLZGAqhSw2QSqLfU7lT8jcaxQXpAaqh/1yJiBEv1WfVjy4C0MHV3VPcqOFA DDfA== MIME-Version: 1.0 X-Received: by 10.60.28.133 with SMTP id b5mr1424175oeh.57.1363953768669; Fri, 22 Mar 2013 05:02:48 -0700 (PDT) Received: by 10.76.83.34 with HTTP; Fri, 22 Mar 2013 05:02:48 -0700 (PDT) Date: Fri, 22 Mar 2013 07:02:48 -0500 Message-ID: Subject: MapReduce Failed and Killed From: Jinchun Kim To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=e89a8fb1f7a6df2b5f04d8823bc8 X-Virus-Checked: Checked by ClamAV on apache.org --e89a8fb1f7a6df2b5f04d8823bc8 Content-Type: text/plain; charset=ISO-8859-1 Hi, All. I'm trying to create category-based splits of Wikipedia dataset(41GB) and the training data set(5GB) using Mahout. I'm using following command. $MAHOUT_HOME/bin/mahout wikipediaDataSetCreator -i wikipedia/chunks -o wikipediainput -c $MAHOUT_HOME/examples/temp/categories.txt I had no problem with the training data set, but Hadoop showed following messages when I tried to do a same job with Wikipedia dataset, ......... 13/03/21 22:31:00 INFO mapred.JobClient: map 27% reduce 1% 13/03/21 22:40:31 INFO mapred.JobClient: map 27% reduce 2% 13/03/21 22:58:49 INFO mapred.JobClient: map 27% reduce 3% 13/03/21 23:22:57 INFO mapred.JobClient: map 27% reduce 4% 13/03/21 23:46:32 INFO mapred.JobClient: map 27% reduce 5% 13/03/22 00:27:14 INFO mapred.JobClient: map 27% reduce 6% 13/03/22 01:06:55 INFO mapred.JobClient: map 27% reduce 7% 13/03/22 01:14:06 INFO mapred.JobClient: map 27% reduce 3% 13/03/22 01:15:35 INFO mapred.JobClient: Task Id : attempt_201303211339_0002_r_000000_1, Status : FAILED Task attempt_201303211339_0002_r_000000_1 failed to report status for 1200 seconds. Killing! 13/03/22 01:20:09 INFO mapred.JobClient: map 27% reduce 4% 13/03/22 01:33:35 INFO mapred.JobClient: Task Id : attempt_201303211339_0002_m_000037_1, Status : FAILED Task attempt_201303211339_0002_m_000037_1 failed to report status for 1228 seconds. Killing! 13/03/22 01:35:12 INFO mapred.JobClient: map 27% reduce 5% 13/03/22 01:40:38 INFO mapred.JobClient: map 27% reduce 6% 13/03/22 01:52:28 INFO mapred.JobClient: map 27% reduce 7% 13/03/22 02:16:27 INFO mapred.JobClient: map 27% reduce 8% 13/03/22 02:19:02 INFO mapred.JobClient: Task Id : attempt_201303211339_0002_m_000018_1, Status : FAILED Task attempt_201303211339_0002_m_000018_1 failed to report status for 1204 seconds. Killing! 13/03/22 02:49:03 INFO mapred.JobClient: map 27% reduce 9% 13/03/22 02:52:04 INFO mapred.JobClient: map 28% reduce 9% ........ Because I just started to learn how to run Hadoop, I have no idea how to solve this problem... Does anyone have an idea how to handle this weird thing? -- *Jinchun Kim* --e89a8fb1f7a6df2b5f04d8823bc8 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi, All.

I'm trying to create= category-based splits of Wikipedia dataset(41GB) and
the t= raining data set(5GB) using Mahout.
I'm using following= command.

$MAHOUT_HOME/bi= n/mahout wikipediaDataSetCreator -i wikipedia/chunks -o wikipediainput -c $= MAHOUT_HOME/examples/temp/categories.txt

I had no problem with the training da= ta set, but Hadoop showed following messages
when I tried t= o do a same job with Wikipedia dataset,=A0

.........
13/03/21 22:31:00 INFO mapred.JobClient: =A0= map 27% reduce 1%
13/03/21 22:40:31 INFO mapred.JobClient: =A0map= 27% reduce 2%
13/03/21 22:58:49 INFO mapred.JobClient: =A0map 27= % reduce 3%
13/03/21 23:22:57 INFO mapred.JobClient: =A0map 27% reduce 4%
13/03/21 23:46:32 INFO mapred.JobClient: =A0map 27% reduce 5%
1= 3/03/22 00:27:14 INFO mapred.JobClient: =A0map 27% reduce 6%
13/0= 3/22 01:06:55 INFO mapred.JobClient: =A0map 27% reduce 7%
13/03/22 01:14:06 INFO mapred.JobClient: =A0map 27% reduce 3%
13/03/22 01:15:35 INFO mapred.JobClient: Task Id : attempt_201303211339_0= 002_r_000000_1, Status : FAILED
Task attempt_201303211339_0002_r_= 000000_1 failed to report status for 1200 seconds. Killing!
13/03/22 01:20:09 INFO mapred.JobClient: =A0map 27% reduce 4%
13/03/22 01:33:35 INFO mapred.JobClient: Task Id : attempt_201303211339_0= 002_m_000037_1, Status : FAILED
Task attempt_201303211339_0002_m_= 000037_1 failed to report status for 1228 seconds. Killing!
13/03/22 01:35:12 INFO mapred.JobClient: =A0map 27% reduce 5%
13/03/22 01:40:38 INFO mapred.JobClient: =A0map 27% reduce 6%
1= 3/03/22 01:52:28 INFO mapred.JobClient: =A0map 27% reduce 7%
13/0= 3/22 02:16:27 INFO mapred.JobClient: =A0map 27% reduce 8%
13/03/22 02:19:02 INFO mapred.JobClient: Task Id : attempt_20130321133= 9_0002_m_000018_1, Status : FAILED
Task attempt_201303211339_0002= _m_000018_1 failed to report status for 1204 seconds. Killing!
13/03/22 02:49:03 INFO mapred.JobClient: =A0map 27% reduce 9%
13/= 03/22 02:52:04 INFO mapred.JobClient: =A0map 28% reduce 9%
......= ..

Because I just started to learn how= to run Hadoop, I have no idea how to solve
this problem...
Does anyone have an idea how to handle this = weird thing?

--
Jinchun Kim
--e89a8fb1f7a6df2b5f04d8823bc8--