Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 080946E9A for ; Fri, 3 Jun 2011 17:05:00 +0000 (UTC) Received: (qmail 78851 invoked by uid 500); 3 Jun 2011 17:04:58 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 78776 invoked by uid 500); 3 Jun 2011 17:04:58 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 78768 invoked by uid 99); 3 Jun 2011 17:04:58 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Jun 2011 17:04:58 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of praneetmhatre@gmail.com designates 209.85.214.42 as permitted sender) Received: from [209.85.214.42] (HELO mail-bw0-f42.google.com) (209.85.214.42) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Jun 2011 17:04:52 +0000 Received: by bwz18 with SMTP id 18so4706984bwz.1 for ; Fri, 03 Jun 2011 10:04:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=S4v70C6CYra6o0tGJyCqW3dkasAb7N+ddt61vlDfQBM=; b=fiNUOXseisCRp7l1ItV2n+Z0T0ahkHWFoEcpJlOTo+NjCcgmYVghexAuOQrmrxRNb5 TTIM1XNO+T0mHBOGHb+yxRQ5ve7QHJ0CsKo8iRsOGoiWjjwUuxNG8Vi38vz0qr/IupGn E2QUOPWTffBy9KCIxBHvpCAvDThFV3A0ixnNM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=CJYXV52q/24ftqfkKYJAw3b9ypRgu3mC8M0AxO8kasRSYktxMmz85f3Tr8ua0WA6Jt ixnTV/61qR7O47J43IWabwgr1nvOfB2sosVrSKwVozmnHBLip+ld6wxCsSb0stLuPTq1 AgUDQx/PWk/WXrYMFXBcqeGAFavXPdZ6paAZc= MIME-Version: 1.0 Received: by 10.204.231.198 with SMTP id jr6mr2023874bkb.205.1307120671806; Fri, 03 Jun 2011 10:04:31 -0700 (PDT) Received: by 10.204.101.83 with HTTP; Fri, 3 Jun 2011 10:04:31 -0700 (PDT) In-Reply-To: <1BBDEC3EDCED3F4EA6F86E4D139A17BB2DB107AF@008-AM1MPN1-032.mgdnok.nokia.com> References: <4DE7EAAE.3010003@smail.inf.fh-brs.de> <1BBDEC3EDCED3F4EA6F86E4D139A17BB2DB107AF@008-AM1MPN1-032.mgdnok.nokia.com> Date: Fri, 3 Jun 2011 10:04:31 -0700 Message-ID: Subject: Re: Reg Randomn forest From: praneet mhatre To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=485b3970d2185255f104a4d1bfef X-Virus-Checked: Checked by ClamAV on apache.org --485b3970d2185255f104a4d1bfef Content-Type: text/plain; charset=ISO-8859-1 Hi, Even I faced the exact same problem and had a long exchange of emails with the Mahout folks regarding this. I'll link you to the mail archive to save them the trouble of going thru it all again: http://search.lucidimagination.com/search/document/ecbfb35f9e05706b/partial_implementation_of_random_forest#98cc8b90d38c0423. In a nutshell, CDH3 uses some patches from Hadoop 0.21 which a create a _SUCCEED file in the output path and the current code does not know how to deal with that file. I switched to an earlier version of Hadoop and everything worked perfectly. I don't know if this issue has been fixed yet. One of the developers could throw some light on that. Thanks, On Fri, Jun 3, 2011 at 4:15 AM, wrote: > Hi, > > I tried to run Randomn forest for KDD data in the Hadoop cluster(CDH > version 3) and ended up with the following error during build forest:- > > Exception in thread "main" java.lang.IllegalStateException: > java.io.EOFException > at > org.apache.mahout.common.iterator.sequencefile.SequenceFileIterable.iterator(SequenceFileIterable.java:63) > at > org.apache.mahout.df.mapreduce.partial.PartialBuilder.processOutput(PartialBuilder.java:173) > at > org.apache.mahout.df.mapreduce.partial.PartialBuilder.parseOutput(PartialBuilder.java:121) > at org.apache.mahout.df.mapreduce.Builder.build(Builder.java:324) > at > org.apache.mahout.df.mapreduce.BuildForest.buildForest(BuildForest.java:195) > at > org.apache.mahout.df.mapreduce.BuildForest.run(BuildForest.java:159) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) > at > org.apache.mahout.df.mapreduce.BuildForest.main(BuildForest.java:239) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at org.apache.hadoop.util.RunJar.main(RunJar.java:186) > Caused by: java.io.EOFException > at java.io.DataInputStream.readFully(DataInputStream.java:180) > at java.io.DataInputStream.readFully(DataInputStream.java:152) > at > org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1457) > at > org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1435) > at > org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1424) > at > org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1419) > at > org.apache.mahout.common.iterator.sequencefile.SequenceFileIterator.(SequenceFileIterator.java:58) > at > org.apache.mahout.common.iterator.sequencefile.SequenceFileIterable.iterator(SequenceFileIterable.java:61) > ... 12 more > > Any help in resolving the above issue is greatly appreciated. > > Thanks and Regards, > Ranjit.C > -- Praneet Mhatre Graduate Student Donald Bren School of ICS University of California, Irvine --485b3970d2185255f104a4d1bfef--