Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ECDB911192 for ; Wed, 17 Sep 2014 14:29:31 +0000 (UTC) Received: (qmail 96775 invoked by uid 500); 17 Sep 2014 14:29:22 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 96673 invoked by uid 500); 17 Sep 2014 14:29:22 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 96663 invoked by uid 99); 17 Sep 2014 14:29:22 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 Sep 2014 14:29:22 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of Blanca.Hernandez@willhaben.at designates 195.12.209.79 as permitted sender) Received: from [195.12.209.79] (HELO srvsgr-smtp02.styria-it.com) (195.12.209.79) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 17 Sep 2014 14:29:14 +0000 Received: from SRVSGR-HUBCAS03.AT.styria-it.net ([195.12.215.43]) by srvsgr-smtp02.styria-it.com (8.14.5/8.14.5) with ESMTP id s8HESqLw003952 for ; Wed, 17 Sep 2014 16:28:52 +0200 Received: from SRVSGR-MBOX02.AT.styria-it.net ([fe80::b823:953a:d902:1ee0]) by SRVSGR-HUBCAS03.AT.styria-it.net ([fe80::fc4c:e673:f40:3198%14]) with mapi id 14.03.0174.001; Wed, 17 Sep 2014 16:28:52 +0200 From: Blanca Hernandez To: "user@hadoop.apache.org" Subject: AW: ClassCastException on running map-reduce jobs + tests on Windows (mongo-hadoop) Thread-Topic: ClassCastException on running map-reduce jobs + tests on Windows (mongo-hadoop) Thread-Index: Ac/Sepm5HuBA1E/LR/ipzyBpjdsu2wACKe+A Date: Wed, 17 Sep 2014 14:28:52 +0000 Message-ID: References: In-Reply-To: Accept-Language: en-US, de-AT Content-Language: de-DE X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [195.12.192.51] Content-Type: multipart/alternative; boundary="_000_CF5C9CB4C4722244884762FA2936D75F14FC6DC3SRVSGRMBOX02ATs_" MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org --_000_CF5C9CB4C4722244884762FA2936D75F14FC6DC3SRVSGRMBOX02ATs_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hi again, I changed the String objects with org.apache.hadoop.io.Text objec= ts (why is String not accepted?), and now I get another exception, so I don= =B4t really know if I solved something or I broke something: java.lang.Exception: java.lang.NullPointerException at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRun= ner.java:462) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.j= ava:522) Caused by: java.lang.NullPointerException at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.init(MapTask.ja= va:988) at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.= java:391) at org.apache.hadoop.mapred.MapTask.access$100(MapTask.java:80) at org.apache.hadoop.mapred.MapTask$NewOutputCollector.(MapTa= sk.java:675) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:747) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:340) at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(= LocalJobRunner.java:243) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.ja= va:471) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExec= utor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExe= cutor.java:615) at java.lang.Thread.run(Thread.java:744) If I could debug it in my IDE, I think I could work faster, but I have the = problems already exposed. How am I testing now? Building a jar, copying it = on the server and running a Hadoop jar command (not very performance approa= ch...). Could you give me a hand on this? Any Windows + IntelliJ IDEa there? Maaaan= y thanks! Von: Blanca Hernandez [mailto:Blanca.Hernandez@willhaben.at] Gesendet: Mittwoch, 17. September 2014 15:27 An: user@hadoop.apache.org Betreff: ClassCastException on running map-reduce jobs + tests on Windows (= mongo-hadoop) Hi! I am getting some CCE and don=B4t really understand why... Here my mapper: public class AveragePriceMapper extends Mapper{ @Override public void map(final String key, final BSONObject val, final Context c= ontext) throws IOException, InterruptedException { String id =3D "result_of_making_some_operations"; context.write(id, val); } } And in my configuration: config.setMapperOutputKey(String.class); config.setMapperOutputValue(BSONObject.class); On running my generated jar on the server, seems to work everything ok unti= l: 14/09/17 15:20:36 INFO mapred.MapTask: Processing split: MongoInputSplit{UR= I=3Dmongodb://user:pass@host:27017/my_db.my_collection, authURI=3Dnull, min= =3D{ "_id" : { "$oid" : "541666d8e4b07265e257a42e"}}, max=3D{ }, query=3D{ = }, sort=3D{ }, fields=3D{ }, notimeout=3Dfalse} 14/09/17 15:20:36 INFO mapred.MapTask: Map output collector class =3D org.a= pache.hadoop.mapred.MapTask$MapOutputBuffer 14/09/17 15:20:36 INFO mapred.MapTask: (EQUATOR) 0 kvi 26214396(104857584) 14/09/17 15:20:36 INFO mapred.MapTask: mapreduce.task.io.sort.mb: 100 14/09/17 15:20:36 INFO mapred.MapTask: soft limit at 83886080 14/09/17 15:20:36 INFO mapred.MapTask: bufstart =3D 0; bufvoid =3D 10485760= 0 14/09/17 15:20:36 INFO mapred.MapTask: kvstart =3D 26214396; length =3D 655= 3600 14/09/17 15:20:36 INFO mapred.LocalJobRunner: map task executor complete. 14/09/17 15:20:36 WARN mapred.LocalJobRunner: job_local1701078621_0001java.= lang.Exception: java.lang.ClassCastException: class java.lang.String at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRun= ner.java:462) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.j= ava:522) Caused by: java.lang.ClassCastException: class java.lang.String at java.lang.Class.asSubclass(Class.java:3126) at org.apache.hadoop.mapred.JobConf.getOutputKeyComparator(JobConf.= java:885) at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.init(MapTask.ja= va:981) at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapTask.= java:391) at org.apache.hadoop.mapred.MapTask.access$100(MapTask.java:80) at org.apache.hadoop.mapred.MapTask$NewOutputCollector.(MapTa= sk.java:675) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:747) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:340) at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(= LocalJobRunner.java:243) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.ja= va:471) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExec= utor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExe= cutor.java:615) at java.lang.Thread.run(Thread.java:744) Did I miss something?? Another issue I am worry about: working on a Windows system makes everythin= g quite complicated with Hadoop. I have it installed and running, the same = as my mongoDB database (I am using the connector provided by them). Running= the same main class I am using in the hadooop jar call on the server (in t= he example before), but from my IDE, I get this exception: PriviledgedActionException as:hernanbl cause:java.io.IOException: Failed to= set permissions of path: \tmp\hadoop-hernanbl\mapred\staging\hernanbl16008= 42219\.staging to 0700 How could I make it run? Many thanks!! Best regards, Blanca --_000_CF5C9CB4C4722244884762FA2936D75F14FC6DC3SRVSGRMBOX02ATs_ Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable

Hi agai= n, I changed the String objects with org.apache.hadoop.io.Text objects (why= is String not accepted?), and now I get another exception, so I don=B4t re= ally know if I solved something or I broke something:

&n= bsp;

&n= bsp;

java.la= ng.Exception: java.lang.NullPointerException

 &= nbsp;      at org.apache.hadoop.mapred.LocalJobRun= ner$Job.runTasks(LocalJobRunner.java:462)

 &= nbsp;      at org.apache.hadoop.mapred.LocalJobRun= ner$Job.run(LocalJobRunner.java:522)

Caused = by: java.lang.NullPointerException

 &= nbsp;      at org.apache.hadoop.mapred.MapTask$Map= OutputBuffer.init(MapTask.java:988)

 &= nbsp;      at org.apache.hadoop.mapred.MapTask.cre= ateSortingCollector(MapTask.java:391)

 &= nbsp;      at org.apache.hadoop.mapred.MapTask.acc= ess$100(MapTask.java:80)

 &= nbsp;      at org.apache.hadoop.mapred.MapTask$New= OutputCollector.<init>(MapTask.java:675)

 &= nbsp;      at org.apache.hadoop.mapred.MapTask.run= NewMapper(MapTask.java:747)

 &= nbsp;      at org.apache.hadoop.mapred.MapTask.run= (MapTask.java:340)

 &= nbsp;      at org.apache.hadoop.mapred.LocalJobRun= ner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)

 &= nbsp;      at java.util.concurrent.Executors$Runna= bleAdapter.call(Executors.java:471)

 &= nbsp;      at java.util.concurrent.FutureTask.run(= FutureTask.java:262)

 &= nbsp;      at java.util.concurrent.ThreadPoolExecu= tor.runWorker(ThreadPoolExecutor.java:1145)

 &= nbsp;      at java.util.concurrent.ThreadPoolExecu= tor$Worker.run(ThreadPoolExecutor.java:615)

 &= nbsp;      at java.lang.Thread.run(Thread.java:744= )

&n= bsp;

If I co= uld debug it in my IDE, I think I could work faster, but I have the problem= s already exposed. How am I testing now? Building a jar, copying it on the = server and running a Hadoop jar command (not very performance approach…).

&n= bsp;

Could y= ou give me a hand on this? Any Windows + IntelliJ IDEa there? Maaaany t= hanks!

&n= bsp;

&n= bsp;

&n= bsp;

Von: Blanca Hernandez [mailto:Blanca.Hern= andez@willhaben.at]
Gesendet: Mittwoch, 17. September 2014 15:27
An: user@hadoop.apache.org
Betreff: ClassCastException on running map-reduce jobs + tests o= n Windows (mongo-hadoop)

 

Hi!

 

I am getting some CCE and don= =B4t really understand why…

 

Here my mapper:

 

public class AveragePriceMapper= extends Mapper<String, BSONObject, String, BSONObject>{

    @Override

    public void = map(final String key, final BSONObject val, final Context context) throws I= OException, InterruptedException {

     &= nbsp;  String id =3D “result_of_making_some_operations”;

     &= nbsp;  context.write(id, val);

    }=

}

 

And in my configuration:

 

config.setMapperOutputKey(Strin= g.class);

config.setMapperOutputValue(BSO= NObject.class);

 

 

On running my generated jar on = the server, seems to work everything ok until:

 

14/09/17 15:20:36 INFO mapred.M= apTask: Processing split: MongoInputSplit{URI=3Dmongodb://user:pass@host:27= 017/my_db.my_collection, authURI=3Dnull, min=3D{ "_id" : { "= $oid" : "541666d8e4b07265e257a42e"}}, max=3D{ }, query=3D{ }= , sort=3D{ }, fields=3D{ }, notimeout=3Dfalse}

14/09/17 15:20:36 INFO mapred.M= apTask: Map output collector class =3D org.apache.hadoop.mapred.MapTask$Map= OutputBuffer

14/09/17 15:20:36 INFO mapred.M= apTask: (EQUATOR) 0 kvi 26214396(104857584)

14/09/17 15:20:36 INFO mapred.M= apTask: mapreduce.task.io.sort.mb: 100

14/09/17 15:20:36 INFO mapred.M= apTask: soft limit at 83886080

14/09/17 15:20:36 INFO mapred.M= apTask: bufstart =3D 0; bufvoid =3D 104857600

14/09/17 15:20:36 INFO mapred.M= apTask: kvstart =3D 26214396; length =3D 6553600

14/09/17 15:20:36 INFO mapred.L= ocalJobRunner: map task executor complete.

14/09/17 15:20:36 WARN mapred.L= ocalJobRunner: job_local1701078621_0001java.lang.Exception: java.lang.Class= CastException: class java.lang.String

     &= nbsp;  at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJo= bRunner.java:462)

     &= nbsp;  at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunn= er.java:522)

Caused by: java.lang.ClassCastE= xception: class java.lang.String

     &= nbsp;  at java.lang.Class.asSubclass(Class.java:3126)

     &= nbsp;  at org.apache.hadoop.mapred.JobConf.getOutputKeyComparator(JobC= onf.java:885)

     &= nbsp;  at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.init(MapTas= k.java:981)

     &= nbsp;  at org.apache.hadoop.mapred.MapTask.createSortingCollector(MapT= ask.java:391)

     &= nbsp;  at org.apache.hadoop.mapred.MapTask.access$100(MapTask.java:80)=

     &= nbsp;  at org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init= >(MapTask.java:675)

     &= nbsp;  at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:7= 47)

     &= nbsp;  at org.apache.hadoop.mapred.MapTask.run(MapTask.java:340)<= /o:p>

     &= nbsp;  at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.= run(LocalJobRunner.java:243)

     &= nbsp;  at java.util.concurrent.Executors$RunnableAdapter.call(Executor= s.java:471)

     &= nbsp;  at java.util.concurrent.FutureTask.run(FutureTask.java:262)

     &= nbsp;  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPool= Executor.java:1145)

     &= nbsp;  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoo= lExecutor.java:615)

     &= nbsp;  at java.lang.Thread.run(Thread.java:744)

 

 

Did I miss something??

 

 

Another issue I am worry about:= working on a Windows system makes everything quite complicated with Hadoop= . I have it installed and running, the same as my mongoDB database (I am us= ing the connector provided by them). Running the same main class I am using in the hadooop jar call on the serv= er (in the example before), but from my IDE, I get this exception:

 

PriviledgedActionException as:h= ernanbl cause:java.io.IOException: Failed to set permissions of path: \tmp\= hadoop-hernanbl\mapred\staging\hernanbl1600842219\.staging to 0700

 

How could I make it run?

 

 

Many thanks!!=

 

Best regards,

 

Blanca

--_000_CF5C9CB4C4722244884762FA2936D75F14FC6DC3SRVSGRMBOX02ATs_--