Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 86130 invoked from network); 7 Dec 2010 02:11:28 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 7 Dec 2010 02:11:28 -0000 Received: (qmail 34800 invoked by uid 500); 7 Dec 2010 02:11:27 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 34749 invoked by uid 500); 7 Dec 2010 02:11:27 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 34741 invoked by uid 99); 7 Dec 2010 02:11:27 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Dec 2010 02:11:27 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of amp@opendns.com designates 67.215.68.163 as permitted sender) Received: from [67.215.68.163] (HELO mail.opendns.com) (67.215.68.163) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Dec 2010 02:11:19 +0000 Received: from Adams-Desktop.local ([67.215.69.42]) (authenticated bits=0) by mail.opendns.com (8.14.3/8.14.3/Debian-5) with ESMTP id oB72Av6L009749 (version=TLSv1/SSLv3 cipher=AES256-SHA bits=256 verify=NO) for ; Tue, 7 Dec 2010 02:10:57 GMT Message-ID: <4CFD97B1.5030607@opendns.com> Date: Mon, 06 Dec 2010 18:10:57 -0800 From: Adam Phelps User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.12) Gecko/20101027 Thunderbird/3.1.6 MIME-Version: 1.0 To: general@hadoop.apache.org Subject: Several problems after CDB3b2 to CDB3b3 upgrade Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org I'm testing a hadoop version upgrade on a prototype EC2 cluster, but while I've now gotten most of it up and running (well, HDFS and HBase at least) I'm hitting some odd problems getting our M/R jobs to run. (I followed all the instructions at https://wiki.cloudera.com/display/DOC/Hadoop+Upgrade+from+CDH2+or+CDH3b2+to+CDH3b3 as well as fixing a number of problems that came up in that process.) They current problem I'm stuck on appears to be a classpath issue, but one I can't figure out. When running a job I hit this error: 10/12/07 02:01:05 INFO mapred.JobClient: Task Id : attempt_201012062243_0009_m_000182_0, Status : FAILED java.lang.RuntimeException: java.lang.ClassNotFoundException: org.apache.hadoop.hbase.mapreduce.HFileOutputFormat at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:973) at org.apache.hadoop.mapreduce.JobContext.getOutputFormatClass(JobContext.java:236) at org.apache.hadoop.mapred.Task.initialize(Task.java:484) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:298) at org.apache.hadoop.mapred.Child$4.run(Child.java:217) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063) at org.apache.hadoop.mapred.Child.main(Child.java:211) We do use HFileOutputFormat in our M/R job, however as far as I can tell that should be handled by out existing classpath: 10/12/07 02:07:25 INFO zookeeper.ZooKeeper: Client environment:java.class.path=/usr/lib/hadoop-0.20/conf:/usr/lib/jvm/java-6-sun/lib/tools.jar:/usr/lib/hadoop-0.20:/usr/lib hadoop-0.20/hadoop- ... /jsp-2.1.jar:/usr/lib/hadoop-0.20/lib/jsp-2.1/jsp-api-2.1.jar::/usr/lib/hbase/hbase.jar:/usr/lib/hbase/conf:/usr/lib/zookeeper/zookeeper.jar /usr/lib/hbase/hbase.jar:/usr/lib/hbase/conf:/usr/lib/zookeeper/zookeeper.jar It looks to me like HFileOutputFormat should be covered by that class path: # jar tf /usr/lib/hbase/hbase.jar | grep HFileOutputFormat org/apache/hadoop/hbase/mapreduce/HFileOutputFormat$WriterLength.class org/apache/hadoop/hbase/mapreduce/HFileOutputFormat.class org/apache/hadoop/hbase/mapreduce/HFileOutputFormat$1.class Any ideas here? I have another similar issue, although with this one I have to assume that some package that was previously included with the base cloudera packages is no longer included: Exception in thread "main" java.lang.NoClassDefFoundError: com/google/common/base/Function at org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil.addDependencyJars(TableMapReduceUtil.java:247) at org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil.initTableMapperJob(TableMapReduceUtil.java:81) Thanks - Adam