Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 11404 invoked from network); 2 Feb 2010 19:42:16 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 2 Feb 2010 19:42:16 -0000 Received: (qmail 61580 invoked by uid 500); 2 Feb 2010 19:42:14 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 61498 invoked by uid 500); 2 Feb 2010 19:42:13 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 61488 invoked by uid 99); 2 Feb 2010 19:42:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Feb 2010 19:42:13 +0000 X-ASF-Spam-Status: No, hits=3.4 required=10.0 tests=HTML_MESSAGE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.223.194] (HELO mail-iw0-f194.google.com) (209.85.223.194) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Feb 2010 19:42:01 +0000 Received: by iwn34 with SMTP id 34so299490iwn.21 for ; Tue, 02 Feb 2010 11:41:39 -0800 (PST) MIME-Version: 1.0 Received: by 10.142.4.39 with SMTP id 39mr4316268wfd.76.1265139699086; Tue, 02 Feb 2010 11:41:39 -0800 (PST) In-Reply-To: References: From: Todd Lipcon Date: Tue, 2 Feb 2010 11:41:19 -0800 Message-ID: <45f85f71002021141o1dc81dd8p1cefefdf9d853d81@mail.gmail.com> Subject: Re: ClassCastException in lzo indexer To: common-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=00504502af825af687047ea34a3e X-Virus-Checked: Checked by ClamAV on apache.org --00504502af825af687047ea34a3e Content-Type: text/plain; charset=ISO-8859-1 Hi Vasilis, Did you make sure to "ant clean" before rebuilding hadoop-lzo if you updated the code? Also, can you paste your configuration for io.compression.codecs ? Thanks -Todd On Tue, Feb 2, 2010 at 9:09 AM, Vasilis Liaskovitis wrote: > Hi, > > I am trying to use hadoop-0.20.1 and hadoop-lzo > (http://github.com/kevinweil/hadoop-lzo) to index an lzo file. I 've > followed the instructions and copied both jar and native libs in my > classpaths. I am getting this error in both local and distributed > indexer mode > > bin/hadoop jar lib/hadoop-lzo-0.3.0.jar > com.hadoop.compression.lzo.LzoIndexer /data/userVisits.lzo > > 10/02/02 17:30:38 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library > 10/02/02 17:30:38 INFO lzo.LzoCodec: Successfully loaded & initialized > native-lzo library > 10/02/02 17:30:38 INFO lzo.DistributedLzoIndexer: Adding LZO file > /data/UserVisits.lzo to indexing list (no index currently exists) > 10/02/02 17:30:38 WARN mapred.JobClient: Use GenericOptionsParser for > parsing the arguments. Applications should implement Tool for the > same. > 10/02/02 17:30:39 INFO input.FileInputFormat: Total input paths to process > : 1 > 10/02/02 17:30:39 INFO mapred.JobClient: Running job: job_201002020748_0409 > 10/02/02 17:30:40 INFO mapred.JobClient: map 0% reduce 0% > 10/02/02 17:31:02 INFO mapred.JobClient: Task Id : > attempt_201002020748_0409_m_000000_0, Status : FAILED > java.lang.ClassCastException: > com.hadoop.compression.lzo.LzopCodec$LzopDecompressor cannot be cast > to com.hadoop.compression.lzo.LzopDecompressor > > same error for local indexer: > > user2@amdqc08:~/hadoop-0.20.1-prof> bin/hadoop jar > lib/hadoop-lzo-0.3.0.jar com.hadoop.compression.lzo.LzoIndexer > /data/UserVisits.lzo > 10/02/02 17:38:47 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library > 10/02/02 17:38:47 INFO lzo.LzoCodec: Successfully loaded & initialized > native-lzo library > 10/02/02 17:38:47 INFO lzo.LzoIndexer: [INDEX] LZO Indexing file > /data/UserVisits.lzo, size 9.94 GB... > Exception in thread "main" java.lang.ClassCastException: > com.hadoop.compression.lzo.LzopCodec$LzopDecompressor cannot be cast > to com.hadoop.compression.lzo.LzopDecompressor > > is hadoop-0.20.1 compatible with the git/master of hadoop-lzo? Or do I > need to use some older version of hadoop-lzo to be compatible with > hadoop-0.20.1? > > - a different, but relevant question: In order to compress > intermediate map outputs with lzo and process them in an efficient > way, does the map/reduce job need to explicitly create index files for > the compressed intermediate files? I think that at this shuffle stage, > input files have already been split and we are not relying on indexing > the intermediate lzo files for parallel shuffling. Is that correct? Or > would a job need to index the intermediate files? If yes, can this be > handled in an automatic fashion by hadoop-lzo? > > any suggestions are welcome. > thanks, > > - Vasilis > --00504502af825af687047ea34a3e--