Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 82704CE9E for ; Wed, 23 May 2012 10:30:31 +0000 (UTC) Received: (qmail 35692 invoked by uid 500); 23 May 2012 10:30:27 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 35406 invoked by uid 500); 23 May 2012 10:30:27 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 35392 invoked by uid 99); 23 May 2012 10:30:27 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 May 2012 10:30:27 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mayingnanapple@gmail.com designates 209.85.210.48 as permitted sender) Received: from [209.85.210.48] (HELO mail-pz0-f48.google.com) (209.85.210.48) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 23 May 2012 10:30:18 +0000 Received: by dadz8 with SMTP id z8so12430211dad.35 for ; Wed, 23 May 2012 03:29:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=kMq4P7YBXwQrPqw3UFaQ8yc5zpACEID01O+B/CG3pIQ=; b=Ca3F9xKP1SL4c0r8QI5mpmeRraiVmYdjkaDlvrIdkM4LjMRUoCd8NskzDvC7yPiHlA iI7SInp5/gofywA0ynWN97jo4wMi2s49amisFWYEK40JboIwleh09EIV2++wfoLGIO5U 6EY0+uA50nGAMC9/bIMd5QTtyV4fy6lanITfYAr9JlGmKJ3Vt1faWyH19VCLuXT5y2kC roDbLd+tMGgoG0zcudt9S3SDQTdsQ0VN3fUThhbxmJm+0Gt6i5b+C5NBH0+smoBIBEJA mZfL41L7zvvQB/iGURrFhXOB9uQ4KKADOL6kw0onrdUrNTQ8lRvYL+I+pr8+pb96omEE qryA== MIME-Version: 1.0 Received: by 10.68.192.74 with SMTP id he10mr9097456pbc.69.1337768997083; Wed, 23 May 2012 03:29:57 -0700 (PDT) Received: by 10.142.233.8 with HTTP; Wed, 23 May 2012 03:29:57 -0700 (PDT) Date: Wed, 23 May 2012 18:29:57 +0800 Message-ID: Subject: Hadoop LZO compression From: Yingnan Ma To: common-user@hadoop.apache.org Content-Type: multipart/alternative; boundary=e89a8ff24e11dcda7304c0b19dee --e89a8ff24e11dcda7304c0b19dee Content-Type: text/plain; charset=ISO-8859-1 Hi, I encounter a problem about when I install the LZO, after i install it, I found that it can run on Pig scripts and streaming scripts and when I check these jobs though jobtracker , it shows that *mapred.compress.map.output *true, *io.compression.codecs * org.apache.hadoop.io.compress.GzipCodec, org.apache.hadoop.io.compress.DefaultCodec, com.hadoop.compression.lzo.LzoCodec, com.hadoop.compression.lzo.LzopCodec *io.compression.codec.lzo.class*com.hadoop.compression.lzo.LzoCodec and the pig && streaming can run also. However, I found that it would gave me some error notices such as: 2012-05-23 17:32:57,052 [Thread-6] ERROR com.hadoop.compression.lzo.GPLNativeCodeLoader - Could not load native gpl library java.lang.UnsatisfiedLinkError: no gplcompression in java.library.path at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1738) at java.lang.Runtime.loadLibrary0(Runtime.java:823) at java.lang.System.loadLibrary(System.java:1028) at com.hadoop.compression.lzo.GPLNativeCodeLoader.(GPLNativeCodeLoader.java:32) at com.hadoop.compression.lzo.LzoCodec.(LzoCodec.java:71) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:247) at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:943) at org.apache.hadoop.io.compress.CompressionCodecFactory.getCodecClasses(CompressionCodecFactory.java:89) at org.apache.hadoop.io.compress.CompressionCodecFactory.(CompressionCodecFactory.java:134) at org.apache.hadoop.mapreduce.lib.input.TextInputFormat.isSplitable(TextInputFormat.java:46) at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(FileInputFormat.java:254) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:268) at org.apache.hadoop.mapred.JobClient.writeNewSplits(JobClient.java:944) at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:961) at org.apache.hadoop.mapred.JobClient.access$500(JobClient.java:170) at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:880) at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:833) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1115) at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:833) at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:807) at org.apache.hadoop.mapred.jobcontrol.Job.submit(Job.java:378) at org.apache.hadoop.mapred.jobcontrol.JobControl.startReadyJobs(JobControl.java:247) at org.apache.hadoop.mapred.jobcontrol.JobControl.run(JobControl.java:279) at java.lang.Thread.run(Thread.java:662) 2012-05-23 17:32:57,053 [Thread-6] ERROR com.hadoop.compression.lzo.LzoCodec - Cannot load native-lzo without native-hadoop 2012-05-23 17:32:57,070 [Thread-6] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths (combined) to process : 1 2012-05-23 17:32:58,419 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201204051731_1249 2012-05-23 17:32:58,419 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - More information at: http://hdjt:50030/jobdetails.jsp?jobid=job_201204051731_1249 2012-05-23 17:33:14,526 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 50% complete 2012-05-23 17:33:18,134 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete 2012-05-23 17:33:18,136 [main] INFO org.apache.pig.tools.pigstats.PigStats - Script Statistics: HadoopVersion PigVersion UserId StartedAt FinishedAt Features 0.20.2-cdh3u0 0.8.0-cdh3u0 root 2012-05-23 17:32:54 2012-05-23 17:33:18 FILTER Success! Job Stats (time in seconds): JobId Maps Reduces MaxMapTime MinMapTIme AvgMapTime MaxReduceTime MinReduceTime AvgReduceTime Alias Feature Outputs job_201204051731_1249 1 0 10 10 10 0 0 A,B MAP_ONLY hdfs://hdmaster:54310/tmp/temp-1842686846/tmp-2027515206, It make me confuse because if it has some issues, it would not work, however it may work. So I need some help, thank you for your help! Best Regards Malone 2012-05-23 --e89a8ff24e11dcda7304c0b19dee--