hadoop-common-user mailing list archives

From "Usman Waheed" <usm...@opera.com>
Subject Re: Are .bz2 extensions supported in Hadoop 18.3
Date Wed, 24 Jun 2009 18:22:24 GMT
Very cool. We are using Debian, and I checked Cloudera's website: you have
packages for the Debian platform.
I will check them out and install on a test cluster.

Thanks much,
Usman

> This is correct - thanks for the note Jason. You can see the current
> patch list for Cloudera's Distribution (based on 18.3) at:
> http://www.cloudera.com/hadoop-manifest
>
> In addition to Bzip2, we have patched in: DBInputFormat, the fair
> scheduler, job level task limiting, "soft" fd leak fix, a fix for HDFS
> under-replication, shuffle improvements, EC2/S3 improvements, and
> Sqoop - database import for Hadoop.
>
> You can download RPMs and Ubuntu packages as well as preconfigured EC2
> images from: http://www.cloudera.com/hadoop
>
> Cheers,
> Christophe
>
> On Wed, Jun 24, 2009 at 6:47 AM, jason hadoop <jason.hadoop@gmail.com> wrote:
>> I believe the Cloudera 18.3 distribution supports bzip2.
>>
>> On Wed, Jun 24, 2009 at 3:45 AM, Usman Waheed <usmanw@opera.com> wrote:
>>
>>> Hi All,
>>>
>>> Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
>>> I tried, but interestingly the output was not what I expected: it
>>> differed from what I got when my data was in uncompressed format.
>>>
>>> Thanks,
>>> Usman
>>>
>>
>>
>>
>> --
>> Pro Hadoop, a book to guide you from beginner to hadoop mastery,
>> http://www.amazon.com/dp/1430219424?tag=jewlerymall
>> www.prohadoopbook.com a community for Hadoop Professionals
>>
>
>
>



-- 
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
