hadoop-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Steve Loughran <ste...@apache.org>
Subject Re: Defining Hadoop Compatibility -revisiting-
Date Wed, 11 May 2011 10:34:06 GMT
On 11/05/2011 08:43, Andrew Purtell wrote:
>> From: Steve Loughran<stevel@apache.org>

>>   -I don't think you can claim to have a
>> Distribution/Fork/Version of Apache Hadoop if you swap out
>> big chunks of it for alternate filesystems, MR engines, etc.
>> Some description of this is needed
>> "Supports the Apache Hadoop MapReduce engine on top of
>> Filesystem XYZ"
> This is also the case with Brisk, which replaces HDFS and the standard JobTracker with
Cassandra and a new JobTracker, and claims to be a Hadoop distribution.
>    "Apache Hadoop TM Powered by Cassandra"
>    http://www.datastax.com/products/brisk
>    "DataStax’ Brisk is an enhanced open-source Apache Hadoop and
>    Hive distribution that utilizes Apache Cassandra for many of
>    its core services. [...]"

+1. It is something containing Hadoop interfaces and possibly 
source/artifacts, but I'm not sure how to describe it. It is just 
something that claims compatibility with Hadoop's filesystem and MR 
runtime. If Google chose to add the same interfaces to their platform 
within Google App Engine, it wouldn't be a Hadoop distro either.

I think it's important to set some definitions here *now* so that 
confusion doesn't set in.

View raw message