hadoop-common-user mailing list archives

From Allen Wittenauer <awittena...@linkedin.com>
Subject Re: Question regarding a System good candidate for Hadoop?
Date Tue, 04 Jan 2011 04:37:06 GMT

On Jan 1, 2011, at 8:31 PM, Harsh J wrote:

> Hi,
> 
> Hadoop should be evaluated if your to-process dataset is large (Large
> is relative to the size of the cluster you're going to use --
> basically using at least X amount of data such that all the processing
> power of your cluster is utilized for at least a good Y period).
> 
> If you're going to stick to C, you have two options:
>  - Hadoop Streaming [Flexible, but uses a pipe]
>  - Loading a native shared library from the distributed cache. [This
> ought to be faster than the former]

or use the Hadoop Pipes (C/C++) interface.


