hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mark Kerzner <markkerz...@gmail.com>
Subject Re: Time to build my own cluster - advice?
Date Thu, 05 Nov 2009 18:14:25 GMT

I wanted to install OpenOffice and connect to it from my java code. I tried
to replicate the complete install by copying it, but there must be something
else there, because I can't connect on Amazon MapReduce, but I can on my own

When you say cheaper, do you mean that keeping your own EC2 machines up and
using them as hadoop cluster is in the end cheaper than starting a Hadoop
cluster every time you want to run a job?

Thank you,

On Thu, Nov 5, 2009 at 12:04 PM, Edmund Kohlwey <ekohlwey@gmail.com> wrote:

> If all your dependencies are java based (like Open Office) you might try
> using a dependency manager/build tool like maven or ant/ivy to package the
> dependencies in your jar. I'm not sure if any parts of open office are
> available in a public repo as maven artifacts or not, or if you want to get
> into packaging artifacts for your build system, but its something you might
> try.
> I think its cheaper to just use EC2 anyways, so that might be a motivating
> factor for you as well.
>  Hi,
>>> so far I've been using Amazon MapReduce. However, my app uses a growing
>>> number of Linux packages. I have been installing them on the fly, in the
>>> Mapper.configure(), but with OpenOffice this is hard, and I don't get a
>>> service connection even after local install.
>>> Therefore, my question is on the advice in creating my own Hadoop cluster
>>> out of EC2 machines. Are there instructions? How hard is it? What are
>>> best
>>> practices?
>>> Thank you,
>>> Mark

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message