hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andreas Kostyrka <andr...@kostyrka.org>
Subject Re: hadoop on EC2
Date Fri, 30 May 2008 19:33:45 GMT
On Wednesday 28 May 2008 23:16:43 Chris Anderson wrote:
> Andreas,
>
> If you can ssh into the nodes, you can always set up port-forwarding
> with ssh -L to bring those ports to your local machine.

Yes, and the missing part is simple too: iptables with DNAT on OUTPUT :)

I even made a small ugly script for this kind of tunneling.

Andreas

>
> On Wed, May 28, 2008 at 1:51 PM, Andreas Kostyrka <andreas@kostyrka.org> 
wrote:
> > What I wonder is what ports do I need to access?
> >
> > 50060 on all nodes.
> > 50030 on the jobtracker.
> >
> > Any other ports?
> >
> > Andreas
> >
> > Am Mittwoch, den 28.05.2008, 13:37 -0700 schrieb Allen Wittenauer:
> >> On 5/28/08 1:22 PM, "Andreas Kostyrka" <andreas@kostyrka.org> wrote:
> >> > I just wondered what other people use to access the hadoop webservers,
> >> > when running on EC2?
> >>
> >>     While we don't run on EC2 :), we do protect the hadoop web processes
> >> by putting a proxy in front of it.  A user connects to the proxy,
> >> authenticates, and then gets the output from the hadoop process.  All of
> >> the redirection magic happens via a localhost connection, so no data is
> >> leaked unprotected.



Mime
View raw message