hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Steve Loughran <ste...@hortonworks.com>
Subject Re: error running pi program
Date Sat, 10 Nov 2012 10:38:23 GMT
On 10 November 2012 00:48, yinghua hu <yinghua.hu@gmail.com> wrote:

> Hi, Ted and Andy
> I tried both internal and external hostnames. They both worked. But I will
> elect external host name since this way I can browsing the hdfs file
> structure and other information from a computer not in the cluster.
> Thanks!

I wouldn't recommend this:

If you hook up your DNs to the NN via the external addresses, Amazon will
bill you for external traffic, which could be an unpleasant surprise at the
end of the month.

If you deploy using Apache Whirr then it gives you a command to run an SSH

I've filed a JIRA for the webhdfs feature in Hadoop 1.0.3+ -which offers a
way to get at the filesystem via HTTP to not have the NN issue 302
redirects to the specific DN, but instead act as a proxy; there is a
cluster proxy service (with the same API) that can do this, but for small
EC2 clusters using the NN directly would be easier. What I haven't done is
implemented it -if anyone else wants to I'll gladly review the code (and
its tests)


View raw message