hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stephan Ewen <stephan.e...@tu-berlin.de>
Subject Question regarding host name representation in HDFS
Date Thu, 21 Jun 2012 09:38:02 GMT
Hello HDFS Community!

I have a question regarding the information about HDFS block locations. We
are building a system on top of HDFS that tries to obey data locality
rules, so we are eager to match block locations against machines.

However, when looking at the host names obtained through
"BlockLocation#getHosts() ", it seems that the host names vary in format,
depending on how the machines are set up. Sometimes, the host name contains
the fully qualified domain name (such as "server1.hdfscluster.company.com")
and sometimes it contains only the host name (such as "server1"). It seems
to be neither consistent with the java methods "InetAddress#getHostName()"
or "InetAddress#getCanonicalHostName()"

Is there a general rule after which those names are derived?

Thanks for your help,

View raw message