hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mohit Anchlia <mohitanch...@gmail.com>
Subject Re: Get Current Block or Split ID, and using it, the Block Path
Date Sun, 08 Apr 2012 16:36:58 GMT
I think if you called getInputFormat on JobConf and then called getSplits
you would atleast get the locations.

http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapred/InputSplit.html

On Sun, Apr 8, 2012 at 9:16 AM, Deepak Nettem <deepaknettem@gmail.com>wrote:

> Hi,
>
> Is it possible to get the 'id' of the currently executing split or block
> from within the mapper? Using this block Id / split id, I want to be able
> to query the namenode to get the names of hosts having that block / spllit,
> and the actual path to the data.
>
> I need this for some analytics that I'm doing. Is there a client API that
> allows doing this?  If not, what's the best way to do this?
>
> Best,
> Deepak Nettem
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message