hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sigurd Spieckermann <sigurd.spieckerm...@gmail.com>
Subject Re: Tell Hadoop to store pairs of files at the same location(s) on HDFS
Date Wed, 05 Dec 2012 19:26:13 GMT
Awesome! That's exactly what I'm looking for. Hadn't seen the JIRA. I 
hope this is coming soon!

Am 05.12.2012 18:58, schrieb Harsh J:
> You are probably talking of
> https://issues.apache.org/jira/browse/HDFS-2576 and similar JIRAs.
> This feature isn't available in HDFS yet, but may arrive soon.
> On Wed, Dec 5, 2012 at 11:23 PM, Sigurd Spieckermann
> <sigurd.spieckermann@gmail.com> wrote:
>> Hi guys,
>> I have been wondering if there's a way (hack'ish would be okay too) to tell
>> Hadoop that two files shall be stored together at the same location(s). It
>> would benefit map-side join performance if it could be done somehow because
>> all map tasks would be able to read data from a local copy. Does anyone know
>> a way?
>> -Sigurd

View raw message