hadoop-user mailing list archives

From Sigurd Spieckermann <sigurd.spieckerm...@gmail.com>
Subject Tell Hadoop to store pairs of files at the same location(s) on HDFS
Date Wed, 05 Dec 2012 17:53:53 GMT
Hi guys,

I have been wondering if there's a way (a hackish one would be okay too) to
tell Hadoop that two files should be stored together at the same location(s).
This would benefit map-side join performance because all map tasks would be
able to read their data from a local copy. Does anyone know a way?

-Sigurd
