Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E59DCE76C for ; Wed, 5 Dec 2012 19:26:51 +0000 (UTC) Received: (qmail 11462 invoked by uid 500); 5 Dec 2012 19:26:46 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 11349 invoked by uid 500); 5 Dec 2012 19:26:46 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 11341 invoked by uid 99); 5 Dec 2012 19:26:46 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Dec 2012 19:26:46 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of sigurd.spieckermann@gmail.com designates 209.85.214.48 as permitted sender) Received: from [209.85.214.48] (HELO mail-bk0-f48.google.com) (209.85.214.48) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Dec 2012 19:26:37 +0000 Received: by mail-bk0-f48.google.com with SMTP id jc3so2539027bkc.35 for ; Wed, 05 Dec 2012 11:26:17 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=3L3RadzGBE2uC3HVmnR8rBtG+kuak/7tysUZ7ZFR2VA=; b=V99FiPFEJ2bIq+ihUvlDWk2Ftt/GZJgJReGjEJUG3EKIHHWD6+nsgI3w8qCKRvkYBW 6fAyStYrtE50ynYCXoeEDw5dBJRLE1BxECaPl6rwk06jqlEpij9+365OgbBuX2VTyQlX YmohWudhOde5ro12eeyb+LrfCeRdSyomNYAPT7A3tJ3IRGHueFlKqWy1eNcZw1+M5rJU +H9YBS+IWliJj2gLMt00Xdy8W8aUB9tq7R2w7FqMnFjlZMgAm9z/XZM8yEnUr6LMzmne GNFbECqXPvHKQRfpygmmuil7O1yv6EDkTQ7VUhsIXBBJX1aXRb4cos8WvuwWNalsckL1 FMkw== Received: by 10.204.147.207 with SMTP id m15mr5768143bkv.54.1354735576982; Wed, 05 Dec 2012 11:26:16 -0800 (PST) Received: from [192.168.2.100] (dslb-188-110-087-046.pools.arcor-ip.net. [188.110.87.46]) by mx.google.com with ESMTPS id d16sm4845280bkw.2.2012.12.05.11.26.14 (version=SSLv3 cipher=OTHER); Wed, 05 Dec 2012 11:26:15 -0800 (PST) Message-ID: <50BF9FD5.6000100@gmail.com> Date: Wed, 05 Dec 2012 20:26:13 +0100 From: Sigurd Spieckermann User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/17.0 Thunderbird/17.0 MIME-Version: 1.0 To: user@hadoop.apache.org Subject: Re: Tell Hadoop to store pairs of files at the same location(s) on HDFS References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Awesome! That's exactly what I'm looking for. Hadn't seen the JIRA. I hope this is coming soon! Am 05.12.2012 18:58, schrieb Harsh J: > You are probably talking of > https://issues.apache.org/jira/browse/HDFS-2576 and similar JIRAs. > This feature isn't available in HDFS yet, but may arrive soon. > > On Wed, Dec 5, 2012 at 11:23 PM, Sigurd Spieckermann > wrote: >> Hi guys, >> >> I have been wondering if there's a way (hack'ish would be okay too) to tell >> Hadoop that two files shall be stored together at the same location(s). It >> would benefit map-side join performance if it could be done somehow because >> all map tasks would be able to read data from a local copy. Does anyone know >> a way? >> >> -Sigurd > > >