Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 77B1DF110 for ; Tue, 9 Apr 2013 21:57:09 +0000 (UTC) Received: (qmail 25007 invoked by uid 500); 9 Apr 2013 21:57:04 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 24878 invoked by uid 500); 9 Apr 2013 21:57:04 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 24869 invoked by uid 99); 9 Apr 2013 21:57:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Apr 2013 21:57:04 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [98.139.212.154] (HELO nm3-vm0.bullet.mail.bf1.yahoo.com) (98.139.212.154) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 09 Apr 2013 21:56:56 +0000 Received: from [98.139.215.142] by nm3.bullet.mail.bf1.yahoo.com with NNFMP; 09 Apr 2013 21:56:35 -0000 Received: from [98.139.215.253] by tm13.bullet.mail.bf1.yahoo.com with NNFMP; 09 Apr 2013 21:56:35 -0000 Received: from [127.0.0.1] by omp1066.mail.bf1.yahoo.com with NNFMP; 09 Apr 2013 21:56:35 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 831070.28945.bm@omp1066.mail.bf1.yahoo.com Received: (qmail 13223 invoked by uid 60001); 9 Apr 2013 21:56:35 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1365544595; bh=Ho3gZ5ymcWCRhJMGMflwtBYQi8UBHh8MXnHpDBMgU7M=; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=C3f+g5ShGXUojxVwYOhP/PYDQcqjXQ0kfz4DQwu/PNebqX4NVUug6kHn8BXLd8vZSpkF0rKOVS18B7FipN2wfCgTMHKBtNWXmhJhXH8g1Jve3UkQc4teXAvO2ZQyGzl7DqPwezp/Vv9ocrYCBN6LwIQKvuq98VQXayZcIc+jiIA= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=q8UWB5wJMcuADs659mX8JJ/S0joigIAVcBhuHxbH4PykiCpT6Cfk6380B7NxqLBpXRtMlguCM5/yHnZUuJFLvkgWbDz6ficjo4zUHj/yJ7jKJkjSeM+3tq1nxOhFiqi9KZSwvrvwJDtIaNBR9j/SOIVVEot/cJ1n4BHIozLv4/c=; X-YMail-OSG: Ulc3UngVM1nv57O.7N3votg3ML7dye7qC3XU7qfR5Pt27B_ xjN8HGhxGJnDrvEeJrJkGI5nBjHJqPlGgcdgXdV9mVko8NSW1z9a6b4HaW7B C9rbkFyOhizs9MPijY7C756h2YUveIHhazfqcXyblWek2kS1F4yGVd6iv_xO 7CCNVk0VUXXmnJmWK87LLBtInj6S78ynQjWD65FQT..qNnWF6FAAZJzsrsDs 9T14mNLFraVA1vb79_6N63YXleYNTMWzYQ1htpG8vh2pMCqUyFs9oISQVk.3 M3K.ryMcjR4Dlaqe_7hivfbonT2WBbD5aqeFRT_xTReK4yYaRbORCistH.tN M45mBjR9MkM3hE7IECzLFHuPYIHPdtWMBd2yuv7nc7KSG4Ibo7ngcIjN2F7z 9E95wnDtA.qiMFYFBeF2.9t70cA3nWx6saWrjQOI_e9jNFCkoDlyDBp0X87a siaqhljCeUidDTVVWB_PpgR6aPA6e7vI7KOgwinoV5YKJLtjHkntiUlzUp8w .bw_Ra9aAiRYfuA-- Received: from [15.227.185.73] by web160702.mail.bf1.yahoo.com via HTTP; Tue, 09 Apr 2013 14:56:35 PDT X-Rocket-MIMEInfo: 002.001,WW91IGNvdWxkIHVzZSB0aGUgZm9sbG93aW5nIGZhY3RzLgoxLiBGaWxlcyBhcmUgc3RvcmVkIGluIGJsb2Nrcy4gU28gbWFrZSB5b3VyIGJsb2Nrc2l6ZSBiaWdnZXIgdGhhbiB0aGUgbGFyZ2VzdCBmaWxlLgoyLCBUaGUgZmlyc3Qgc3BsaXQgaXMgc3RvcmVkIG9uIHRoZSBsb2NhbG5vZGUuCgpSYWoKCgoKPl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fCj4gRnJvbTogamVyZW15IHAgPGF0aG9tZXdpdGhhZ3Jvb3ZlYm94QGdtYWlsLmNvbT4KPlRvOiB1c2VyQGhhZG9vcC5hcGFjaGUub3JnIAo.U2UBMAEBAQE- X-Mailer: YahooMailWebService/0.8.140.532 References: Message-ID: <1365544595.11142.YahooMailNeo@web160702.mail.bf1.yahoo.com> Date: Tue, 9 Apr 2013 14:56:35 -0700 (PDT) From: Raj Vishwanathan Reply-To: Raj Vishwanathan Subject: Re: When copying a file to HDFS, how to control what nodes that file will reside on? To: "user@hadoop.apache.org" In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="-2130163251-1430606915-1365544595=:11142" X-Virus-Checked: Checked by ClamAV on apache.org ---2130163251-1430606915-1365544595=:11142 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable You could use the following facts.=0A1. Files are stored in blocks. So make= your blocksize bigger than the largest file.=0A2, The first split is store= d on the localnode.=0A=0ARaj=0A=0A=0A=0A>________________________________= =0A> From: jeremy p =0A>To: user@hadoop.apa= che.org =0A>Sent: Tuesday, April 9, 2013 1:49 PM=0A>Subject: When copying a= file to HDFS, how to control what nodes that file will reside on?=0A> =0A>= =0A>Hey all,=0A>=0A>=0A>I'm dealing with kind of a bizarre use case where I= need to make sure that File A is local to Machine A, File B is local to Ma= chine B, etc. =A0When copying a file to HDFS, is there a way to control whi= ch machines that file will reside on? =A0I know that any given file will be= replicated across three machines, but I need to be able to say "File A wil= l DEFINITELY exist on Machine A". =A0I don't really care about the other tw= o machines -- they could be any machines on my cluster.=0A>=0A>=0A>Thank yo= u.=0A>=0A> ---2130163251-1430606915-1365544595=:11142 Content-Type: text/html; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable
You could use the fol= lowing facts.
1. Files are stored in blocks.= So make your blocksize bigger than the largest file.
2, The first split is stored on the localnode.

Raj

=
From: jeremy p <athomewithagroovebox@gmail.com>= ;
To: user@hadoop.apac= he.org
Sent: Tuesday,= April 9, 2013 1:49 PM
Subject: When copying a file to HDFS, how to control what nodes that file = will reside on?

=0A
Hey all,

I'm dealing with kind of a= bizarre use case where I need to make sure that File A is local to Machine= A, File B is local to Machine B, etc.  When copying a file to HDFS, i= s there a way to control which machines that file will reside on?  I k= now that any given file will be replicated across three machines, but I nee= d to be able to say "File A will DEFINITELY exist on Machine A".  I do= n't really care about the other two machines -- they could be any machines = on my cluster.
=0A=0A

Thank you.
=0A

---2130163251-1430606915-1365544595=:11142--