Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4F97BD4B6 for ; Tue, 18 Dec 2012 20:37:19 +0000 (UTC) Received: (qmail 70085 invoked by uid 500); 18 Dec 2012 20:37:14 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 70010 invoked by uid 500); 18 Dec 2012 20:37:14 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 70000 invoked by uid 99); 18 Dec 2012 20:37:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Dec 2012 20:37:14 +0000 X-ASF-Spam-Status: No, hits=-0.1 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of bbeaudreault@hubspot.com designates 74.125.149.201 as permitted sender) Received: from [74.125.149.201] (HELO na3sys009aog109.obsmtp.com) (74.125.149.201) by apache.org (qpsmtpd/0.29) with SMTP; Tue, 18 Dec 2012 20:37:08 +0000 Received: from mail-qc0-f198.google.com ([209.85.216.198]) (using TLSv1) by na3sys009aob109.postini.com ([74.125.148.12]) with SMTP ID DSNKUNDT37xAoNseUrU26q9iUUsLkjQJFq68@postini.com; Tue, 18 Dec 2012 12:36:47 PST Received: by mail-qc0-f198.google.com with SMTP id l35so1667930qco.9 for ; Tue, 18 Dec 2012 12:36:45 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type:x-gm-message-state; bh=Cw+fZK4mC1ay7G/aYpbw1RJY4Nn9mJbffWi0j8D/yFE=; b=JpR5/npfBNvJt6jSPY4zTS/JOu/XMxEjOGZRRbPsNsNSfopmTm75DsyIuyNzCpxOX2 4tol6DdwRhrqpWv6XC4KVZH2INwqmXXYOTnZ1/5vp/IBtI2yFaOFQm6C2HDquEbeOFz0 5ratVECQmOHGCge00KzWOQfV99Z/4KivNI7sX2lYV9UkBoBck5+KHGnXaL+zuBGtjDww ckdBgm6qhpt+Kf4MWarf2y+OQQqUJmqwHBD406+vvabBg1m+SCXeMICwIHmr5ut1l1+V H9vXj5VLTSnWTTAaxiZI3szpyOZWF/CNWQhvvqlXd8+MZk08UGyTBYP77gVglLOSWE+o kPjg== X-Received: by 10.58.12.200 with SMTP id a8mr5217802vec.52.1355863005813; Tue, 18 Dec 2012 12:36:45 -0800 (PST) Received: by 10.58.12.200 with SMTP id a8mr5217785vec.52.1355863005696; Tue, 18 Dec 2012 12:36:45 -0800 (PST) MIME-Version: 1.0 Received: by 10.220.133.70 with HTTP; Tue, 18 Dec 2012 12:36:25 -0800 (PST) In-Reply-To: References: From: Bryan Beaudreault Date: Tue, 18 Dec 2012 15:36:25 -0500 Message-ID: Subject: Re: Misconfiguration of hdfs-site.xml To: "hbase-user@hadoop.apache.org" Content-Type: multipart/alternative; boundary=047d7b2e0cabd1946104d12674a6 X-Gm-Message-State: ALoCoQmXn1HcuwMnRPxaSUYSqSQUY0dGOIw7EXRK6j/5fiFeqCCki2ZBsoxBPajUMAatIM2sn/P5OkVXVr+c2pzRGP/0iE8evWbhZ4r+2fhNGdmDlVII5Vh+06fpiamLPu35pM9V0wrRutAvHL6U9oWZ7lvlRJFZ0gDtr/Qf5MpQPVULKao41gc= X-Virus-Checked: Checked by ClamAV on apache.org --047d7b2e0cabd1946104d12674a6 Content-Type: text/plain; charset=ISO-8859-1 Updating the config and restarting will result in data loss. You're going to want to migrate the data from your tmp directory to the appropriate place. The NameNode will use the dfs.name.dir property, and DataNodes will use the dfs.data.dir. Assuming you aren't running a DN and NN on the same server (don't do that), then they shouldn't collide/overlap. Even still its probably best to use different directories. For instance mine is like this: dfs.name.dir = /mnt/hdfs/name,/mnt2/hdfs/name dfs.data.dir = /mnt/hdfs/data,/mnt2/hdfs/data,/mnt3/hdfs/data,/mnt4/hdfs/data Hope this helps. On Tue, Dec 18, 2012 at 3:28 PM, Jean-Marc Spaggiari < jean-marc@spaggiari.org> wrote: > Hi, > > For months now I'm using my hadoop cluster with absolutly nothing > related to the drive directory on my hdfs-site.xml file. > > It seems that it's using the hadoop.tmp.dir directory to store data. > > My hadoop.tmp.dir is pointing to > /home/hadoop/haddop_drive/${user.name} and on my > /home/hadoop/haddop_drive directory I can see hadoop and hbase. > > Now, I want to configure that properly without loosing the data I have. > > I'm using 1.0.3. > > Based on the documentation > http://hadoop.apache.org/docs/r1.0.3/cluster_setup.html#Site+Configuration > it seems I need to setup dfs.data.dir to point to > /home/hadoop/haddop_drive. If I add this entry and restart my > datanode, will it "simply" continue to point to the same place and > find the data he needs? > > Also, what about dfs.name.dir ? I guess I can simply point it to the > same place, right? > > On /home/hadoop/haddop_drive/hadoop/dfs I have data and name > directories. So I'm not sure if I should point the properties above to > /home/hadoop/haddop_drive or to /home/hadoop/haddop_drive/dfs. > > Thanks, > > JM > --047d7b2e0cabd1946104d12674a6 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Updating the config and restarting will result in data loss. =A0You're = going to want to migrate the data from your tmp directory to the appropriat= e place.

The NameNode will use the dfs.name.dir property= , and DataNodes will use the dfs.data.dir. =A0Assuming you aren't runni= ng a DN and NN on the same server (don't do that), then they shouldn= 9;t collide/overlap. =A0Even still its probably best to use different direc= tories. =A0For instance mine is like this:

dfs.name.dir =3D /mnt/hdfs/name,/mnt2/hdfs/name
dfs.data.dir =3D=A0/mnt/hdfs/data,/mnt2/hdfs/data,/mnt3/hdfs/data,/mnt4/= hdfs/data

Hope this helps.


On Tue, Dec 18, 2012 at 3:28 PM, Jean-Ma= rc Spaggiari <jean-marc@spaggiari.org> wrote:
Hi,

For months now I'm using my hadoop cluster with absolutly nothing
related to the drive directory on my hdfs-site.xml file.

It seems that it's using the hadoop.tmp.dir directory to store data.
My hadoop.tmp.dir is pointing to
/home/hadoop/haddop_drive/${= user.name} and on my
/home/hadoop/haddop_drive directory I can see hadoop and hbase.

Now, I want to configure that properly without loosing the data I have.

I'm using 1.0.3.

Based on the documentation
http://hadoop.apache.org/docs/r1.0.3/cluster_= setup.html#Site+Configuration
it seems I need to setup dfs.data.dir to point to
/home/hadoop/haddop_drive. If I add this entry and restart my
datanode, will it "simply" continue to point to the same place an= d
find the data he needs?

Also, what about dfs.name.dir ? I guess I can simply point it to the
same place, right?

On =A0/home/hadoop/haddop_drive/hadoop/dfs I have data and name
directories. So I'm not sure if I should point the properties above to<= br> =A0/home/hadoop/haddop_drive or to =A0/home/hadoop/haddop_drive/dfs.

Thanks,

JM

--047d7b2e0cabd1946104d12674a6--