Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@minotaur.apache.org Received: (qmail 95769 invoked from network); 26 Nov 2009 21:55:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Nov 2009 21:55:51 -0000 Received: (qmail 72480 invoked by uid 500); 26 Nov 2009 21:55:50 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 72410 invoked by uid 500); 26 Nov 2009 21:55:50 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 72400 invoked by uid 99); 26 Nov 2009 21:55:50 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Nov 2009 21:55:50 +0000 X-ASF-Spam-Status: No, hits=-2.6 required=5.0 tests=BAYES_00 X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [76.13.9.48] (HELO web65504.mail.ac4.yahoo.com) (76.13.9.48) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 26 Nov 2009 21:55:47 +0000 Received: (qmail 79419 invoked by uid 60001); 26 Nov 2009 21:55:26 -0000 Message-ID: <283622.77935.qm@web65504.mail.ac4.yahoo.com> X-YMail-OSG: qio1vn0VM1km0zYGqv1tD.Gkg2rt8GyDTKWPbEDeH7dEo7skAGeZLi7BSUhgn8XvrIVzm.eygXK_REUm.cPtSyS54sSJ64qtVQWQ5Np19tUXTerKc4yvo4l.78kDQp8aA_1wf2vUz.qanf0RkI2KJpq7Rz.dy2cAjlmErNwGlxp_sb4GsDJM2iLIYxiJsNRK8cO_xsSN5a8W1XmijtVWaIJjvH0VFZ3utAo4b_hGAHJmzPgRM.o1d4KkKNJYNcTzXW45x1OPVpH_JUCDFifydlw.vkBeP77MlD31zNIyvlgjReB.Mkd5cg1LhT31..hHgSqeBWR0wNVZTpnjte0dPPaoKt59EE4fQPAsyj3Nms_MBwcy4FDxdtF84_XSkz_cW1YHkd.SMj7RSnmAiv1am6FAkL.vArfOXjCuIj3yOo83qKh7miX4JC6F5hjkV87baVpHtasX0LzNfI2m Received: from [98.136.57.74] by web65504.mail.ac4.yahoo.com via HTTP; Thu, 26 Nov 2009 13:55:26 PST X-RocketYMMF: apurtell X-Mailer: YahooMailWebService/0.8.100.260964 Date: Thu, 26 Nov 2009 13:55:26 -0800 (PST) From: Andrew Purtell Subject: Re: newbie question on disk usage on node with different disk size To: hbase-user@hadoop.apache.org MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii The balancer moves blocks so they are evenly distributed across the cluster. It must be run manually and won't do what you want once the datanode with the smallest disk fills. Neither the HDFS balancer nor the config var you mention is part of HBase. You can run the balancer with the Hadoop scripts and specify a change to that config var in Hadoop's hdfs-site.xml. - Andy On Thu Nov 26th, 2009 1:21 PM PST Tux Racer wrote: >Thanks Andrew for your answer. > >I may have found the HDFS parameter I was looking for: > >dfs.datanode.du.reserved 0 Reserved space in bytes per volume. Always >leave this much space free for non dfs use. > > >http://hadoop.apache.org/common/docs/current/hdfs-default.html >https://issues.apache.org/jira/browse/HADOOP-1463 > >The book "Pro Hadoop" page 115, also mentions the "balancer" service and >the > >dfs.balance.bandwidthPerSec > >parameter also documented at: > >http://hadoop.apache.org/common/docs/current/hdfs-default.html > >dfs.balance.bandwidthPerSec 1048576 Specifies the maximum amount of >bandwidth that each datanode can utilize for the balancing purpose in >term of the number of bytes per second. > > > >however I do not see the script "start-blancer.sh" in hbase. > >Would it be possible to use those Hadoop parameters in a hbase setup? > >Thanks >TR > > > >Andrew Purtell wrote: >> Hi, >> >> Short answer: No. >> >> Longer answer: HBase uses the underlying filesystem (typically HDFS) to replicate and persist data. This is independent of the key space. Any special block placement policy like you want would be handled by the filesystem. To my knowledge, HDFS doesn't support it. HDFS also does not like heterogeneous backing storage at the moment. It causes problems if one node fills before the others, and there is not yet an automatic mechanism for moving blocks from full nodes to less utilized ones, though I see there is an issue for that: http://issues.apache.org/jira/browse/HDFS-339 . I wouldn't recommend a setup like you propose. >> >> - Andy >> >> >> >> ________________________________ >> From: Tux Racer >> To: hbase-user@hadoop.apache.org >> Sent: Thu, November 26, 2009 11:14:15 AM >> Subject: newbie question on disk usage on node with different disk size >> >> Hello Hbase Users! >> >> I am trying to find some pointers on how to configure hbase region server and in particular how the disk will be filled on each node. >> >> Say for instance that I have a small cluster of 3 nodes: >> node 1 has a 100Gb disk >> node 2 has a 200Gb disk >> and node 3 has a 300 Gb disk >> >> is there a way to tell hbase that it should store the keys proportionally to the he node disk space? >> (i.e. to have at some stage each disk filled at 50%: 50/100/150 Gb of space used) >> >> Or is that a pure Hadoop configuration question? >> I looked at the files in the ~/hbase-0.20.1/conf/ folder with no luck. >> >> Thanks >> TR >> >> >> >> >