From: stack
Date: Thu, 22 May 2008 13:17:37 -0700
To: hbase-user@hadoop.apache.org
Subject: Re: Unable to drop table
Message-ID: <4835D4E1.9090403@duboce.net>
In-Reply-To: <17402135.post@talk.nabble.com>

Jean-Adrien wrote:
> By the way, is there an obvious link between dfs DataNodes cluster size and
> hbase HRegionServers cluster ? I'm not sure what is the meaning of the fact
> that the hadoop slaves file is a synonym of hbase regionServers file (as
> seen in the documentation http://hadoop.apache.org/hbase/docs/current/ API
> ), and how the hbase deals with hadoop-site.xml config file ; I mean what is
> the purpose to have ${HADOOP_CONF} dir in the hbase classpath ?
>

There is no 'obvious' heuristic that we're aware of. Optimally, regionservers would run on top of the datanodes hosting their data (we have a bit of work to do to make this happen). If a running regionserver were light as a feather, we'd suggest just putting up a regionserver on every datanode, but unfortunately they cost some, so the sets of regionservers and datanodes tend to diverge. Access patterns, the amount of hbase data, the proportion of your hdfs data that is up in your hbase instance, and the strength of your hosting servers are some of the inputs to consider when sizing your hbase cluster.

Because the two sets don't often match, we have a regionservers file apart from slaves for listing the hosts carrying hbase cluster members. The documentation on what the regionservers file is, is misleading/incorrect. I'll fix it so instead of 'synonym', it says 'is like the'.

Are you seeing the HADOOP_CONF_DIR in your CLASSPATH? It's not there by default, not since we became a subproject at least.

Regarding configuration in hadoop-site.xml, we don't read it unless you explicitly add it to the hbase CLASSPATH (you can do so by adding it to the HBASE_CLASSPATH variable in hbase-env.sh).
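
As a rough sketch of that hbase-env.sh change: the lines below assume HADOOP_CONF_DIR names the directory that holds your hadoop-site.xml. The fallback path is a made-up placeholder, not a default that hbase ships with, so substitute your own.

```shell
# Sketch of an addition to conf/hbase-env.sh.
# ASSUMPTION: /path/to/hadoop/conf is a placeholder for the directory
# that contains your hadoop-site.xml; replace it with the real location.
HADOOP_CONF_DIR="${HADOOP_CONF_DIR:-/path/to/hadoop/conf}"

# Append the directory to HBASE_CLASSPATH, preserving any existing value.
# The ${VAR:+...} form avoids a leading ':' when HBASE_CLASSPATH is unset.
export HBASE_CLASSPATH="${HBASE_CLASSPATH:+${HBASE_CLASSPATH}:}${HADOOP_CONF_DIR}"
```

Appending rather than overwriting keeps whatever was already on HBASE_CLASSPATH in place.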
Most of the time, hbase doesn't need to know the site-specific configurations in hadoop-site.xml, but if the configurations affect hdfs clients, then you'll want hbase to pick them up. One example would be use of a non-default replication count. I'm sure there are others.

St.Ack