Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 19363 invoked from network); 9 Aug 2006 20:51:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 9 Aug 2006 20:51:07 -0000 Received: (qmail 93903 invoked by uid 500); 9 Aug 2006 20:51:02 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 93854 invoked by uid 500); 9 Aug 2006 20:51:02 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 93781 invoked by uid 99); 9 Aug 2006 20:51:02 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Aug 2006 13:51:01 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [209.237.227.198] (HELO brutus.apache.org) (209.237.227.198) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Aug 2006 13:51:00 -0700 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 01DB87141F2 for ; Wed, 9 Aug 2006 20:48:16 +0000 (GMT) Message-ID: <32779719.1155156496005.JavaMail.jira@brutus> Date: Wed, 9 Aug 2006 13:48:16 -0700 (PDT) From: "Konstantin Shvachko (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-64) DataNode should be capable of managing multiple volumes In-Reply-To: <96561968.1141686362086.JavaMail.jira@ajax> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N [ http://issues.apache.org/jira/browse/HADOOP-64?page=comments#action_12427026 ] Konstantin Shvachko commented on HADOOP-64: ------------------------------------------- This proposition looks good to me. The only thing that seems excessive is the dynamic data structures for maintaining blockid-to-directory mapping. The alternative is to do a static mapping based on blockids and the number of directories. Suppose that the maximal number of entries per directory is N. We should define a function dirName( blockId, N, dirLevel ) which returns a local directory name for each level of the directory tree. So the datanode needs to store only the current hight of the directory tree H. Then for a given blockId, its path is determined by / dirName(blockId,N,0) / dirName(blockId,N,1) / ... / dirName(blockId,N,H) And when the datanode needs to add a new directory level it will not need to rename anything in the existing directory tree. I see a disadvantage of this approach, that the directories should be re-structured if the maximal number of entries per directory is changed. But the same is applicable for the dynamic approach, at least when N is decreased. We might consider hardcoding N rather than having it configurable. > DataNode should be capable of managing multiple volumes > ------------------------------------------------------- > > Key: HADOOP-64 > URL: http://issues.apache.org/jira/browse/HADOOP-64 > Project: Hadoop > Issue Type: Improvement > Components: dfs > Affects Versions: 0.2.0 > Reporter: Sameer Paranjpye > Assigned To: Milind Bhandarkar > Priority: Minor > Fix For: 0.6.0 > > > The dfs Datanode can only store data on a single filesystem volume. When a node runs its disks JBOD this means running a Datanode per disk on the machine. While the scheme works reasonably well on small clusters, on larger installations (several 100 nodes) it implies a very large number of Datanodes with associated management overhead in the Namenode. > The Datanod should be enhanced to be able to handle multiple volumes on a single machine. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira