Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm
Reply-To: general@hadoop.apache.org
Message-Id: <5480928D-30E6-4B28-AC23-A77F31EF3164@yahoo-inc.com>
From: Sanjay Radia
Content-Type: multipart/alternative; boundary=Apple-Mail-4-898220983
Mime-Version: 1.0 (Apple Message framework v936)
Subject: Re: Namenode with External Storage?
Date: Thu, 22 Oct 2009 10:00:11 -0700
References: <7ef0f1a90910210915p6e404c1eq7979c0a969a504a3@mail.gmail.com>
X-Mailer: Apple Mail (2.936)

--Apple-Mail-4-898220983
Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes
Content-Transfer-Encoding: 7bit

On Oct 22, 2009, at 9:37 AM, wrote:

> As with Dhruba's comment, so long as it is just the namenode that is
> running on a networked file system everything should be chill. The
> namenode keeps all of its working metadata in main mem, and it only
> occasionally pushes a log file out to hard storage (and if I remember
> correctly you can adjust this time window in one of the site files).

Actually, it pushes out the update logs synchronously on each and every
update. The checkpoint, however, is pushed out periodically.
Also, at Yahoo, we push out NN state to multiple disks, and one of the
"disks" is an NFS filer. This is configurable.

sanjay

> However, you are going to run into huge performance issues running
> datanodes over a networked storage system. Having to push that many
> file requests over a network for a respectable mapreduce job is going
> to kill your equipment.
>
> - Grant
>
> On Oct 21 2009, Jonathan Seidman wrote:
>
> > Apologies if this has been answered previously, but I'm unable to
> > find anything that seems to cover this.
> >
> > It's clear that datanodes require local storage for Hadoop to
> > function efficiently, but is there any significant disadvantage to
> > using external storage for namenodes? We're exploring the
> > possibility of using a different class of hardware for our
> > namenodes with attached storage and little or no internal storage.
> > Some of the benefits this would provide us are: 1) allowing our
> > sysadmins to deploy hardware that they're familiar with and already
> > have considerable experience keeping up in a production
> > environment. 2) no namenode downtime to replace a failed disk.
> >
> > We don't anticipate that this approach would cause any significant
> > degradation to performance, but let me know if there's something
> > we're not considering.
> >
> > Thanks.
> >
> > Jonathan
> >
> > --
> --
> Grant Mackey
> PhD student Computer Engineering
> University of Central Florida
> Rm 231 cube 5 (321) 960-8851

--Apple-Mail-4-898220983--
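
The reply in this message describes a pattern: every metadata update is logged synchronously, a full checkpoint is written only periodically, and the state goes to several storage directories, one of which may be an NFS mount. The toy Python model below is a minimal sketch of that pattern only. It is not HDFS code; the class name ToyNameStore, the file names "edits" and "image", and the JSON record format are all assumptions made for illustration.

```python
import json
import os
import tempfile


class ToyNameStore:
    """Toy model of the logging pattern described above; not HDFS code."""

    def __init__(self, dirs):
        self.dirs = list(dirs)   # e.g. local disks plus an NFS mount (hypothetical)
        self.state = {}          # working metadata kept in memory
        for d in self.dirs:
            os.makedirs(d, exist_ok=True)

    def update(self, key, value):
        record = json.dumps({"key": key, "value": value}) + "\n"
        # Append the update to the log in every directory and force it to
        # storage before the in-memory state changes.
        for d in self.dirs:
            with open(os.path.join(d, "edits"), "a") as f:
                f.write(record)
                f.flush()
                os.fsync(f.fileno())
        self.state[key] = value

    def checkpoint(self):
        # Periodic step: write the full state, then start a fresh log.
        for d in self.dirs:
            tmp = os.path.join(d, "image.tmp")
            with open(tmp, "w") as f:
                json.dump(self.state, f)
                f.flush()
                os.fsync(f.fileno())
            os.replace(tmp, os.path.join(d, "image"))
            open(os.path.join(d, "edits"), "w").close()

    @staticmethod
    def recover(d):
        # Rebuild state from one surviving directory: load the last
        # checkpoint, then replay the log written since.
        state = {}
        image = os.path.join(d, "image")
        if os.path.exists(image):
            with open(image) as f:
                state = json.load(f)
        edits = os.path.join(d, "edits")
        if os.path.exists(edits):
            with open(edits) as f:
                for line in f:
                    rec = json.loads(line)
                    state[rec["key"]] = rec["value"]
        return state


if __name__ == "__main__":
    base = tempfile.mkdtemp()
    store = ToyNameStore([os.path.join(base, "disk1"), os.path.join(base, "nfs")])
    store.update("/user/a", "file")
    store.checkpoint()
    store.update("/user/b", "dir")
    print(ToyNameStore.recover(os.path.join(base, "nfs")))
```

Because each directory holds a complete copy, any single one is enough to rebuild the state in this sketch. The trade-off it makes visible is that every update waits for the slowest directory to acknowledge the write.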