From: "Sanjay Radia (JIRA)"
To: hadoop-dev@lucene.apache.org
Reply-To: hadoop-dev@lucene.apache.org
Date: Mon, 5 Nov 2007 15:56:50 -0800 (PST)
Subject: [jira] Updated: (HADOOP-1989) Add support for simulated Data Nodes - helpful for testing and performance benchmarking of the Name Node without having a large cluster
Message-ID: <25327965.1194307010902.JavaMail.jira@brutus>
In-Reply-To: <24543146.1191434150640.JavaMail.jira@brutus>

[ https://issues.apache.org/jira/browse/HADOOP-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sanjay Radia updated
HADOOP-1989:
---------------------------------

    Attachment: SimulatedStoragePatchSubmit5.txt

The attached patch addresses Konstantine's feedback on the previous patch.
It also adds a new class, DataNodeCluster, that allows one to run a DataNode cluster in a single address space (the Name Node can be in a separate address space). This class allows one to run multiple instances of the simulated Data Node in a single VM; this is useful for benchmarking with a real Name Node and a large number of simulated Data Nodes.
The hadoop command has been modified to allow one to run this as:
    bin/hadoop datanodecluster

> Add support for simulated Data Nodes - helpful for testing and performance benchmarking of the Name Node without having a large cluster
> ----------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-1989
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1989
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: dfs
>            Reporter: Sanjay Radia
>            Priority: Minor
>         Attachments: SimulatedStoragePatchSubmit.txt, SimulatedStoragePatchSubmit5.txt
>
>
> The proposal is to add an implementation of a Simulated Data Node.
> This will
> - allow one to test certain parts of the system (especially the Name Node and the protocols) much more easily and efficiently.
> - allow one to run performance benchmarks on the Name Node without having a large cluster.
> - allow one to inject faults for testing (e.g. one can add random faults based on probability parameters).
> The idea is that the Simulated Data Node will
> - discard any data written to blocks (but remember the blocks and their sizes)
> - generate fixed data on the fly when blocks are read (e.g. a block is a fixed set of bytes or a repeated sequence of strings).
> The Simulated Data Node can also be used for fault injection.
> The data node can be parameterized with probabilities that allow one to control:
> - Delays on reads, writes, creates, etc.
> - IO Exceptions
> - Loss of blocks
> - Failures

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
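
The Simulated Data Node behavior described in the quoted proposal (discard written data but remember each block and its size, generate fixed data on reads, and raise IO exceptions according to a probability parameter) can be illustrated with a minimal sketch. This is not the code from the attached patch: the class name SimulatedBlockStore, the FILL_BYTE constant, and all method names and parameters are hypothetical and chosen only for illustration.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;
import java.util.Random;

// Hypothetical sketch of the idea in the proposal; not the patch's classes.
class SimulatedBlockStore {
    // Assumed fixed byte returned for every position of every block.
    static final byte FILL_BYTE = 9;

    // Only the block id and its length are remembered, never the data.
    private final Map<Long, Integer> blockSizes = new HashMap<>();
    private final Random random;
    private final double ioErrorProbability;

    SimulatedBlockStore(double ioErrorProbability, long seed) {
        this.ioErrorProbability = ioErrorProbability;
        this.random = new Random(seed);
    }

    // Discards the payload and records the block's size.
    void writeBlock(long blockId, byte[] data) throws IOException {
        maybeFail("write");
        blockSizes.put(blockId, data.length);
    }

    // Regenerates fixed contents on the fly from the remembered size.
    byte[] readBlock(long blockId) throws IOException {
        maybeFail("read");
        Integer size = blockSizes.get(blockId);
        if (size == null) {
            throw new IOException("Unknown block " + blockId);
        }
        byte[] out = new byte[size];
        Arrays.fill(out, FILL_BYTE);
        return out;
    }

    // Throws an injected IOException with the configured probability.
    private void maybeFail(String op) throws IOException {
        if (random.nextDouble() < ioErrorProbability) {
            throw new IOException("Injected fault during " + op);
        }
    }

    public static void main(String[] args) throws IOException {
        // Probability 0.0 disables fault injection for this demonstration.
        SimulatedBlockStore store = new SimulatedBlockStore(0.0, 42L);
        store.writeBlock(1L, new byte[] {1, 2, 3});
        System.out.println("Bytes read back: " + store.readBlock(1L).length);
    }
}
```

Because only sizes are stored, memory use in this sketch grows with the number of blocks rather than with the amount of data written, which is what would make it practical to run many simulated Data Nodes in one VM. Delays and loss of blocks could be parameterized in the same way as the IO exception probability shown here.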