Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 37090 invoked from network); 5 Nov 2007 23:49:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 5 Nov 2007 23:49:15 -0000 Received: (qmail 88482 invoked by uid 500); 5 Nov 2007 23:49:02 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 88274 invoked by uid 500); 5 Nov 2007 23:49:02 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 88265 invoked by uid 99); 5 Nov 2007 23:49:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Nov 2007 15:49:02 -0800 X-ASF-Spam-Status: No, hits=-100.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Nov 2007 23:49:41 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 30200714211 for ; Mon, 5 Nov 2007 15:48:51 -0800 (PST) Message-ID: <10863219.1194306531193.JavaMail.jira@brutus> Date: Mon, 5 Nov 2007 15:48:51 -0800 (PST) From: "Sanjay Radia (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-1989) Add support for simulated Data Nodes - helpful for testing and performance benchmarking of the Name Node without having a large cluster In-Reply-To: <24543146.1191434150640.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12540298 ] Sanjay Radia commented on HADOOP-1989: -------------------------------------- Tried to put the simulated implementations inside the test tree; unfortunately, it required that the test tree be compiled before the main src tree. For example startDataNode refers to the simulated impl if it is starting the simulated fsdataset. Note that we are reusing the datanode implementation; only the fsdataset is different. > Add support for simulated Data Nodes - helpful for testing and performance benchmarking of the Name Node without having a large cluster > ---------------------------------------------------------------------------------------------------------------------------------------- > > Key: HADOOP-1989 > URL: https://issues.apache.org/jira/browse/HADOOP-1989 > Project: Hadoop > Issue Type: Improvement > Components: dfs > Reporter: Sanjay Radia > Priority: Minor > Attachments: SimulatedStoragePatchSubmit.txt > > > Proposal is to add an implementation for a Simulated Data Node. > This will > - allow one to test certain parts of the system (especially the Name Node, protocols) much more easily and efficiently. > - allow one to run performance benchmarks on the Name node without having a large cluster. > - Inject faults for testing (e.g. one can add random faults based probability parameters). > The idea is that the Simulated Data Node will > - discard any data written to blocks (but remember the blocks and their sizes) > - generate fixed data on the fly when blocks are read (e.g. block is fixed set of bytes or repeated sequence of strings). > The Simulated Data Node can also be used for fault injection. > The data node can be parameterized with probabilities that allow one to control: > - Delays on reads and writes, creates, etc > - IO Exceptions > - Loss of blocks > - Failures -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.