[ https://issues.apache.org/jira/browse/HADOOP-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12532226
]
Stu Hood commented on HADOOP-1989:
----------------------------------
I can't believe this hasn't already been implemented! Very good idea sir.
Perhaps each Simulated Data Node should be a pool of dummy nodes that grabs a few ports and
contacts and replies to the Namenode with a thread pool.
> Add support for simulated Data Nodes - helpful for testing and performance benchmarking
of the Name Node without having a large cluster
> ----------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-1989
> URL: https://issues.apache.org/jira/browse/HADOOP-1989
> Project: Hadoop
> Issue Type: Improvement
> Components: dfs
> Reporter: Sanjay Radia
> Priority: Minor
>
> Proposal is to add an implementation for a Simulated Data Node.
> This will
> - allow one to test certain parts of the system (especially the Name Node, protocols)
much more easily and efficiently.
> - allow one to run performance benchmarks on the Name node without having a large cluster.
> - Inject faults for testing (e.g. one can add random faults based probability parameters).
> The idea is that the Simulated Data Node will
> - discard any data written to blocks (but remember the blocks and their sizes)
> - generate fixed data on the fly when blocks are read (e.g. block is fixed set of bytes
or repeated sequence of strings).
> The Simulated Data Node can also be used for fault injection.
> The data node can be parameterized with probabilities that allow one to control:
> - Delays on reads and writes, creates, etc
> - IO Exceptions
> - Loss of blocks
> - Failures
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
|