hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDDS-1773) Add intermittent IO disk test to fault injection test
Date Wed, 17 Jul 2019 22:23:00 GMT

    [ https://issues.apache.org/jira/browse/HDDS-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16887469#comment-16887469

Eric Yang commented on HDDS-1773:

[~elek] {quote}I am sorry to say, but I have different opinion (as I tried to explain earlier).
Sometimes it's notable, sometimes it's not.{quote}

Btw, cgroup can controll manually to have greater precision of number of IOs to commit to
disk as a group operation.  For example:

echo "<major>:<minor>  <io_serviced>" > /cgrp/blkio.throttle.io_serviced

This throttle configuration can be changed base on time intervals to produce slow and intermittent
IO at fixed interval.  Would this work in your train of thoughts?

> Add intermittent IO disk test to fault injection test
> -----------------------------------------------------
>                 Key: HDDS-1773
>                 URL: https://issues.apache.org/jira/browse/HDDS-1773
>             Project: Hadoop Distributed Data Store
>          Issue Type: Improvement
>            Reporter: Eric Yang
>            Priority: Major
>         Attachments: HDDS-1773.001.patch, HDDS-1773.002.patch
> Disk errors can also be simulated by setting cgroup blkio rate to 0 while Ozone cluster
is running.  
> This test will be added to corruption test project and this test will only be performed
if there is write access into host cgroup to control the throttle of disk IO.
> Expected result:
> When datanode becomes irresponsive due to slow io, scm must flag the node as unhealthy.

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org

View raw message