hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Konstantin Boudnik (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HDFS-1566) Test that covers full partition
Date Tue, 25 Jan 2011 02:11:44 GMT

     [ https://issues.apache.org/jira/browse/HDFS-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Konstantin Boudnik updated HDFS-1566:
-------------------------------------

    Attachment: HDFS-1566.sh

A pretty poor attempt to automate this regression test is attached.
I was able to reproduce this in 100% of the cases using 0.20.2 based installation of Hadoop.

Script has to be run under super-privileges.

At the end of the tests you can remove log files (to free some space) and try to restart namenode,
which produces NumberFormatException in the newly created log file, which indicates that edit
log has been corrupted.

> Test that covers full partition  
> ---------------------------------
>
>                 Key: HDFS-1566
>                 URL: https://issues.apache.org/jira/browse/HDFS-1566
>             Project: Hadoop HDFS
>          Issue Type: Test
>          Components: name-node
>    Affects Versions: 0.20.2
>            Reporter: Eli Collins
>            Assignee: Konstantin Boudnik
>             Fix For: 0.23.0
>
>         Attachments: HDFS-1566.sh
>
>
> We've seen the following bug, hdfs needs a test to reproduce this:
> * /var filled up
> * 2NN failed checkpoint due to no space left on device
> * NN log hit end of disk
> * NN seems to have exited on the spot, mid-log-message
> * NN edits are left corrupted
> ** Half of a rename made it into the log
> ** valid data appears to end on a sector boundary
> ** this is true across all of the edit dirs

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message