hadoop-hdfs-issues mailing list archives

From "Weiwei Yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-12914) Block report leases cause missing blocks until next report
Date Tue, 09 Jul 2019 04:34:00 GMT

    [ https://issues.apache.org/jira/browse/HDFS-12914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16880921#comment-16880921 ]

Weiwei Yang commented on HDFS-12914:

Sorry for the late response, I was on vacation.

I just reverted 974dd2b4b6103374969fd7cfeb2cee50d4112c6a to unblock others. The patch
for branch-3.0 should now apply. I just triggered the Jenkins job manually. Cc [~jojochuang] [~hexiaoqiao]

> Block report leases cause missing blocks until next report
> ----------------------------------------------------------
>                 Key: HDFS-12914
>                 URL: https://issues.apache.org/jira/browse/HDFS-12914
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 2.8.0, 2.9.2
>            Reporter: Daryn Sharp
>            Assignee: Santosh Marella
>            Priority: Critical
>             Fix For: 3.0.4, 3.3.0, 3.2.1, 3.1.3
>         Attachments: HDFS-12914-branch-2.001.patch, HDFS-12914-trunk.00.patch, HDFS-12914-trunk.01.patch,
>                      HDFS-12914.005.patch, HDFS-12914.006.patch, HDFS-12914.007.patch, HDFS-12914.008.patch, HDFS-12914.009.patch,
>                      HDFS-12914.branch-2.patch, HDFS-12914.branch-3.0.patch, HDFS-12914.branch-3.1.001.patch, HDFS-12914.branch-3.1.002.patch,
>                      HDFS-12914.branch-3.2.patch, HDFS-12914.utfix.patch
>
> {{BlockReportLeaseManager#checkLease}} will reject FBRs from DNs for conditions such
> as "unknown datanode", "not in pending set", "lease has expired", wrong lease id, etc. Lease
> rejection does not throw an exception. It returns false, which bubbles up to {{NameNodeRpcServer#blockReport}}
> and is interpreted as {{noStaleStorages}}.
> A re-registering node whose FBR is rejected because of an invalid lease becomes active with
> _no blocks_. A replication storm ensues, possibly causing DNs to temporarily go dead (HDFS-12645),
> leading to more FBR lease rejections on re-registration. The cluster will have many "missing
> blocks" until each DN's next FBR is sent and/or forced.
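
The failure mode described above, where a rejected lease and a processed full block report (FBR) travel back through the same boolean, can be illustrated with a small self-contained sketch. Everything here is a hypothetical, simplified model: `ToyLeaseManager`, `Outcome`, `reportConflated` and `reportExplicit` are illustrative names, not HDFS classes or methods, and the explicit result type is only one possible way to separate the two cases, not necessarily what the attached patches do.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical, simplified model of the pattern; not the real HDFS classes. */
public class LeaseRejectionSketch {

    /** Toy lease table: datanode id -> lease id currently considered valid. */
    static final class ToyLeaseManager {
        private final Map<String, Long> leases = new HashMap<>();

        void grant(String datanode, long leaseId) {
            leases.put(datanode, leaseId);
        }

        /** Returns false for an unknown datanode or a wrong lease id; never throws. */
        boolean checkLease(String datanode, long leaseId) {
            Long valid = leases.get(datanode);
            return valid != null && valid == leaseId;
        }
    }

    /** Explicit result, so a rejection cannot be mistaken for a processed report. */
    enum Outcome { PROCESSED_NO_STALE_STORAGES, PROCESSED_STALE_STORAGES, LEASE_REJECTED }

    /** Conflated style: one boolean carries both "lease rejected" and "stale storages remain". */
    static boolean reportConflated(ToyLeaseManager lm, String dn, long leaseId, boolean staleStorages) {
        if (!lm.checkLease(dn, leaseId)) {
            return false; // report silently dropped; the caller sees only "false"
        }
        return !staleStorages; // report processed
    }

    /** Explicit style: the caller can tell a rejection apart and ask for a new report. */
    static Outcome reportExplicit(ToyLeaseManager lm, String dn, long leaseId, boolean staleStorages) {
        if (!lm.checkLease(dn, leaseId)) {
            return Outcome.LEASE_REJECTED;
        }
        return staleStorages ? Outcome.PROCESSED_STALE_STORAGES : Outcome.PROCESSED_NO_STALE_STORAGES;
    }

    public static void main(String[] args) {
        ToyLeaseManager lm = new ToyLeaseManager();
        lm.grant("dn-1", 42L);

        // Wrong lease id versus valid lease with stale storages: same boolean result.
        System.out.println(reportConflated(lm, "dn-1", 7L, false));
        System.out.println(reportConflated(lm, "dn-1", 42L, true));

        // The explicit outcome keeps the two cases distinct.
        System.out.println(reportExplicit(lm, "dn-1", 7L, false));
        System.out.println(reportExplicit(lm, "dn-1", 42L, true));
    }
}
```

In the conflated version, the caller cannot distinguish "report dropped" from "report processed, stale storages remain", which mirrors how a re-registering DN could end up active with no blocks recorded for it.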

