hadoop-hdfs-issues mailing list archives

From "Andrew Wang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-7165) Separate block metrics for files with replication count 1
Date Tue, 21 Oct 2014 17:39:35 GMT

    [ https://issues.apache.org/jira/browse/HDFS-7165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14178706#comment-14178706 ]

Andrew Wang commented on HDFS-7165:
-----------------------------------

It's okay to make whitespace changes if they're near changes you're already making. I also
make an exception if it's something egregious like the indentation being off. We generally
avoid unnecessary whitespace changes so that we don't mess up git blame or make the patch
bigger than it needs to be.

Thinking about this case a bit more though, I guess since it's just deleting lines it won't
mess up a git blame, so we can keep it.

> Separate block metrics for files with replication count 1
> ---------------------------------------------------------
>
>                 Key: HDFS-7165
>                 URL: https://issues.apache.org/jira/browse/HDFS-7165
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Andrew Wang
>            Assignee: Zhe Zhang
>         Attachments: HDFS-7165-20141003-v1.patch, HDFS-7165-20141009-v1.patch, HDFS-7165-20141010-v1.patch, HDFS-7165-20141015-v1.patch
>
>
> We see a lot of escalations because someone has written teragen output with a replication
> factor of 1, a DN goes down, and a bunch of missing blocks show up. These are normally false
> positives, since teragen output is disposable, and generally speaking, users should understand
> this is true for all repl=1 files.
> It'd be nice to be able to separate out these repl=1 missing blocks from missing blocks
> with higher replication factors.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
