hadoop-common-issues mailing list archives

From "Ray Chiang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-11574) Uber-JIRA: improve Hadoop network resilience & diagnostics
Date Tue, 24 Feb 2015 20:38:05 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-11574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14335407#comment-14335407 ]

Ray Chiang commented on HADOOP-11574:
-------------------------------------

I like the user-centric definitions above.  Then for each type of error, such as:

- DNS/UnknownHostException
- RPC/RemoteException
- SecurityException

We can see where it's deficient in the context of each user.

As with most of our log messages, I'd worry a bit about striking the right balance between
giving enough notification and filling the logs too much.

> Uber-JIRA: improve Hadoop network resilience & diagnostics
> ----------------------------------------------------------
>
>                 Key: HADOOP-11574
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11574
>             Project: Hadoop Common
>          Issue Type: Task
>          Components: net
>    Affects Versions: 2.6.0
>            Reporter: Steve Loughran
>              Labels: supportability
>
> Improve Hadoop's resilience to bad network conditions/problems, including
> * improving recognition of problem states
> * improving diagnostics
> * better handling of IPv6 addresses, even if the protocol is unsupported.
> * better behaviour client-side when there are connectivity problems (i.e. while some
>   errors you can spin on, DNS failures are not on the list)
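
As a rough illustration of the last point in the description (some errors are
worth spinning on, DNS failures are not), here is a minimal Java sketch. It is
not Hadoop's actual retry code: the class RetrySketch and the helpers
isRetriable and callWithRetries are hypothetical names, and the fixed sleep
between attempts is an assumption made to keep the example short.

```java
import java.io.IOException;
import java.net.UnknownHostException;
import java.util.concurrent.Callable;

// Hypothetical sketch, not part of Hadoop.
class RetrySketch {

    /** Assumption: every IOException except a DNS failure is worth retrying. */
    static boolean isRetriable(IOException e) {
        return !(e instanceof UnknownHostException);
    }

    /**
     * Runs op up to maxAttempts times. Retriable IOExceptions are retried
     * after a fixed sleep; an UnknownHostException is rethrown immediately.
     */
    static <T> T callWithRetries(Callable<T> op, int maxAttempts, long sleepMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        IOException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return op.call();
            } catch (IOException e) {
                if (!isRetriable(e)) {
                    throw e; // fail fast: spinning will not fix a DNS failure
                }
                last = e;
                if (attempt < maxAttempts) {
                    Thread.sleep(sleepMillis); // no sleep after the final attempt
                }
            }
        }
        throw last; // all attempts failed with retriable errors
    }
}
```

A caller would wrap the network operation in a lambda, for example
callWithRetries(() -> connect(host), 5, 1000L), where connect is a stand-in for
whatever call is being protected. A refused connection would be retried, while
an unresolvable host name would surface to the user on the first attempt.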



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
