hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HADOOP-15216) S3AInputStream to handle reconnect on read() failure better
Date Thu, 08 Feb 2018 18:42:00 GMT
Steve Loughran created HADOOP-15216:
---------------------------------------

             Summary: S3AInputStream to handle reconnect on read() failure better
                 Key: HADOOP-15216
                 URL: https://issues.apache.org/jira/browse/HADOOP-15216
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: fs/s3
    Affects Versions: 3.0.0
            Reporter: Steve Loughran


{{S3AInputStream}} handles any IOE through a close() of stream and single re-invocation of
the read, with 
* no backoff
* no abort of the HTTPS connection, which is just returned to the pool, If httpclient hasn't
noticed the failure, it may get returned to the caller on the next read

Proposed
* switch to invoker
* retry policy explicitly for stream (EOF => throw, timeout => close, sleep, retry,
etc)

We could think about extending the fault injection to inject stream read failures intermittently
too, though it would need something in S3AInputStream to (optionally) wrap the http input
streams with the failing stream. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org


Mime
View raw message