hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Steve Loughran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-13811) s3a: getFileStatus fails with com.amazonaws.AmazonClientException: Failed to sanitize XML document destined for handler class
Date Fri, 11 Nov 2016 18:31:58 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-13811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15657744#comment-15657744
] 

Steve Loughran commented on HADOOP-13811:
-----------------------------------------

I think what's happening here is that one thread is completing/shutting stuff down, but spark
streaming is still scanning for new files. The thread's been interrupted on the read, which
raises an AbortedException() (which extends AmazonClientException. Your XML parser catches
it, but then throws a new AmazonClientException, rather than just rethrow the one that came
in

[https://github.com/aws/aws-sdk-java/blob/master/aws-java-sdk-s3/src/main/java/com/amazonaws/services/s3/model/transform/XmlResponsesSaxParser.java#L177]

This isn't generating very meaningful messages, not when "thread interrupted" is what we'd
like to see

> s3a: getFileStatus fails with com.amazonaws.AmazonClientException: Failed to sanitize
XML document destined for handler class
> -----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-13811
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13811
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 2.8.0
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>
> Sometimes, occasionally, getFileStatus() fails with a stack trace starting with {{com.amazonaws.AmazonClientException:
Failed to sanitize XML document destined for handler class}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org


Mime
View raw message