hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daryn Sharp (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-3577) WebHdfsFileSystem can not read files larger than 24KB
Date Tue, 17 Jul 2012 13:47:36 GMT

    [ https://issues.apache.org/jira/browse/HDFS-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416186#comment-13416186
] 

Daryn Sharp commented on HDFS-3577:
-----------------------------------

If the content-length check is removed and content-length is not present, transfers >2GB
will fail unless the client's read timeout is greater than the server's connection idle timeout
(200s), and partial downloads will be considered successful.

I'd suggest the check be left in place on this jira, but it only generates an exception if
it's not a chunked download.  We can then decide what to do on HDFS-3671. 
                
> WebHdfsFileSystem can not read files larger than 24KB
> -----------------------------------------------------
>
>                 Key: HDFS-3577
>                 URL: https://issues.apache.org/jira/browse/HDFS-3577
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: hdfs client
>    Affects Versions: 0.23.3, 2.0.0-alpha
>            Reporter: Alejandro Abdelnur
>            Assignee: Tsz Wo (Nicholas), SZE
>            Priority: Blocker
>         Attachments: h3577_20120705.patch, h3577_20120708.patch, h3577_20120714.patch,
h3577_20120716.patch
>
>
> If reading a file large enough for which the httpserver running webhdfs/httpfs uses chunked
transfer encoding (more than 24K in the case of webhdfs), then the WebHdfsFileSystem client
fails with an IOException with message *Content-Length header is missing*.
> It looks like WebHdfsFileSystem is delegating opening of the inputstream to *ByteRangeInputStream.URLOpener*
class, which checks for the *Content-Length* header, but when using chunked transfer encoding
the *Content-Length* header is not present and  the *URLOpener.openInputStream()* method thrown
an exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message