hadoop-common-issues mailing list archives

From "Rajesh Balamohan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-14081) S3A: Consider avoiding array copy in S3ABlockOutputStream (ByteArrayBlock)
Date Mon, 20 Feb 2017 23:47:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-14081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15875140#comment-15875140 ]

Rajesh Balamohan commented on HADOOP-14081:

Thanks [~stevel@apache.org]. I ran with "mvn test -Dtest=ITestS\* -Dscale". I should have
used the scale test parameter for the huge file size upload.

> S3A: Consider avoiding array copy in S3ABlockOutputStream (ByteArrayBlock)
> --------------------------------------------------------------------------
>                 Key: HADOOP-14081
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14081
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>            Priority: Minor
>             Fix For: 2.8.0
>         Attachments: HADOOP-14081.001.patch
> In {{S3ADataBlocks::ByteArrayBlock}}, data is copied whenever {{startUpload}} is called.
> It might be possible to directly access the byte[] array from ByteArrayOutputStream.
> Might have to extend ByteArrayOutputStream and create a method like getInputStream()
> which can return ByteArrayInputStream. This would avoid expensive array copy during large
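
The idea in the description above can be sketched roughly as follows. This is a minimal illustration and not the attached patch: the class name DirectByteArrayOutputStream is hypothetical, and only the getInputStream() method name comes from the issue text. It relies on the protected {{buf}} and {{count}} fields of java.io.ByteArrayOutputStream, and it assumes the caller is done writing to the block when the upload starts, so the buffer is handed off rather than shared.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;

/**
 * Hypothetical subclass that exposes the internal buffer as an input
 * stream without the copy that toByteArray() would make.
 */
class DirectByteArrayOutputStream extends ByteArrayOutputStream {

    DirectByteArrayOutputStream(int size) {
        super(size);
    }

    /**
     * Wraps the bytes written so far in a ByteArrayInputStream that reads
     * directly from the internal array. The array is then detached from
     * this stream (assumption: a handed-off block is not written again),
     * so later writes cannot change what the reader sees.
     */
    synchronized ByteArrayInputStream getInputStream() {
        ByteArrayInputStream in = new ByteArrayInputStream(buf, 0, count);
        buf = new byte[0];   // detach: the input stream now owns the old array
        count = 0;
        return in;
    }
}
```

With toByteArray() the full buffer is duplicated before it can be read; here the reader and the writer share one array until the hand-off, at the cost of the output stream starting from an empty buffer if it is reused.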
