hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Malaska (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-6383) Upgrade S3n s3.fs.buffer.dir to suppoer multi directories
Date Tue, 13 May 2014 14:33:20 GMT

     [ https://issues.apache.org/jira/browse/HDFS-6383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Ted Malaska updated HDFS-6383:

    Status: Patch Available  (was: Open)

Patch is ready

> Upgrade S3n s3.fs.buffer.dir to suppoer multi directories
> ---------------------------------------------------------
>                 Key: HDFS-6383
>                 URL: https://issues.apache.org/jira/browse/HDFS-6383
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Ted Malaska
>            Priority: Minor
>         Attachments: HDFS-6383.patch
> s3.fs.buffer.dir defines the tmp folder where files will be written to before getting
sent to S3.  Right now this is limited to a single folder which causes to major issues.
> 1. You need a drive with enough space to store all the tmp files at once
> 2. You are limited to the IO speeds of a single drive
> This solution will resolve both and has been tested to increase the S3 write speed by
2.5x with 10 mappers on hs1.

This message was sent by Atlassian JIRA

View raw message