hadoop-hdfs-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Malaska (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-6383) Upgrade S3n s3.fs.buffer.dir to suppoer multi directories
Date Tue, 13 May 2014 14:29:16 GMT
Ted Malaska created HDFS-6383:

             Summary: Upgrade S3n s3.fs.buffer.dir to suppoer multi directories
                 Key: HDFS-6383
                 URL: https://issues.apache.org/jira/browse/HDFS-6383
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: Ted Malaska
            Priority: Minor

s3.fs.buffer.dir defines the tmp folder where files will be written to before getting sent
to S3.  Right now this is limited to a single folder which causes to major issues.

1. You need a drive with enough space to store all the tmp files at once
2. You are limited to the IO speeds of a single drive

This solution will resolve both and has been tested to increase the S3 write speed by 2.5x
with 10 mappers on hs1.

This message was sent by Atlassian JIRA

View raw message