hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Konstantin Shvachko (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-5675) Add Mkdirs operation to NNThroughputBenchmark
Date Wed, 18 Dec 2013 01:26:07 GMT

    [ https://issues.apache.org/jira/browse/HDFS-5675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13851211#comment-13851211
] 

Konstantin Shvachko commented on HDFS-5675:
-------------------------------------------

mkdirs is indeed an interesting operation for benchmarking as one of the simplest modification
of the namespace.
Comments on the patch:
# The operation should be called mkdirs not mkdir. And the Stats class should be {{MkdirsStats}},
no "File". Don't forget to update op name and usage constants.
# You are now creating a flat collection of directories. Would be good to support multi-level
structure, same as {{-filesPerDir}} in {{CreateFileStats}}. Too many entries in one directory
can be a performance issues by itself, so it is good to have flexibility.

Also, please adjust jira fields.

> Add Mkdirs operation to NNThroughputBenchmark
> ---------------------------------------------
>
>                 Key: HDFS-5675
>                 URL: https://issues.apache.org/jira/browse/HDFS-5675
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: benchmarks
>            Reporter: Plamen Jeliazkov
>            Assignee: Plamen Jeliazkov
>            Priority: Minor
>             Fix For: 3.0.0
>
>         Attachments: mkdirsBenchmarkPatchTrunk.patch
>
>
> I did some work to extend NNThroughputBenchmark that I would like to contribute to the
community. It is pretty straightforward; just adding a Mkdir operation to the test in order
to see the operations per second of a multiple 'mkdir' commands.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

Mime
View raw message