hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tsz Wo (Nicholas), SZE (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HDFS-219) Add md5sum facility in dfsshell
Date Thu, 24 Feb 2011 22:46:38 GMT

    [ https://issues.apache.org/jira/browse/HDFS-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12999099#comment-12999099

Tsz Wo (Nicholas), SZE commented on HDFS-219:

> I think it would be usefull to add md5sum ...

Computing a single md5 over a file (i.e. md5sum) is inefficient when the file is large since
the md5 algorithm cannot be executed in parallel.  This is also the reason that we have {{MD5MD5CRC32FileChecksum}}
in HDFS.

> Add md5sum facility in dfsshell
> -------------------------------
>                 Key: HDFS-219
>                 URL: https://issues.apache.org/jira/browse/HDFS-219
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: zhangwei
> I think it would be usefull to add md5sum (or anyone else) to dfsshell ,and the facility
can verify the file on hdfs.It can confirm the file is integrity after copyFromLocal or copyToLocal.

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message