hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Doug Cutting (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3307) Archives in Hadoop.
Date Sat, 26 Apr 2008 03:39:56 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12592551#action_12592551

Doug Cutting commented on HADOOP-3307:

> In har://hdfs-host:port/dir/foo.har/bar.har/file, what is the behavior if foo.har is
indeed a directory and bar.har is an archive?

As I said before, I think it would be nice and not too difficult to make nested archives work.
 Not essential, but convenient if its not too difficult.  So if you have hdfs://h:p/bar/*
and you pack it into hdfs://foo/bar.har, then you pack that into hdfs://h:p/foo/* into hdfs://h:p/dir/foo.har,
then har://hdfs-h:p/dir/foo.har/bar.har/file should either (a) contain the content of the
original file if we implement nested archives, or (b) throw FileNotFoundException if we don't
implement nested archives.  Is that what you were asking?

> Can we distinguish a directory ending with .har to be an archive only if it has index
file in it.

If a path component of a har: uri ends with ".har" then I think it should be an error if it
is not a ".har" format directory.  It's fine to have files named .har in HDFS that are not
har-format, but if one tries to access them using the archive mechanism, we shouldn't silently
ignore them, but rather throw a MalformedArchive exception, no?

> Archives in Hadoop.
> -------------------
>                 Key: HADOOP-3307
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3307
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: fs
>            Reporter: Mahadev konar
>            Assignee: Mahadev konar
>             Fix For: 0.18.0
> This is a new feature for archiving and unarchiving files in HDFS. 

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message