hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "eric baldeschwieler (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-2715) Review and document '_' prefix convention in input directories
Date Fri, 25 Jan 2008 22:04:34 GMT
Review and document '_' prefix convention in input directories

                 Key: HADOOP-2715
                 URL: https://issues.apache.org/jira/browse/HADOOP-2715
             Project: Hadoop Core
          Issue Type: Bug
            Reporter: eric baldeschwieler

We use files and directories prefixed with '_' to store logs, metadata and other info that
might be useful to the owner of a job within the output directory.  The standard input methods
then ignore such files by default.

HADOOP-2391 lead to some discussion of the '_' convention in output directories.  No all developers
input formats are supporting this.  We should review the convention and document it well so
that future input methods support it.  Or we should come up with an alternate approach.  

My hope is that after some discuss we will close this bug by creating a documentation patch
explaining the convention.

It sounds like the convention is implemented via some input filter classes.  We should discuss
if this generic solution is helping or obscuring the intent of the convention.  Perhaps we
should just have a non-configurable filter, so '_' prefixed files are treated like '.' prefixed
files by most unix tools.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message