hadoop-hdfs-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Wang (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-5096) Automatically cache new data added to a cached path
Date Wed, 14 Aug 2013 21:19:49 GMT
Andrew Wang created HDFS-5096:
---------------------------------

             Summary: Automatically cache new data added to a cached path
                 Key: HDFS-5096
                 URL: https://issues.apache.org/jira/browse/HDFS-5096
             Project: Hadoop HDFS
          Issue Type: Sub-task
            Reporter: Andrew Wang


For some applications, it's convenient to specify a path to cache, and have HDFS automatically
cache new data added to the path without sending a new caching request or a manual refresh
command.

One example is new data appended to a cached file. It would be nice to re-cache a block at
the new appended length, and cache new blocks added to the file.

Another example is a cached Hive partition directory, where a user can drop new files directly
into the partition. It would be nice if these new files were cached.

In both cases, this automatic caching would happen after the file is closed, i.e. block replica
is finalized.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message