accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Christopher Tubbs (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (ACCUMULO-3369) Recursive Directory Imports
Date Thu, 21 Jun 2018 23:55:00 GMT

     [ https://issues.apache.org/jira/browse/ACCUMULO-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Christopher Tubbs resolved ACCUMULO-3369.
-----------------------------------------
    Resolution: Won't Fix

The new bulk import API (for 2.0) would make this trivial to implement on the client-side,
so I think it's safe to say that this probably won't be implemented baked in to Accumulo.

Note: It's currently possible in the old API to implement on the client side, but since it's
common to have a failure directory as a subdirectory, so it's a bit more sensible using the
new API which doesn't have a failure directory.

> Recursive Directory Imports
> ---------------------------
>
>                 Key: ACCUMULO-3369
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3369
>             Project: Accumulo
>          Issue Type: New Feature
>            Reporter: Ariel Valentin
>            Priority: Minor
>
> importDirectory does not recurse through subdirectories to import rifles. It would be
nice to add an option that would recurse through subdirectories or accept glob patterns to
allow the user to specify multiple directories to import.
> Here is a snippet of our email conversation from the mailing list to provide some context:
> {quote}
> Ah, ok. Thanks, Mike.
> It wouldn't be too bad to ensure that the failures are unique (we could preserve the
directory name in the failures dir). I thought that we assign a unique name on the backend
(b-XXXXXXX.rf) so I think that's ok now.
> What I'm boiling down to: sounds like a reasonable feature to request if you want to
file an issue on JIRA, Ariel. :)
> Mike Drob wrote:
> Name collision of failures and I think name collision of successes might
> cause problems sometimes too. Or maybe that's just with older versions.
> Regardless, having to write your own code puts it out of the realm of
> easy into at least middling territory - if import directory could
> natively handle recursion then it would become easy.
> On Tue, Nov 25, 2014 at 10:44 AM, Josh Elser  wrote:
>     What's the difficulty, Mike? Handling name collision of failures?
>     Mike Drob wrote:
>         Ariel,
>         There is not an easy way to do this recursively. Your best option is
>         going to be writing your own wrapper around the import command. If
>         you're using shell commands, this could be as easy as feeding the
>         results of 'find . -type d' into a script, or in Java you might
>         want to
>         look at DirectoryWalker in Apache Commons as possible solutions.
>         Mike
>         On Tue, Nov 25, 2014 at 10:22 AM, Ariel Valentin
>        
>              Hello!
>              We are running a couple of experiments using
>         importDirectory and are
>              curious if there is a simple way to import directories
>         recursively.
>              Based on looking at the source code it does not look like it
>              currently supports that feature:
>         (https://github.com/apache/__accumulo/blob/__1835c27ca41426ddd570cde14f9612__c45680b917/core/src/main/java/__org/apache/accumulo/core/__client/admin/__TableOperationsImpl.java
>         https://github.com/apache/accumulo/blob/1835c27ca41426ddd570cde14f9612c45680b917/core/src/main/java/org/apache/accumulo/core/client/admin/TableOperationsImpl.java
>              Are there plans to add it in the future? Or is there a
>         simple way to
>              do this right now?
> {quote}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message