accumulo-notifications mailing list archives

From "Keith Turner (JIRA)" <>
Subject [jira] [Commented] (ACCUMULO-2851) Import Table Operation removes files
Date Mon, 02 Jun 2014 21:39:02 GMT


Keith Turner commented on ACCUMULO-2851:

Suppose the following steps are used to export and import a table:

 # clone table1 to table_oe  (this step is optional, but it allows table1 to stay online)
 # offline table_oe
 # export table_oe
 # distcp
 # import

Import will rename the files that distcp copied.  As long as table_oe exists and is offline,
distcp can be run again to create another copy.  Alternatively, distcp could be run on the
destination cluster to create a second copy before the import:

 # clone table1 to table_oe  on cluster A
 # offline table_oe  on cluster A
 # export table_oe on cluster A
 # distcp from cluster A to cluster B
 # distcp from cluster B to cluster B  (creates a 2nd copy on cluster B)
 # import on cluster B
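
The six steps above can be sketched as a command plan. This is only an illustration, not a tested procedure: the helper name `export_import_plan`, the filesystem URIs, and every path are hypothetical, and the command spellings (`clonetable`, `offline -t`, `exporttable -t`, `importtable`, `hadoop distcp -f`) follow my understanding of the Accumulo shell and Hadoop distcp, so check them against the versions in use. The sketch builds the command strings and does not run anything.

```python
def export_import_plan(src_table, clone_table, dest_table,
                       export_dir, staging_dir, import_dir,
                       fs_a="hdfs://clusterA", fs_b="hdfs://clusterB"):
    """Return (where, command) pairs for the six-step workflow.

    All arguments are placeholders chosen by the caller; nothing is executed.
    """
    return [
        # 1. Clone so the original table can stay online.
        ("accumulo shell on A", f"clonetable {src_table} {clone_table}"),
        # 2. Take the clone offline so its files stop changing.
        ("accumulo shell on A", f"offline -t {clone_table}"),
        # 3. Export; this is assumed to write distcp.txt into export_dir.
        ("accumulo shell on A", f"exporttable -t {clone_table} {export_dir}"),
        # 4. Copy the exported files from cluster A to cluster B.
        ("command line, A to B",
         f"hadoop distcp -f {fs_a}{export_dir}/distcp.txt {fs_b}{staging_dir}"),
        # 5. Make a second copy on cluster B; import will consume this one.
        ("command line, B to B",
         f"hadoop distcp {fs_b}{staging_dir} {fs_b}{import_dir}"),
        # 6. Import from the second copy; the staging copy is left untouched.
        ("accumulo shell on B", f"importtable {dest_table} {import_dir}"),
    ]


if __name__ == "__main__":
    plan = export_import_plan("table1", "table_oe", "table1_copy",
                              "/tmp/table_oe_export",
                              "/tmp/table_oe_staging",
                              "/tmp/table_oe_import")
    for where, command in plan:
        print(f"[{where}] {command}")
```

Because import renames the files in the directory it is given, pointing it at the second copy leaves the staging copy on cluster B available for another import.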

I do not think Accumulo should attempt to replicate the functionality of distcp by copying
files before importing.

> Import Table Operation removes files
> ------------------------------------
>                 Key: ACCUMULO-2851
>                 URL:
>             Project: Accumulo
>          Issue Type: Wish
>    Affects Versions: 1.5.1, 1.6.0
>            Reporter: Andrew George Wells
>            Priority: Minor
>              Labels: easyfix
>             Fix For: 1.5.2, 1.6.1
>   Original Estimate: 12h
>  Remaining Estimate: 12h
> When importing a table, the code calls rename, which moves the files away. However, in
> some use cases, the user may need to keep the data intact. An option should be provided to
> not remove the data from the import directory.

This message was sent by Atlassian JIRA
