accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sean Busbey (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (ACCUMULO-2485) HDFS permissions error leaves root_tablet in inconsistent state
Date Mon, 17 Mar 2014 15:53:43 GMT

     [ https://issues.apache.org/jira/browse/ACCUMULO-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sean Busbey updated ACCUMULO-2485:
----------------------------------

    Attachment: ACCUMULO-2485-tserver.log

I was going to include the GC log, but browsing it looks like the GC doesn't handle deletes
for the root tablet. (from discussing with [~mdrob] I think this is expected).

root_tablet directory in HDFS during failure:

{noformat}

-bash-4.1$ hadoop dfs -ls '/accumulo/tables/!0/root_tablet'
Found 6 items
-rw-r--r--   5 accumulo accumulo        589 2014-03-17 07:52 /accumulo/tables/!0/root_tablet/A0000001.rf
-rw-r--r--   5 accumulo accumulo        670 2014-03-17 07:56 /accumulo/tables/!0/root_tablet/F000000b.rf
-rw-r--r--   5 accumulo accumulo        685 2014-03-17 07:59 /accumulo/tables/!0/root_tablet/F000000j.rf
-rw-r--r--   5 accumulo accumulo        693 2014-03-17 08:01 /accumulo/tables/!0/root_tablet/F000000o.rf
-rw-r--r--   5 accumulo accumulo        675 2014-03-17 08:01 /accumulo/tables/!0/root_tablet/F000000q.rf
-rw-r--r--   5 accumulo accumulo        664 2014-03-17 08:06 /accumulo/tables/!0/root_tablet/F0000011.rf
-bash-4.1$ hadoop dfs -ls '/accumulo/tables/!0/root_tablet'
Found 9 items
-rw-r--r--   5 accumulo accumulo        589 2014-03-17 07:52 /accumulo/tables/!0/root_tablet/A0000001.rf
-rw-r--r--   5 accumulo accumulo        670 2014-03-17 07:56 /accumulo/tables/!0/root_tablet/F000000b.rf
-rw-r--r--   5 accumulo accumulo        685 2014-03-17 07:59 /accumulo/tables/!0/root_tablet/F000000j.rf
-rw-r--r--   5 accumulo accumulo        693 2014-03-17 08:01 /accumulo/tables/!0/root_tablet/F000000o.rf
-rw-r--r--   5 accumulo accumulo        675 2014-03-17 08:01 /accumulo/tables/!0/root_tablet/F000000q.rf
-rw-r--r--   5 accumulo accumulo        664 2014-03-17 08:06 /accumulo/tables/!0/root_tablet/F0000011.rf
-rw-r--r--   5 accumulo accumulo        666 2014-03-17 08:11 /accumulo/tables/!0/root_tablet/F000001c.rf
-rw-r--r--   5 accumulo accumulo        675 2014-03-17 08:16 /accumulo/tables/!0/root_tablet/F000001n.rf
-rw-r--r--   5 accumulo accumulo        655 2014-03-17 08:21 /accumulo/tables/!0/root_tablet/F000001y.rf
{noformat}

and after recovery:

{noformat}
-bash-4.1$ hadoop dfs -ls '/accumulo/tables/!0/root_tablet'
Found 2 items
-rw-r--r--   5 accumulo accumulo        589 2014-03-17 07:52 /accumulo/tables/!0/root_tablet/A0000001.rf
-rw-r--r--   5 accumulo accumulo        768 2014-03-17 08:47 /accumulo/tables/!0/root_tablet/A000002q.rf
{noformat}

I presume leaving the now defunct A0000001.rf file around won't harm anything.

> HDFS permissions error leaves root_tablet in inconsistent state
> ---------------------------------------------------------------
>
>                 Key: ACCUMULO-2485
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2485
>             Project: Accumulo
>          Issue Type: Bug
>    Affects Versions: 1.4.5
>         Environment: 1.4.5-SNAPSHOT on 6faac42, Hadoop 1 via CDH3u6
>            Reporter: Sean Busbey
>              Labels: 14_qa_bug
>             Fix For: 1.4.6
>
>         Attachments: ACCUMULO-2485-tserver.log
>
>
> While setting up a test cluster, I mistakenly left out a home directory for the Accumulo
user on a cluster set up with HDFS Trash enabled.
> After successfully writing out the new file for a MajC on the root tablet, some part
of the cleanup of old files failed when moving to Trash didn't work. The result was that the
tablet still listed one of the original files, but that file was no longer in HDFS.
> All future MajCs failed with file not found. Attaching representative log sample.
> I don't know if this impacts future versions.
> Workaround: ACCUMULO-1219 (copied the result of the initial MajC to the files named as
its source)



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message