accumulo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Keith Turner (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (ACCUMULO-427) Data lost when tablets moving around frequently
Date Mon, 27 Feb 2012 19:04:46 GMT

     [ https://issues.apache.org/jira/browse/ACCUMULO-427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Keith Turner updated ACCUMULO-427:
----------------------------------

    Description: The shard random walk test failed when verifiy its new index.  This test
has two tables a document table and a sharded index table used to find documents.  The test
has a node that rebuilds the index from the document table and then verifies that the new
and old index are the same.  This verification failed.  The failure was all realted to data
loss in one tablet in the new index table.  The data that was lost was read from two tablets
in the document table.  None of the lost data appeared in any write ahead logs.  The tablet
that lost data was being moved around very frequently during the time of the data loss.  All
of the evidence points to a bug in the batch writer or the tablet server code related to writing
data.  (was: The shard random walk test failed when verifiy its new index.  This test has
two tables a document table and a sharded index table used to find documents.  The test has
a node that rebuilds the index from the document table and then verifies that the new and
old index are the same.  This verification failed.  The failure was all realted to data loss
in one tablet in the new index table.  The data that was lost was read from two tablets in
the document table.  None of the lost data appeared in any write ahead logs.  The tablet that
last data was being moved around very frequently during the time of the data loss.  All of
the evidence points to a bug in the batch writer or the tablet server code related to writing
data.)
    
> Data lost when tablets moving around frequently
> -----------------------------------------------
>
>                 Key: ACCUMULO-427
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-427
>             Project: Accumulo
>          Issue Type: Bug
>          Components: tserver
>         Environment: 10 node cluster running random walk test w/ agitation
>            Reporter: Keith Turner
>            Assignee: Keith Turner
>            Priority: Blocker
>              Labels: 14_qa_bug
>             Fix For: 1.4.0
>
>
> The shard random walk test failed when verifiy its new index.  This test has two tables
a document table and a sharded index table used to find documents.  The test has a node that
rebuilds the index from the document table and then verifies that the new and old index are
the same.  This verification failed.  The failure was all realted to data loss in one tablet
in the new index table.  The data that was lost was read from two tablets in the document
table.  None of the lost data appeared in any write ahead logs.  The tablet that lost data
was being moved around very frequently during the time of the data loss.  All of the evidence
points to a bug in the batch writer or the tablet server code related to writing data.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Mime
View raw message