accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Keith Turner (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-2368) Addsplits to an offline table
Date Fri, 20 Jun 2014 20:38:24 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-2368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14039319#comment-14039319
] 

Keith Turner commented on ACCUMULO-2368:
----------------------------------------

There is an optimization that could be done.  When an online tablet splits, if a file does
not contain data for a child then its not referenced by the child.  Offline split could do
this same analysis.  There is a drawback to this though, it could slow down the operation
in the case where many tablets are being split.   It would be nice to do this check if one
tablet with a few files were split into 100,000 tablets.  In the case where 100,000 tablets
are split into 200,000 tablets, would not want to do this check on a single node.

> Addsplits to an offline table
> -----------------------------
>
>                 Key: ACCUMULO-2368
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2368
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: master
>            Reporter: John Vines
>            Assignee: Sean Busbey
>             Fix For: 1.7.0
>
>
> Currently a table must be online to addsplits. Firstly, it's relatively slow. Secondly,
it could be a LOT faster to do it to an offline table because it's just a few metadata writes
per split point.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message