accumulo-notifications mailing list archives

From "Eric Newton (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ACCUMULO-1802) use case for future configurability of major compactions
Date Tue, 22 Oct 2013 18:24:47 GMT
Eric Newton created ACCUMULO-1802:
-------------------------------------

             Summary: use case for future configurability of major compactions
                 Key: ACCUMULO-1802
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1802
             Project: Accumulo
          Issue Type: Sub-task
          Components: tserver
            Reporter: Eric Newton


The default compaction strategy tends to put the oldest data in the largest files.
This leads to a lot of work when it is time to age off data, because the largest files
have to be rewritten to remove the expired entries.

One could imagine a compaction strategy that would split data into separate files based on
the timestamp.  Additionally, if the min/max timestamps for a file were known, old data could
be aged off by deleting whole files.
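
As a rough illustration of the whole-file age-off idea, here is a minimal Python sketch.
It assumes each file carries a known min/max timestamp; FileInfo and plan_age_off are
hypothetical names for illustration only and are not part of Accumulo.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FileInfo:
    """Hypothetical per-file metadata: name plus min/max entry timestamps."""
    name: str
    min_ts: int
    max_ts: int


def plan_age_off(files, cutoff_ts):
    """Classify files against an age-off cutoff.

    Returns (drop, rewrite, keep):
      drop    - every entry is older than the cutoff, so the file can be deleted whole
      rewrite - the file straddles the cutoff, so it still needs a filtering compaction
      keep    - no entry is older than the cutoff
    """
    drop, rewrite, keep = [], [], []
    for f in files:
        if f.max_ts < cutoff_ts:
            drop.append(f)
        elif f.min_ts < cutoff_ts:
            rewrite.append(f)
        else:
            keep.append(f)
    return drop, rewrite, keep
```

The narrower the timestamp range of each file, the more files fall into the first
group and the less data has to be rewritten.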

Augment the configurable compaction strategy to support multiple output files, and to save
and use extra metadata in each file.
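
A compaction with multiple outputs might look roughly like the following Python sketch.
The fixed-width time buckets, the (key, timestamp, value) entry shape, and the function
name compact_by_time_bucket are all assumptions made for illustration; none of this is
existing Accumulo API.

```python
from collections import defaultdict


def compact_by_time_bucket(entries, bucket_width):
    """Group (key, timestamp, value) entries into one output per time bucket.

    Each output is a dict holding its entries (sorted by key, then timestamp)
    together with the min/max timestamp metadata for that output.
    """
    buckets = defaultdict(list)
    for key, ts, value in entries:
        buckets[ts // bucket_width].append((key, ts, value))

    outputs = []
    for bucket_id in sorted(buckets):
        rows = sorted(buckets[bucket_id], key=lambda r: (r[0], r[1]))
        timestamps = [ts for _, ts, _ in rows]
        outputs.append({
            "entries": rows,
            "min_ts": min(timestamps),
            "max_ts": max(timestamps),
        })
    return outputs
```

Recording min_ts and max_ts at write time is what would let a later age-off pass decide
about a file without reading its contents.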



--
This message was sent by Atlassian JIRA
(v6.1#6144)
