accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Newton (JIRA)" <>
Subject [jira] [Created] (ACCUMULO-1802) use case for future configurability of major compactions
Date Tue, 22 Oct 2013 18:24:47 GMT
Eric Newton created ACCUMULO-1802:

             Summary: use case for future configurability of major compactions
                 Key: ACCUMULO-1802
             Project: Accumulo
          Issue Type: Sub-task
          Components: tserver
            Reporter: Eric Newton

The default compaction strategy has a tendency to put the oldest data in the largest files.
 This leads to a lot of work when it is time to age off data.

One could imaging a compaction strategy that would split data into separate files based on
the timestamp.  Additionally, if the min/max timestamps for a file were known, old data could
be aged off by deleting whole files.

Augment the configurable compaction strategy to support multiple output files, and saving/using
extra metadata in each file.

This message was sent by Atlassian JIRA

View raw message