accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Keith Turner (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ACCUMULO-4066) Conditional mutation processing performance could be improved.
Date Fri, 20 Nov 2015 23:13:11 GMT
Keith Turner created ACCUMULO-4066:
--------------------------------------

             Summary: Conditional mutation processing performance could be improved.
                 Key: ACCUMULO-4066
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4066
             Project: Accumulo
          Issue Type: Improvement
          Components: tserver
    Affects Versions: 1.7.0, 1.6.4
            Reporter: Keith Turner
            Assignee: Keith Turner
             Fix For: 1.8.0


When processing conditional mutations tablets reads are done.   The way the current implementation
does tablet reads has a lot of overhead.   For each condition the following is done :

 * Opens and reserves iterators files.
 * Parse table iterators from table config (involves scanning and filtering entire table config)
 * Merges condition iterators and table iterators
 * Constructs iterator stack.

I created a branch where these operations (except for constructing iterator stack) are done
per tablet and/or per batch of conditional mutations.   Doing this I am seeing a 3x speed
up in conditional mutation processing rates when data is cached.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message