asterixdb-notifications mailing list archives

From "Till (JIRA)" <>
Subject [jira] [Updated] (ASTERIXDB-2252) Improve scan efficiency of LSM components
Date Wed, 14 Mar 2018 23:22:00 GMT


Till updated ASTERIXDB-2252:
    Labels: triaged  (was: )

> Improve scan efficiency of LSM components
> -----------------------------------------
>                 Key: ASTERIXDB-2252
>                 URL:
>             Project: Apache AsterixDB
>          Issue Type: Improvement
>          Components: STO - Storage
>            Reporter: Chen Luo
>            Assignee: Chen Luo
>            Priority: Major
>              Labels: triaged
> The current (full) scan on LSM components is not very efficient, especially on hard disks,
> in two respects:
>  # We often need to use a priority queue to merge results from multiple components. However,
> we only read one page at a time, which incurs a lot of random I/O overhead on hard disks.
>  # A full scan can often fill up (and so clean out) the buffer cache. This problem is especially
> notable when we do a merge. After a merge operation, the buffer cache is filled with pages
> of old components, which will not be accessed by future queries.
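
To illustrate the first point, here is a minimal Python sketch, not AsterixDB code. It assumes each component is a sorted list of fixed-size pages and treats every read request as one disk seek. `ComponentReader`, `readahead`, `build_components`, and `count_io` are hypothetical names made up for this illustration. The sketch merges three components through a priority queue and counts read requests when fetching one page per request versus several.

```python
import heapq


class ComponentReader:
    """Hypothetical cursor over one sorted component stored as a list of pages."""

    def __init__(self, pages, readahead=1):
        self.pages = pages          # each page is a sorted list of keys
        self.readahead = readahead  # pages fetched per read request (assumed knob)
        self.next_page = 0
        self.io_requests = 0        # each request stands in for one disk seek

    def __iter__(self):
        while self.next_page < len(self.pages):
            # Fetch up to `readahead` consecutive pages in a single request.
            batch = self.pages[self.next_page:self.next_page + self.readahead]
            self.next_page += len(batch)
            self.io_requests += 1
            for page in batch:
                yield from page


def merged_scan(readers):
    """Merge the sorted components; heapq.merge keeps a heap of cursor heads."""
    return heapq.merge(*readers)


def build_components():
    """Three toy components, 4 pages each, 2 keys per page, keys interleaved."""
    components = []
    for c in range(3):
        keys = list(range(c, 24, 3))
        components.append([keys[i:i + 2] for i in range(0, len(keys), 2)])
    return components


def count_io(readahead):
    """Run a full merged scan and return (merged keys, total read requests)."""
    readers = [ComponentReader(pages, readahead) for pages in build_components()]
    result = list(merged_scan(readers))
    return result, sum(r.io_requests for r in readers)


if __name__ == "__main__":
    for ra in (1, 4):
        _, requests = count_io(ra)
        print(f"readahead={ra}: {requests} read requests")
```

In this toy setup the merged output is identical in both cases, while the number of read requests falls from 12 to 3 when four pages are fetched per request. Because the merge alternates between components, each request is likely to land on a different file, which is why fewer, larger requests matter on hard disks. The sketch ignores duplicate keys across components and the buffer cache entirely.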

This message was sent by Atlassian JIRA
