hawq-issues mailing list archives

From "Kuien Liu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HAWQ-1660) optimize parquet scan
Date Thu, 20 Sep 2018 08:33:00 GMT

     [ https://issues.apache.org/jira/browse/HAWQ-1660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kuien Liu updated HAWQ-1660:
----------------------------
    Priority: Minor  (was: Major)

> optimize parquet scan
> ---------------------
>
>                 Key: HAWQ-1660
>                 URL: https://issues.apache.org/jira/browse/HAWQ-1660
>             Project: Apache HAWQ
>          Issue Type: Improvement
>          Components: Storage
>            Reporter: Kuien Liu
>            Assignee: Radar Lei
>            Priority: Minor
>
> I saw Mr. Wen Lin's work on the Bloom filter, which pushes the filter down into the parquet scan.
> It provides a chance to accelerate the scan further: read the hash-key attributes first and
> apply the filter; if the result is positive, fetch the remaining columns from parquet. Does this make sense?
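
The proposal above can be sketched roughly as follows. This is only an illustration in Python, not HAWQ code: the `BloomFilter` class, the `two_phase_scan` function, and the dict-of-lists stand-in for a parquet row group are all hypothetical names introduced here, and a real implementation would work on column chunks inside the scan operator.

```python
import hashlib


class BloomFilter:
    """Tiny illustrative Bloom filter: false positives are possible, false negatives are not."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for simplicity

    def _positions(self, key):
        # Derive num_hashes bit positions from seeded SHA-256 digests of the key.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def might_contain(self, key):
        return all(self.bits[pos] for pos in self._positions(key))


def two_phase_scan(row_group, key_column, bloom):
    """Scan a row group (assumed here to be a dict: column name -> list of values).

    Phase 1 reads only the hash-key column and probes the filter.
    Phase 2 fetches the remaining columns, but only for rows that passed.
    """
    keys = row_group[key_column]
    hits = [i for i, key in enumerate(keys) if bloom.might_contain(key)]
    if not hits:
        return []  # no candidate rows, so the other columns are never read

    other_columns = [name for name in row_group if name != key_column]
    return [
        {key_column: keys[i], **{name: row_group[name][i] for name in other_columns}}
        for i in hits
    ]


# Usage: the filter is built from the join keys, then applied during the scan.
bloom = BloomFilter()
for join_key in (2, 4):
    bloom.add(join_key)

row_group = {"id": [1, 2, 3, 4], "name": ["a", "b", "c", "d"]}
rows = two_phase_scan(row_group, "id", bloom)
```

The saving comes from phase 2: when few or no keys pass the filter, most of the non-key column data is never fetched or decoded. Because a Bloom filter can report false positives, the rows returned here still need the exact join or predicate check afterwards.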



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
