impala-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dan Hecht (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (IMPALA-4864) Speed up binary predicates against dictionary encoded Parquet data by converting the predicates to their codewords
Date Fri, 17 Mar 2017 19:37:41 GMT

     [ https://issues.apache.org/jira/browse/IMPALA-4864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Dan Hecht reassigned IMPALA-4864:
---------------------------------

    Assignee: Zach Amsden

> Speed up binary predicates against dictionary encoded Parquet data by converting the
predicates to their codewords
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: IMPALA-4864
>                 URL: https://issues.apache.org/jira/browse/IMPALA-4864
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: Impala 2.9.0
>            Reporter: Mostafa Mokhtar
>            Assignee: Zach Amsden
>              Labels: performance
>
> Selective binary predicates against dictionary-encoded columns can be speeded up by converting
the original predicates on the column type to predicates on the dictionary codewords, this
should help avoid expensive comparisons. 
> Similar to Kudu 
> https://kudu.apache.org/2016/09/16/predicate-pushdown.html
> https://github.com/cloudera/kudu/commit/c0f37278cb09a7781d9073279ea54b08db6e2010
> https://github.com/cloudera/kudu/commit/ec80fdb37be44d380046a823b5e6d8e2241ec3da



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message