drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From jinfengni <...@git.apache.org>
Subject [GitHub] drill pull request #637: Drill 1950 : Parquet row group filter pushdown.
Date Mon, 31 Oct 2016 05:21:16 GMT
GitHub user jinfengni opened a pull request:

    https://github.com/apache/drill/pull/637

    Drill 1950 : Parquet row group filter pushdown. 

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jinfengni/incubator-drill DRILL-1950

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/637.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #637
    
----
commit e87f6c616bf8921867a283a2a46865e2eca5c917
Author: AdamPD <adam@pharmadata.net.au>
Date:   2016-06-30T21:09:53Z

    DRILL-1950: Use three new classes from Adam's patch : the rule and filter builder.

commit 86fd14b19d8d1ac06a01361b8173d19e15541afd
Author: Jinfeng Ni <jni@apache.org>
Date:   2016-10-31T04:30:12Z

    DRILL-1950: Update parquet metadata cache format to include both min/max and additional
column type information.
    
        Parquet meta cache format change:
        1. include both min/max in ColumnMetaData if column statistics is available,
        2. include precision/scale/repetionLevel in ColumnTypeMetaData (precision/scale is
for future use).

commit 242c45e1b1b531a1c7ec3c50438e6a33fe97a711
Author: Jinfeng Ni <jni@apache.org>
Date:   2016-10-31T04:30:37Z

    DRILL-1950: Parquet rowgroup level filter pushdown.
    
    Implement Parquet rowgroup level filter pushdown. The filter pushdown is performed in
    query planning time, in Drill physical planning phase.
    
    Only a local filter, which refers to columns in a single table, is qualified for filter
pushdown.
    
    A filter may be elligible if it is a simple compare filter, or a compound "and/or" filter
consists of
    simple compare filter. Data types allowed in compare filter are int, bigint, float, double,
date,
    timestamp, time. Compare operators are =, !=, <, <=, >, >=. Operands have
to be a column of the above
    data types, or an explicit cast or implicit cast function, or a constant expressions.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

Mime
View raw message