crunch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Josh Wills (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CRUNCH-299) Support predicate pushdown for Parquet sources
Date Fri, 22 Nov 2013 15:36:36 GMT

    [ https://issues.apache.org/jira/browse/CRUNCH-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13830054#comment-13830054
] 

Josh Wills commented on CRUNCH-299:
-----------------------------------

So we could either a) add an option to give a ColumnRecordFilter to the Parquet Source, or
b) create a subclass of FilterFn that knows how to express ColumnRecordFilters.

Right now, the planner doesn't generally do anything special to handle the children of a DoFn,
with the exception of CombineFns around GBK operations. If we go with b), we'd need to add
some planner functionality to support it.

> Support predicate pushdown for Parquet sources
> ----------------------------------------------
>
>                 Key: CRUNCH-299
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-299
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Tom White
>            Assignee: Josh Wills
>
> We should be able to push Crunch FilterFn down to a Parquet ColumnRecordFilter. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message