spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Apache Spark (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-25702) Push down filters with `Not` operator in Parquet
Date Wed, 10 Oct 2018 10:00:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-25702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16644767#comment-16644767
] 

Apache Spark commented on SPARK-25702:
--------------------------------------

User 'gengliangwang' has created a pull request for this issue:
https://github.com/apache/spark/pull/22687

> Push down filters with `Not` operator in Parquet
> ------------------------------------------------
>
>                 Key: SPARK-25702
>                 URL: https://issues.apache.org/jira/browse/SPARK-25702
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.4.0
>            Reporter: Gengliang Wang
>            Priority: Major
>
> Currently, in ParquetFilters, predicates inside `Not` operator are considered as unable
to perform partial push down.
> However, the following cases is still possible for push down:
> 1. `Not(Or(left, right))` can be conversed as `And(Not(left), Not(right))`
> 2. `Not(Not(pred))` can be conversed as `pred`
> Both cases should be quite trivial, since the `Not` operator should be pushed down by
optimization rule `BooleanSimplification` already.
> But I think it should be good to handle such cases in Parquet data source module as well.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message