hive-dev mailing list archives

From "Dong Chen (JIRA)" <>
Subject [jira] [Created] (HIVE-10257) Ensure Parquet Hive has null optimization
Date Wed, 08 Apr 2015 03:38:12 GMT
Dong Chen created HIVE-10257:

             Summary: Ensure Parquet Hive has null optimization
                 Key: HIVE-10257
             Project: Hive
          Issue Type: Sub-task
            Reporter: Dong Chen
            Assignee: Dong Chen

Parquet statistics record a boolean value, {{hasNonNullValue}}, for each column chunk.
Hive could use this value to skip a column, avoid null-checking logic, and speed up vectorization,
as in HIVE-4478 (future work that is not yet complete).

In this Jira we could check whether this null optimization works, and make changes if any are needed.
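
As a rough illustration of the idea, the sketch below shows how a reader might choose a strategy per column chunk from its statistics. It is not Hive or Parquet code: {{ChunkStats}}, {{ReadPlan}} and {{plan}} are hypothetical names, and {{ChunkStats}} is only a stand-in for the Parquet statistics object. It also assumes that a chunk should be treated as all-null only when the null count equals the value count, since {{hasNonNullValue}} being false could also mean that no statistics were written.

```java
class NullSkipSketch {

    // Hypothetical stand-in for the per-column-chunk Parquet statistics.
    static final class ChunkStats {
        final boolean hasNonNullValue; // mirrors the flag named in this issue
        final long numNulls;           // number of null values in the chunk
        final long valueCount;         // total number of values in the chunk

        ChunkStats(boolean hasNonNullValue, long numNulls, long valueCount) {
            this.hasNonNullValue = hasNonNullValue;
            this.numNulls = numNulls;
            this.valueCount = valueCount;
        }
    }

    // Hypothetical set of strategies a reader could pick from.
    enum ReadPlan { SKIP_ALL_NULL, READ_WITHOUT_NULL_CHECKS, READ_WITH_NULL_CHECKS }

    static ReadPlan plan(ChunkStats stats) {
        // Assumption: only trust "no non-null value" when the null count
        // accounts for every value; otherwise statistics may simply be missing.
        if (!stats.hasNonNullValue && stats.valueCount > 0
                && stats.numNulls == stats.valueCount) {
            return ReadPlan.SKIP_ALL_NULL;
        }
        // No nulls at all: the per-row null check can be dropped.
        if (stats.hasNonNullValue && stats.numNulls == 0) {
            return ReadPlan.READ_WITHOUT_NULL_CHECKS;
        }
        // Mixed or unknown content: fall back to the safe path.
        return ReadPlan.READ_WITH_NULL_CHECKS;
    }

    public static void main(String[] args) {
        System.out.println(plan(new ChunkStats(false, 100, 100)));
        System.out.println(plan(new ChunkStats(true, 0, 100)));
        System.out.println(plan(new ChunkStats(true, 7, 100)));
    }
}
```

Falling back to the null-checking path whenever the statistics look incomplete keeps the optimization safe at the cost of missing some chances to skip.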

This message was sent by Atlassian JIRA
