hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-8515) Column projection not being pushed to ORC delta files
Date Sat, 18 Oct 2014 18:42:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14176077#comment-14176077
] 

Alan Gates commented on HIVE-8515:
----------------------------------

The issue is in OrcInputFormat.getReader:
{code}
    if (split.hasBase()) {
      bucket = AcidUtils.parseBaseBucketFilename(split.getPath(), conf)
          .getBucket();
      reader = OrcFile.createReader(path, OrcFile.readerOptions(conf));
      final List<OrcProto.Type> types = reader.getTypes();
      setIncludedColumns(readOptions, types, conf, split.isOriginal());
      setSearchArgument(readOptions, types, conf, split.isOriginal());
    } else {
      bucket = (int) split.getStart();
      reader = null;
    }
}
{code}

setIncludeColumns is called if there is a base, but not if there isn't.

> Column projection not being pushed to ORC delta files
> -----------------------------------------------------
>
>                 Key: HIVE-8515
>                 URL: https://issues.apache.org/jira/browse/HIVE-8515
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.14.0
>            Reporter: Alan Gates
>            Assignee: Alan Gates
>
> Currently when only some columns are projected, that projection is pushed to the base
file but not to delta files.  This does not cause incorrect results (the columns are projected
out later in the query execution), but it is less efficient then it could be.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message