hawq-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Oleksandr Diachenko (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HAWQ-1446) Introduce vectorized profile for ORC
Date Sat, 29 Apr 2017 00:40:04 GMT
Oleksandr Diachenko created HAWQ-1446:

             Summary: Introduce vectorized profile for ORC
                 Key: HAWQ-1446
                 URL: https://issues.apache.org/jira/browse/HAWQ-1446
             Project: Apache HAWQ
          Issue Type: New Feature
          Components: PXF
            Reporter: Oleksandr Diachenko
            Assignee: Ed Espino

As for now, PXF has HiveORC format which supports reading ORC files with leveraging ORC's
file advantages(column projections, predicate push-downs).
 The only drawback of this profile - it still operates on a single row. So the idea is to
implement a profile for reading ORC files, but operate on batches of rows, which supposed
to be faster on reading and resolution phases.

This message was sent by Atlassian JIRA

View raw message