hawq-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Oleksandr Diachenko (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HAWQ-1446) Introduce vectorized profile for ORC
Date Sat, 29 Apr 2017 00:40:04 GMT
Oleksandr Diachenko created HAWQ-1446:
-----------------------------------------

             Summary: Introduce vectorized profile for ORC
                 Key: HAWQ-1446
                 URL: https://issues.apache.org/jira/browse/HAWQ-1446
             Project: Apache HAWQ
          Issue Type: New Feature
          Components: PXF
            Reporter: Oleksandr Diachenko
            Assignee: Ed Espino


As for now, PXF has HiveORC format which supports reading ORC files with leveraging ORC's
file advantages(column projections, predicate push-downs).
 The only drawback of this profile - it still operates on a single row. So the idea is to
implement a profile for reading ORC files, but operate on batches of rows, which supposed
to be faster on reading and resolution phases.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message