hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vaibhav Gumashta (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-14551) HiveServer2: Use vectorized data whenever available for writing final results
Date Tue, 16 Aug 2016 22:22:21 GMT
Vaibhav Gumashta created HIVE-14551:
---------------------------------------

             Summary: HiveServer2: Use vectorized data whenever available for writing final
results 
                 Key: HIVE-14551
                 URL: https://issues.apache.org/jira/browse/HIVE-14551
             Project: Hive
          Issue Type: Sub-task
          Components: HiveServer2
    Affects Versions: 2.1.0
            Reporter: Vaibhav Gumashta


In ThriftJDBCBinarySerde, which we are using in FileSinkOperator to write final results, we
buffer rows and store them into typed columns before writing a batch of rows to the result
file. However, when vectorized rows batches are available from higher level operators, we
should try to use them and avoid the extra penalty of converting from vector --> non-vector
single row --> buffered thrift columns (equivalent to vector).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message