hive-dev mailing list archives

From "Sarvesh Sakalanaga (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-4727) Optimize ORC StringTreeReader::nextVector to not create dictionary of strings for each call to nextVector
Date Thu, 13 Jun 2013 01:06:20 GMT
Sarvesh Sakalanaga created HIVE-4727:
----------------------------------------

             Summary: Optimize ORC StringTreeReader::nextVector to not create dictionary of
strings for each call to nextVector
                 Key: HIVE-4727
                 URL: https://issues.apache.org/jira/browse/HIVE-4727
             Project: Hive
          Issue Type: Sub-task
            Reporter: Sarvesh Sakalanaga
            Assignee: Sarvesh Sakalanaga
            Priority: Minor


Currently, ORC StringTreeReader::nextVector creates a dictionary of strings on every call.
This hurts performance, because each call allocates and then frees a large amount of memory.
Since the dictionary does not change within a stripe, StringTreeReader::nextVector should be
optimized to create the dictionary only once, when the stripe is read.
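
A minimal sketch of the idea, not the actual Hive reader code: the class name
CachedDictionaryReader, the startStripe hook, and the build counter are all hypothetical
and exist only for illustration. It assumes the dictionary entries arrive as UTF-8 bytes
and are decoded once per stripe, so that each nextVector call only resolves ids.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration of per-stripe dictionary caching.
// This is NOT the real StringTreeReader; all names here are assumptions.
class CachedDictionaryReader {
    private String[] dictionary;       // decoded once per stripe, reused across batches
    private int dictionaryBuilds = 0;  // counts how often the dictionary was decoded

    // Assumed hook: invoked once when the reader moves to a new stripe.
    void startStripe(byte[][] rawEntries) {
        dictionary = new String[rawEntries.length];
        for (int i = 0; i < rawEntries.length; i++) {
            dictionary[i] = new String(rawEntries[i], StandardCharsets.UTF_8);
        }
        dictionaryBuilds++;
    }

    // Invoked once per batch: looks ids up in the cached dictionary
    // and does not rebuild it.
    String[] nextVector(int[] ids) {
        String[] out = new String[ids.length];
        for (int i = 0; i < ids.length; i++) {
            out[i] = dictionary[ids[i]];
        }
        return out;
    }

    int getDictionaryBuilds() {
        return dictionaryBuilds;
    }
}
```

With this shape, several nextVector calls within one stripe share a single decoded
dictionary, and the decoding cost is paid again only when startStripe is called for the
next stripe.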

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
