hive-dev mailing list archives

From "Sarvesh Sakalanaga (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-4727) Optimize ORC StringTreeReader::nextVector to not create dictionary of strings for each call to nextVector
Date Thu, 13 Jun 2013 01:20:20 GMT

     [ https://issues.apache.org/jira/browse/HIVE-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sarvesh Sakalanaga updated HIVE-4727:
-------------------------------------

    Attachment: Hive-4727.0.patch

Patch available.
                
> Optimize ORC StringTreeReader::nextVector to not create dictionary of strings for each call to nextVector
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-4727
>                 URL: https://issues.apache.org/jira/browse/HIVE-4727
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Sarvesh Sakalanaga
>            Assignee: Sarvesh Sakalanaga
>            Priority: Minor
>         Attachments: Hive-4727.0.patch
>
>
> Currently, ORC StringTreeReader::nextVector creates a dictionary of strings on each call
> to nextVector. This hurts performance because a large amount of memory is allocated and
> deallocated on every call. Since the dictionary does not change within a stripe,
> StringTreeReader::nextVector should be optimized to create the dictionary only when a stripe is read.
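
The idea in the description can be illustrated with a minimal sketch. This is not the code from Hive-4727.0.patch and not Hive's actual StringTreeReader; the class `CachedDictionaryReader`, the methods `startStripe` and `getDictionaryBuilds`, and the in-memory inputs are all hypothetical, chosen only to show the dictionary being decoded once per stripe and reused by every nextVector call.

```java
import java.nio.charset.StandardCharsets;

/**
 * Hypothetical sketch of per-stripe dictionary caching.
 * It does not mirror Hive's real StringTreeReader or the ORC file layout.
 */
final class CachedDictionaryReader {
    private String[] dictionary = new String[0]; // decoded once per stripe
    private int[] rowIds = new int[0];           // dictionary index of each row in the stripe
    private int position = 0;                    // next row to hand out
    private int dictionaryBuilds = 0;            // counts decodes, for illustration only

    /** Assumed hook that runs once when a new stripe is opened. */
    void startStripe(byte[][] encodedEntries, int[] stripeRowIds) {
        dictionary = new String[encodedEntries.length];
        for (int i = 0; i < encodedEntries.length; i++) {
            dictionary[i] = new String(encodedEntries[i], StandardCharsets.UTF_8);
        }
        rowIds = stripeRowIds;
        position = 0;
        dictionaryBuilds++;
    }

    /** Returns up to batchSize values; only looks up the cached dictionary. */
    String[] nextVector(int batchSize) {
        int n = Math.min(batchSize, rowIds.length - position);
        String[] batch = new String[n];
        for (int i = 0; i < n; i++) {
            batch[i] = dictionary[rowIds[position + i]];
        }
        position += n;
        return batch;
    }

    int getDictionaryBuilds() {
        return dictionaryBuilds;
    }

    public static void main(String[] args) {
        CachedDictionaryReader reader = new CachedDictionaryReader();
        byte[][] entries = {
            "red".getBytes(StandardCharsets.UTF_8),
            "green".getBytes(StandardCharsets.UTF_8)
        };
        reader.startStripe(entries, new int[] {0, 1, 1, 0, 1});
        String[] batch;
        while ((batch = reader.nextVector(2)).length > 0) {
            System.out.println(String.join(",", batch));
        }
        System.out.println("dictionary builds: " + reader.getDictionaryBuilds());
    }
}
```

In this sketch the per-call cost of nextVector is limited to the output batch itself, while the dictionary allocation happens once in startStripe, regardless of how many batches the stripe yields.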

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
