hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject [jira] [Created] (HIVE-16663) String Caching For Rows
Date Fri, 12 May 2017 20:18:04 GMT
BELUGA BEHR created HIVE-16663:

             Summary: String Caching For Rows
                 Key: HIVE-16663
             Project: Hive
          Issue Type: Improvement
          Components: Beeline
    Affects Versions: 2.0.1
            Reporter: BELUGA BEHR
            Priority: Minor

It is very common that there are many repeated values in the result set of a query.  As it
currently stands, beeline does not attempt to cache any of these values and therefore it consumes
a lot of memory.

Adding a string cache may save a lot of memory.  There are organizations that use beeline
to perform ETL processing of result sets into CSV.  This will better support those organizations.

This message was sent by Atlassian JIRA

View raw message