hive-dev mailing list archives

From "Jason Dere (JIRA)" <>
Subject [jira] [Created] (HIVE-17611) Add new LazyBinary SerDe for faster writes
Date Tue, 26 Sep 2017 21:02:00 GMT
Jason Dere created HIVE-17611:

             Summary: Add new LazyBinary SerDe for faster writes
                 Key: HIVE-17611
             Project: Hive
          Issue Type: Improvement
          Components: Serializers/Deserializers
            Reporter: Jason Dere
            Assignee: Jason Dere

LazyBinarySerDe.serialize() ends up making getCategory()/getPrimitiveCategory() calls for
every column of every row. I tried some simple tests that eliminate these calls in the
non-vectorized version, and this looks like it speeds up writes by ~3x.
I am adding a LazyBinarySerDe2 class with this new implementation.
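
As a rough illustration of the idea only (this is not the LazyBinarySerDe2 code, and it does not
use the real LazyBinary wire format), the per-row type checks can be avoided by choosing one
writer per column when the serializer is set up. The names PrecomputedWriterSketch, Kind and
FieldWriter below are hypothetical and stand in for Hive's ObjectInspector categories. Null
handling is left out.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

public class PrecomputedWriterSketch {
    // Hypothetical stand-in for a column's primitive category.
    enum Kind { INT, LONG, STRING }

    // Writes one non-null field value; one instance is chosen per column.
    interface FieldWriter {
        void write(Object value, DataOutputStream out) throws IOException;
    }

    private final FieldWriter[] writers;

    // The type dispatch happens here, once per column, not once per row.
    PrecomputedWriterSketch(Kind[] columnKinds) {
        writers = new FieldWriter[columnKinds.length];
        for (int i = 0; i < columnKinds.length; i++) {
            writers[i] = writerFor(columnKinds[i]);
        }
    }

    private static FieldWriter writerFor(Kind kind) {
        switch (kind) {
            case INT:
                return (v, out) -> out.writeInt((Integer) v);
            case LONG:
                return (v, out) -> out.writeLong((Long) v);
            case STRING:
                return (v, out) -> {
                    // Illustrative encoding: 4-byte length, then UTF-8 bytes.
                    byte[] bytes = ((String) v).getBytes(StandardCharsets.UTF_8);
                    out.writeInt(bytes.length);
                    out.write(bytes);
                };
            default:
                throw new IllegalArgumentException("Unsupported kind: " + kind);
        }
    }

    // The per-row loop only invokes the writers selected in the constructor.
    byte[] serialize(Object[] row) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(buffer);
        try {
            for (int i = 0; i < writers.length; i++) {
                writers[i].write(row[i], out);
            }
            out.flush();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }
}
```

Whether this accounts for the ~3x figure above is not something the sketch can show; it only
demonstrates moving the category lookup out of the per-row path.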

This message was sent by Atlassian JIRA
