cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aleksey Yeschenko (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CASSANDRA-8959) More efficient frozen UDT and tuple serialization format
Date Thu, 12 Mar 2015 05:39:38 GMT
Aleksey Yeschenko created CASSANDRA-8959:
--------------------------------------------

             Summary: More efficient frozen UDT and tuple serialization format
                 Key: CASSANDRA-8959
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8959
             Project: Cassandra
          Issue Type: Improvement
            Reporter: Aleksey Yeschenko
             Fix For: 3.1


The current serialization format for UDTs has a fixed overhead of 4 bytes per defined field
(encoding the size of the field).

It is inefficient for sparse UDTs - ones with many defined fields, but few of them present.
We could keep a bitset to indicate the missing fields, if any.

It's sub-optimal for encoding UDTs with all the values present as well. We could use varint
encoding for the field sizes of blob/text fields and encode 'fixed' sized types directly,
without the 4-bytes size prologue.

That or something more brilliant. Any improvement right now is lhf.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message