cassandra-commits mailing list archives

From "Srikanth Seshadri (JIRA)" <>
Subject [jira] [Created] (CASSANDRA-7755) GZIPBase64 Validator
Date Tue, 12 Aug 2014 10:09:11 GMT
Srikanth Seshadri created CASSANDRA-7755:

             Summary: GZIPBase64 Validator
                 Key: CASSANDRA-7755
             Project: Cassandra
          Issue Type: Wish
            Reporter: Srikanth Seshadri
            Priority: Minor

I have implemented this extension.

Please let me know if you think it will be useful for others. If so, I will work on submitting
a patch.

Extension: GZIP-Base64 Datatype

To save space, we compress the text data stored in Text/UTF-8 columns in Cassandra. The text
is GZIPed and then Base64 encoded, and the result is stored in Cassandra.
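
As an illustration only, the write path described above could be sketched as follows. The class and method names are hypothetical and are not taken from the proposed patch; the sketch also assumes Java 8 or later for java.util.Base64.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper, not part of Cassandra or of the proposed patch.
class GzipBase64Encoder {

    // GZIP the UTF-8 bytes of the text, then Base64 encode the compressed bytes.
    static String encode(String clearText) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream gzip = new GZIPOutputStream(buffer)) {
            gzip.write(clearText.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            // Writing to an in-memory buffer should not fail; rethrow unchecked.
            throw new UncheckedIOException(e);
        }
        // The stream is closed at this point, so the GZIP trailer has been written.
        return Base64.getEncoder().encodeToString(buffer.toByteArray());
    }
}
```

The returned string is what would be written to the text column. Note that for very short values the GZIP header and the Base64 expansion can make the stored value larger than the original text.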

When we peek into the data using Cassandra-Cli, the values are no longer in clear text;
compression costs us that readability. I therefore added this extension, which tells Cassandra
that the data in a text column is GZIP-Base64 encoded. The extension decodes the value
and displays the result in clear text when queried.


Let’s assume that the address column of the employee column family holds compressed data.
Execute the following assume statement in Cassandra-Cli.

With this assumption, the output of the address column will be in clear text. The GZIPBASE64
type first detects whether the data is compressed and decodes it only if so. If the data is
not compressed, it is left unaltered.
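
A minimal sketch of such detect-then-decode behaviour is shown below. It is an assumption about how the check could work, not the actual GZIPBASE64 implementation: the class and method names are hypothetical, the detection relies on the standard GZIP magic bytes (0x1f 0x8b), and java.util.Base64 requires Java 8 or later.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.zip.GZIPInputStream;

// Hypothetical helper, not the validator from the proposed patch.
class GzipBase64Decoder {

    // Returns the clear text if the value is Base64-encoded GZIP data,
    // otherwise returns the stored value unchanged.
    static String decodeIfCompressed(String stored) {
        byte[] raw;
        try {
            raw = Base64.getDecoder().decode(stored);
        } catch (IllegalArgumentException notBase64) {
            return stored; // not valid Base64, so treat it as plain text
        }
        // Every GZIP stream starts with the magic bytes 0x1f 0x8b.
        if (raw.length < 2 || (raw[0] & 0xff) != 0x1f || (raw[1] & 0xff) != 0x8b) {
            return stored;
        }
        try (GZIPInputStream gzip = new GZIPInputStream(new ByteArrayInputStream(raw))) {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = gzip.read(chunk)) != -1) {
                out.write(chunk, 0, n);
            }
            return new String(out.toByteArray(), StandardCharsets.UTF_8);
        } catch (IOException corrupt) {
            return stored; // looked like GZIP but could not be inflated
        }
    }
}
```

With this approach a plain value such as "12 Main Street" fails the Base64 or magic-byte check and is returned as-is, which is why compressed and uncompressed columns can share the same type.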

Because of this detection, the user does not have to indicate that, in the above example,
only the address column is compressed and the others are not.


This message was sent by Atlassian JIRA
