orc-dev mailing list archives

From "Owen O'Malley (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ORC-250) Create sha256 mask
Date Fri, 13 Oct 2017 00:09:00 GMT
Owen O'Malley created ORC-250:

             Summary: Create sha256 mask
                 Key: ORC-250
                 URL: https://issues.apache.org/jira/browse/ORC-250
             Project: ORC
          Issue Type: Sub-task
            Reporter: Owen O'Malley

We should also create a DataMask that does sha256 of the data:

* strings should be the sha256 of the string's utf-8 representation, written as hex digits
* binary should be the sha256 of the bytes, kept as binary
* integer types should be the sha256 of the number's little endian representation, cut down to
the right size (1, 2, 4, or 8 bytes)
* floating point types should be the sha256 of the binary representation as either 4 (float) or
8 (double) bytes
* timestamps and dates should convert like integers
* decimal should convert like 128 bit numbers with the result cut to the matching number of
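
The string, binary, integer, and floating point rules above could be sketched roughly as follows. This is only an illustration in Python, not the ORC DataMask API: the function names are hypothetical, "cut down to the right size" is assumed to mean keeping the first N bytes of the digest and reading them back as a signed little endian integer, and little endian is assumed for the floating point bytes since the issue does not name a byte order. The float helper returns the raw digest because the issue does not say how it maps back to a float. Decimals are left out.

```python
import hashlib
import struct

# struct formats for signed little endian integers, keyed by width in bytes
_INT_FORMATS = {1: "<b", 2: "<h", 4: "<i", 8: "<q"}
# struct formats for IEEE 754 values; little endian is an assumption here
_FLOAT_FORMATS = {4: "<f", 8: "<d"}


def mask_string(value: str) -> str:
    """sha256 of the UTF-8 bytes, returned as lowercase hex digits."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()


def mask_binary(value: bytes) -> bytes:
    """sha256 of the bytes, returned as the raw 32-byte digest."""
    return hashlib.sha256(value).digest()


def mask_integer(value: int, width: int) -> int:
    """Hash the little endian bytes of value, then keep `width` bytes.

    Assumption: truncation keeps the leading bytes of the digest and
    reinterprets them as a signed little endian integer of that width.
    """
    fmt = _INT_FORMATS[width]
    digest = hashlib.sha256(struct.pack(fmt, value)).digest()
    return struct.unpack(fmt, digest[:width])[0]


def hash_float(value: float, width: int) -> bytes:
    """sha256 of the 4-byte (float) or 8-byte (double) representation.

    Returns the raw digest; converting it back to a floating point
    value is not specified in the issue, so it is not attempted here.
    """
    return hashlib.sha256(struct.pack(_FLOAT_FORMATS[width], value)).digest()
```

Under these assumptions, mask_string("abc") returns the well-known digest ba7816bf8f01cfea414140de5dae2223b00361a396177a9cb410ff61f20015ad, and a timestamp or date would presumably be passed to mask_integer using its integer representation.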

It isn't clear what we should do for the very small data types:
* boolean
* byte
* short

I'd lean toward either making them null or passing them through unchanged.
