arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Julien Le Dem (JIRA)" <>
Subject [jira] [Created] (ARROW-255) Finalize Dictionary representation
Date Wed, 10 Aug 2016 22:01:20 GMT
Julien Le Dem created ARROW-255:

             Summary: Finalize Dictionary representation
                 Key: ARROW-255
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Format
            Reporter: Julien Le Dem

format/Messages.fbs mentions DictionaryBatches with an id but does not specify where they
are referenced.

We should add a {{dictionary: long}} in Field that references the dictionary id:


Dictionary id:

We need a spec in format/ that describes the dictionary layout.
When dictionary encoded the value vector is an array of unsigned int32.
The dictionary vector is a Vector of the type of the value. indexed by their id in the dictionary.

This message was sent by Atlassian JIRA

View raw message