arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Emilio Lahr-Vivaz (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ARROW-542) [Java] Implement dictionaries in stream/file encoding
Date Wed, 08 Feb 2017 17:40:42 GMT

    [ https://issues.apache.org/jira/browse/ARROW-542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15858295#comment-15858295
] 

Emilio Lahr-Vivaz commented on ARROW-542:
-----------------------------------------

[~wesmckinn] I'm looking into how dictionary vectors will be encoded in the file format. In
the current message definitions, it appears dictionary batches are distinct from regular batches,
and have an ID associated with them: https://github.com/apache/arrow/blob/b99d049c3d1894908b7e52774eb657675dc1f439/format/Message.fbs#L284
Wouldn't the dictionary already be defined by the Field? I'm unclear what the ID in the DictionaryBatch
is supposed to represent.
Thanks,

> [Java] Implement dictionaries in stream/file encoding
> -----------------------------------------------------
>
>                 Key: ARROW-542
>                 URL: https://issues.apache.org/jira/browse/ARROW-542
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Java - Vectors
>            Reporter: Emilio Lahr-Vivaz
>            Assignee: Emilio Lahr-Vivaz
>




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message