tajo-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TAJO-2189) Dictionary encoded text in ORC scanner may cause incorrect result.
Date Fri, 13 Jul 2018 01:40:00 GMT

    [ https://issues.apache.org/jira/browse/TAJO-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16542402#comment-16542402
] 

ASF GitHub Bot commented on TAJO-2189:
--------------------------------------

GitHub user jinossy opened a pull request:

    https://github.com/apache/tajo/pull/1055

    Dictionary encoded text in ORC scanner may cause incorrect result.

    See the issue:
    https://issues.apache.org/jira/browse/TAJO-2189

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jinossy/tajo TAJO-2189

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tajo/pull/1055.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1055
    
----
commit 6f1788425a7231adb661745d0e554955f5ef787f
Author: Jinho Kim <jhkim@...>
Date:   2018-07-13T01:36:17Z

    TAJO-2189: Dictionary encoded text in ORC scanner may cause incorrect result.

----


> Dictionary encoded text in ORC scanner may cause incorrect result.
> ------------------------------------------------------------------
>
>                 Key: TAJO-2189
>                 URL: https://issues.apache.org/jira/browse/TAJO-2189
>             Project: Tajo
>          Issue Type: Bug
>          Components: Orc, Storage
>    Affects Versions: 0.12.0, 0.11.2
>            Reporter: Jinho Kim
>            Assignee: Jinho Kim
>            Priority: Critical
>
> See the title. {{org.apache.hadoop.io.Text:getBytes()}} returns internal raw bytes. It
should use with {{getLength()}}
> {code:java}
> /**
>  * Returns the raw bytes; however, only data up to {@link #getLength()} is
>  * valid. Please use {@link #copyBytes()} if you
>  * need the returned array to be precisely the length of the data.
>  */
> @Override
> public byte[] getBytes() {
>   return bytes;
> }
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message