orc-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gang Wu (JIRA)" <j...@apache.org>
Subject [jira] [Created] (ORC-412) [C++] ORC: Char(n) and Varchar(n) writers truncate to n bytes & corrupts multi-byte data
Date Tue, 09 Oct 2018 07:05:00 GMT
Gang Wu created ORC-412:
---------------------------

             Summary: [C++] ORC: Char(n) and Varchar(n) writers truncate to n bytes &
corrupts multi-byte data
                 Key: ORC-412
                 URL: https://issues.apache.org/jira/browse/ORC-412
             Project: ORC
          Issue Type: Bug
    Affects Versions: 1.5.2
            Reporter: Gang Wu
            Assignee: Gopal V
             Fix For: 1.6.0, 1.5.3


https://github.com/apache/orc/blob/master/java/core/src/java/org/apache/orc/impl/writer/CharTreeWriter.java#L41

{code}
    itemLength = schema.getMaxLength();
    padding = new byte[itemLength];
  }
{code}

https://github.com/apache/orc/blob/master/java/core/src/java/org/apache/orc/impl/writer/VarcharTreeWriter.java#L48

{code}
      if (vector.noNulls || !vector.isNull[0]) {
        int itemLength = Math.min(vec.length[0], maxLength);
{code}





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message