cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jonathan Ellis (JIRA)" <j...@apache.org>
Subject [jira] Commented: (CASSANDRA-1235) BytesType and batch mutate causes encoded bytes of non-printable characters to be dropped
Date Thu, 08 Jul 2010 15:29:52 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-1235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12886346#action_12886346
] 

Jonathan Ellis commented on CASSANDRA-1235:
-------------------------------------------

I believe that

                return new String(buffer, 0, len);

will treat buffer as UTF-16, not UTF-8.  you want

                return new String(buffer, 0, len, "UTF8");

I'm not at all sure that longToPrefixCoded is going to generate valid UTF-8, either.

> BytesType and batch mutate causes encoded bytes of non-printable characters to be dropped
> -----------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-1235
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1235
>             Project: Cassandra
>          Issue Type: Bug
>    Affects Versions: 0.6
>         Environment: Java 1.6 sun JDK 
> Java(TM) SE Runtime Environment (build 1.6.0_20-b02)
> Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, 
> Ubuntu 10.04 64 bit
>            Reporter: Todd Nine
>            Priority: Critical
>             Fix For: 0.6.4
>
>         Attachments: TestEncodedKeys.java
>
>
> When running the two tests, individual column insert works with the values generated.
 However, batch insert with the same values causes an encoding failure on the key.  It appears
bytes are dropped from the end of the byte array that represents the key value.  See the attached
unit test

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message