drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Parth Chandra (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-3403) <Unicode 6 digit escape value> handled incorrectly
Date Thu, 02 Jul 2015 18:43:04 GMT

     [ https://issues.apache.org/jira/browse/DRILL-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Parth Chandra updated DRILL-3403:
---------------------------------
    Fix Version/s:     (was: 1.2.0)
                   1.4.0

> <Unicode 6 digit escape value> handled incorrectly
> --------------------------------------------------
>
>                 Key: DRILL-3403
>                 URL: https://issues.apache.org/jira/browse/DRILL-3403
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Data Types
>            Reporter: Daniel Barclay (Drill)
>            Assignee: Daniel Barclay (Drill)
>             Fix For: 1.4.0
>
>
> The {{<Unicode 6 digit escape value>}} syntax (e.g., U&'$+000000' UESCAPE '$')
is not handled correctly.
> In particular, the parser doesn't seem to recognize that the first character after the
escape character is a "{{+}}", it takes the first three hex digits and decodes them into a
character, and then it takes the next three hex-digit characters as plain characters.
> In the following, note how the part with a backslash followed by the {{+000043}} part
yields a NULL (as evidenced by the unaligned trailing vertical bar) and "043" instead of yielding
"C":
> {noformat}
> 0: jdbc:drill:zk=local> SELECT  U&'\0041 2 \+000043'  UESCAPE '\' FROM INFORMATION_SCHEMA.CATALOGS;
> +-----------+
> |  EXPR$0   |
> +-----------+
> | A 2 043  |
> +-----------+
> 1 row selected (0.253 seconds)
> 0: jdbc:drill:zk=local> 
> {noformat}
> (This means that Drill can't accept character string literals containing characters beyond
code point U+00FFFF.)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message