drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt Welsh (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (DRILL-4149) Escape Character Not Used for TSVs
Date Tue, 01 Dec 2015 21:56:10 GMT

     [ https://issues.apache.org/jira/browse/DRILL-4149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Matt Welsh updated DRILL-4149:
------------------------------
    Description: 
Escape Character does not escape tabs in TSVs

For instance query:

select * from df.`bug.tsz`;

With Storage Format configured as:
    "tsv": {
      "type": "text",
      "extensions": [
        "tsv"
      ],
      "escape": "\\",
      "delimiter": "\t"
    },
bug.tsv file:
testval	1	2	3	sometext
testval	4	5	6	some text with a tab between here\	here

This returns 5 columns for first and 6 for second.  Should be 5 for both.

  was:
Escape Character does not escape tabs in TSVs

For instance query:

With Storage Format configured as:
    "tsv": {
      "type": "text",
      "extensions": [
        "tsv"
      ],
      "escape": "\\",
      "delimiter": "\t"
    },
File:
testval	1	2	3	sometext
testval	4	5	6	some text with a tab between here\	here

This returns 5 columns for first and 6 for second.  Should be 5 for both.


> Escape Character Not Used for TSVs
> ----------------------------------
>
>                 Key: DRILL-4149
>                 URL: https://issues.apache.org/jira/browse/DRILL-4149
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Data Types
>    Affects Versions: 1.3.0
>            Reporter: Matt Welsh
>            Priority: Minor
>
> Escape Character does not escape tabs in TSVs
> For instance query:
> select * from df.`bug.tsz`;
> With Storage Format configured as:
>     "tsv": {
>       "type": "text",
>       "extensions": [
>         "tsv"
>       ],
>       "escape": "\\",
>       "delimiter": "\t"
>     },
> bug.tsv file:
> testval	1	2	3	sometext
> testval	4	5	6	some text with a tab between here\	here
> This returns 5 columns for first and 6 for second.  Should be 5 for both.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message