hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael Chirico (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-5999) Allow other characters for LINES TERMINATED BY
Date Sun, 24 Mar 2019 03:37:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-5999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16799897#comment-16799897
] 

Michael Chirico commented on HIVE-5999:
---------------------------------------

[~nemon] what ever happened to the patch here? This would still be an incredibly useful feature
as embedded newlines are of course ubiquitous for day-to-day data tasks. Any free text field
is bound to have embedded newlines, given enough observations...

> Allow other characters for LINES TERMINATED BY 
> -----------------------------------------------
>
>                 Key: HIVE-5999
>                 URL: https://issues.apache.org/jira/browse/HIVE-5999
>             Project: Hive
>          Issue Type: Improvement
>          Components: Beeline, Database/Schema, Hive
>    Affects Versions: 0.12.0
>            Reporter: Mariano Dominguez
>            Assignee: Nemon Lou
>            Priority: Critical
>              Labels: Delimiter, Hive, Row, SerDe
>         Attachments: HIVE-5999.1.patch, HIVE-5999.patch
>
>
> LINES TERMINATED BY only supports newline '\n' right now.
> It would be nice to loosen this constraint and allow other characters.
> This limitation seems to be hardcoded here:
> https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java#L171
> The DDL Definition on the Hive Language manual shows this as a configurable property
whereas it is not. This may lead to mileading assement of being able to choose a choice of
field delimiter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message