hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amareshwari Sriramadasu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-2303) files with control-A,B are not delimited correctly.
Date Thu, 25 Aug 2011 06:24:29 GMT

     [ https://issues.apache.org/jira/browse/HIVE-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Amareshwari Sriramadasu updated HIVE-2303:
------------------------------------------

    Status: Patch Available  (was: Open)

> files with control-A,B are not delimited correctly.
> ---------------------------------------------------
>
>                 Key: HIVE-2303
>                 URL: https://issues.apache.org/jira/browse/HIVE-2303
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Amareshwari Sriramadasu
>            Assignee: Amareshwari Sriramadasu
>             Fix For: 0.8.0
>
>         Attachments: patch-2303-2.txt, patch-2303-3.txt, patch-2303-4.txt, patch-2303.txt
>
>
> The following is from one of our users:
>  
> create external table impressions (imp string, msg string)
>   row format delimited
>     fields terminated by '\t'
>     lines terminated by '\n'
>   stored as textfile                 
>   location '/xxx';
>  
> Some strings in my data contains Control-A, Control-B etc as internal delimiters.  If
I do a
>  
> Select * from impressions limit 10;
>  
> All fields were able to print correctly.  However if I do a
>  
> Select * from impressions where msg regexp '.*' limit 10;
>  
> The fields were broken by the control characters.  The difference between the 2 commands
is that the latter requires a map-reduce job.  
>  

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message