hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aihua Xu (JIRA)" <>
Subject [jira] [Created] (HIVE-11785) Carriage return and new line are processed differently when hive.fetch.task.conversion is set to none
Date Thu, 10 Sep 2015 13:38:45 GMT
Aihua Xu created HIVE-11785:

             Summary: Carriage return and new line are processed differently when hive.fetch.task.conversion
is set to none
                 Key: HIVE-11785
             Project: Hive
          Issue Type: Bug
          Components: Query Processor
    Affects Versions: 2.0.0
            Reporter: Aihua Xu
            Assignee: Aihua Xu

Create the table and perform the queries as follows. You will see different results when the
setting changes. Seems both present incorrect results.

hive> create table repo (lvalue int, charstring string) stored as parquet;
Time taken: 0.34 seconds
hive> load data inpath '/tmp/repo/test.parquet' overwrite into table repo;
Loading data to table default.repo
chgrp: changing ownership of 'hdfs://nameservice1/user/hive/warehouse/repo/test.parquet':
User does not belong to hive
Table default.repo stats: [numFiles=1, numRows=0, totalSize=610, rawDataSize=0]
Time taken: 0.732 seconds
hive> set hive.fetch.task.conversion=more;
hive> select * from repo;
1	newline
here	carriage return
3	both
Time taken: 0.253 seconds, Fetched: 3 row(s)
hive> set hive.fetch.task.conversion=none;
hive> select * from repo;
Query ID = root_20150909113535_e081db8b-ccd9-4c44-aad9-d990ffb8edf3
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_1441752031022_0006, Tracking URL =
Kill Command = /opt/cloudera/parcels/CDH-5.4.5-1.cdh5.4.5.p0.7/lib/hadoop/bin/hadoop job 
-kill job_1441752031022_0006
Hadoop job information for Stage-1: number of mappers: 1; number of reducers: 0
2015-09-09 11:35:54,127 Stage-1 map = 0%,  reduce = 0%
2015-09-09 11:36:04,664 Stage-1 map = 100%,  reduce = 0%, Cumulative CPU 2.98 sec
MapReduce Total cumulative CPU time: 2 seconds 980 msec
Ended Job = job_1441752031022_0006
MapReduce Jobs Launched:
Stage-Stage-1: Map: 1   Cumulative CPU: 2.98 sec   HDFS Read: 4251 HDFS Write: 51 SUCCESS
Total MapReduce CPU Time Spent: 2 seconds 980 msec
1	newline
2	carriage return
3	both
Time taken: 25.131 seconds, Fetched: 6 row(s)

This message was sent by Atlassian JIRA

View raw message