carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vandana Yadav (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CARBONDATA-1845) Incorrect output on presto CLI while executing OR operator with multiple load
Date Mon, 04 Dec 2017 07:10:01 GMT
Vandana Yadav created CARBONDATA-1845:
-----------------------------------------

             Summary: Incorrect output on presto CLI while executing OR operator with multiple
load
                 Key: CARBONDATA-1845
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1845
             Project: CarbonData
          Issue Type: Bug
          Components: presto-integration
    Affects Versions: 1.3.0
         Environment: spark 2.1
            Reporter: Vandana Yadav
            Priority: Minor
         Attachments: 2000_UniqData.csv, 3000_UniqData.csv, 4000_UniqData.csv, 5000_UniqData.csv,
6000_UniqData.csv, 7000_UniqData;delm.csv

Incorrect output on presto CLI while executing OR operator with multiple load

Steps to reproduce:
On beeline:
1) Create a table:
CREATE TABLE uniqdata_time (CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION string, DOB timestamp,
DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10),
DECIMAL_COLUMN2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1
int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES ("TABLE_BLOCKSIZE"= "256 MB")

2)Load Data with multiple segments in it: 
a) LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/2000_UniqData.csv' into table uniqdata_time
OPTIONS('DELIMITER'=',' , 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')
b) LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/3000_UniqData.csv' into table uniqdata_time
OPTIONS('DELIMITER'=',' , 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')

c) LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/4000_UniqData.csv' into table uniqdata_time
OPTIONS('DELIMITER'=',' , 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')

d) LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/5000_UniqData.csv' into table uniqdata_time
OPTIONS('DELIMITER'=',' , 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')

e) LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/6000_UniqData.csv' into table uniqdata_time
OPTIONS('DELIMITER'=',' , 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')

f) LOAD DATA INPATH 'hdfs://localhost:54310/Data/uniqdata/7000_UniqData.csv' into table uniqdata_time
OPTIONS('DELIMITER'=',' , 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')

3) Execute Query:
select * from uniqdata_time where DOB = cast('2015-10-04 01:00:03' as timestamp) or DOB =
cast('2015-10-07 01:00:03' as timestamp)

Output on beeline:
--+-------------------------+-------------------------+----------------------+-----------------------+------------------+--+
| CUST_ID  |    CUST_NAME     |    ACTIVE_EMUI_VERSION     |          DOB           |    
     DOJ           | BIGINT_COLUMN1  | BIGINT_COLUMN2  |     DECIMAL_COLUMN1     |     DECIMAL_COLUMN2
    |    Double_COLUMN1    |    Double_COLUMN2     | INTEGER_COLUMN1  |
+----------+------------------+----------------------------+------------------------+------------------------+-----------------+-----------------+-------------------------+-------------------------+----------------------+-----------------------+------------------+--+
| 25712    | CUST_NAME_16712  | ACTIVE_EMUI_VERSION_16712  | 2015-10-04 01:00:03.0  | 2015-10-04
02:00:03.0  | 123372053566    | -223372020142   | 12345695613.1234000000  | 22345695613.1234000000
 | 1.12345674897976E10  | -1.12345674897976E10  | 16713            |
| 25715    | CUST_NAME_16715  | ACTIVE_EMUI_VERSION_16715  | 2015-10-07 01:00:03.0  | 2015-10-07
02:00:03.0  | 123372053569    | -223372020139   | 12345695616.1234000000  | 22345695616.1234000000
 | 1.12345674897976E10  | -1.12345674897976E10  | 16716 

4) Output On Presto CLI:
a)Expected Output on Presto: Similar to the output on beeline

b) Actual output on Presto:

 cust_id | cust_name | active_emui_version | dob | doj | bigint_column1 | bigint_column2 |
decimal_column1 | decimal_column2 | double_column1 |
---------+-----------+---------------------+-----+-----+----------------+----------------+-----------------+-----------------+----------------+
(0 rows)
(END)





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message