carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vandana Yadav (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CARBONDATA-979) Incorrect result displays to user in presto integration as compare to CarbonData.
Date Mon, 24 Apr 2017 12:49:04 GMT
Vandana Yadav created CARBONDATA-979:
----------------------------------------

             Summary: Incorrect result displays to user in presto integration as compare to
CarbonData.
                 Key: CARBONDATA-979
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-979
             Project: CarbonData
          Issue Type: Bug
          Components: data-query, presto-integration
    Affects Versions: 1.1.0
         Environment: Spark 2.1,Presto 0.66
            Reporter: Vandana Yadav
            Priority: Minor
         Attachments: 2000_UniqData.csv

Incorrect result displays to user in presto integration as compare to CarbonData (As in Carbondata
our result set include null values but in presto it exclude those).

Steps to reproduce :

1. In CarbonData:

a) Create table:
CREATE TABLE uniqdata (CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION string, DOB timestamp,
DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10),
DECIMAL_COLUMN2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1
int) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES ("TABLE_BLOCKSIZE"= "256 MB");


b) Load data : 
LOAD DATA INPATH 'hdfs://localhost:54310/2000_UniqData.csv' into table uniqdata OPTIONS('DELIMITER'=','
, 'QUOTECHAR'='"','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID,CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1');


2. In presto 
a) Execute the query: 
select CUST_NAME from uniqdata where CUST_NAME !='CUST_NAME_01844' order by CUST_NAME
 expected result : it should display all cust_name except "cust_name_01844"

Actual result: 
In CarbonData:
"| CUST_NAME_01995  |
| CUST_NAME_01996  |
| CUST_NAME_01997  |
| CUST_NAME_01998  |
| CUST_NAME_01999  |
+------------------+--+
2,012 rows selected (1.777 seconds)
"
In presto:
"CUST_NAME_01997 
 CUST_NAME_01998 
 CUST_NAME_01999 
(2000 rows)

Query 20170418_105903_00012_disp5, FINISHED, 1 node
Splits: 18 total, 18 done (100.00%)
3:21 [2.01K rows, 1.97KB] [10 rows/s, 10B/s]
"





--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message