carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Liang Chen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CARBONDATA-1920) Sparksql query result is not same as presto on same sql
Date Fri, 22 Dec 2017 02:44:00 GMT

    [ https://issues.apache.org/jira/browse/CARBONDATA-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16300897#comment-16300897
] 

Liang Chen commented on CARBONDATA-1920:
----------------------------------------

Have you used carbondata 1.3.0-master code and spark 2.1 to test ?
Same issues ?

Regards
Liang

> Sparksql query result is not same as presto on same sql 
> --------------------------------------------------------
>
>                 Key: CARBONDATA-1920
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1920
>             Project: CarbonData
>          Issue Type: Bug
>          Components: presto-integration
>    Affects Versions: 1.3.0
>         Environment: spark2.1,presto 0.187
>            Reporter: anubhav tarar
>            Assignee: anubhav tarar
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> i  use carbondata version is 1.2.0 and spark version is 1.6.0.
>      in my test case
>    1.Creating a Table
>   cc.sql("create table IF NOT EXISTS  test.table5(id string,name
> String,city String,age int) stored by 'carbondata' *
> tblproperties('DICTIONARY_INCLUDE' = 'age')* ")
>   2.load csv data into table,data like this:
>     id,name,city,age
>     1,david,shenzhen,31
>     88,eason,shenzhen,27
>     3,jarry,wuhan,35
>    3.select from sparksql,result is :
>   +-----+--------+-----------+------+--+
> | id  |  name  |   city    | age  |
> +-----+--------+-----------+------+--+
> | 1   | david  | shenzhen  | 31   |
> | 3   | jarry  | wuhan     | 35   |
> | 88  | eason  | shenzhen  | 27   |
> +-----+--------+-----------+------+--+
>    this result is correct
>  4.select from presto,result is:
>  id | name  |   city   | age
> ----+-------+----------+-----
>  1  | david | shenzhen |   3
>  3  | jarry | wuhan    |   4
>  88 | eason | shenzhen |   2
> (3 rows)
>   look at the age filed,is wrong
> I know why this happens because I used dictionary encoding in the age field。



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message