phoenix-dev mailing list archives

From "Csaba Skrabak (JIRA)" <j...@apache.org>
Subject [jira] [Created] (PHOENIX-4568) Duplicate entries in the GroupBy structure when running AggregateIT.testTrimDistinct
Date Tue, 30 Jan 2018 16:18:00 GMT
Csaba Skrabak created PHOENIX-4568:
--------------------------------------

             Summary: Duplicate entries in the GroupBy structure when running AggregateIT.testTrimDistinct
                 Key: PHOENIX-4568
                 URL: https://issues.apache.org/jira/browse/PHOENIX-4568
             Project: Phoenix
          Issue Type: Bug
    Affects Versions: 4.12.0
         Environment: minicluster
            Reporter: Csaba Skrabak
            Assignee: Csaba Skrabak
             Fix For: 4.14.0


From sme-hbase hipchat room:
Pulkit Bhardwaj·10:31

I'm seeing a weird issue with Phoenix and would appreciate some thoughts.

I created a simple table in Phoenix:
{noformat}
0: jdbc:phoenix:> create table test_select(nam VARCHAR(20), address VARCHAR(20), id BIGINT
. . . . . . . . > constraint my_pk primary key (id));

0: jdbc:phoenix:> upsert into test_select (nam, address,id) values('pulkit','badaun',1);

0: jdbc:phoenix:> select * from test_select;
+---------+----------+-----+
|   NAM   | ADDRESS  | ID  |
+---------+----------+-----+
| pulkit  | badaun   | 1   |
+---------+----------+-----+


0: jdbc:phoenix:> select distinct 'harshit' as "test_column", nam from test_select;
+--------------+---------+
| test_column  |   NAM   |
+--------------+---------+
| harshit      | pulkit  |
+--------------+---------+


0: jdbc:phoenix:> select distinct 'harshit' as "test_column", trim(nam), trim(nam) from test_select;
+--------------+----------------+----------------+
| test_column  |   TRIM(NAM)    |   TRIM(NAM)    |
+--------------+----------------+----------------+
| harshit      | pulkitpulkit   | pulkitpulkit   |
+--------------+----------------+----------------+
{noformat}

When I apply trim to the nam column and select it more than once, the cell data in the output is duplicated!
{noformat}
0: jdbc:phoenix:> select distinct 'harshit' as "test_column", trim(nam), trim(nam), trim(nam) from test_select;
+--------------+-----------------------+-----------------------+-----------------------+
| test_column  |       TRIM(NAM)       |       TRIM(NAM)       |       TRIM(NAM)       |
+--------------+-----------------------+-----------------------+-----------------------+
| harshit      | pulkitpulkitpulkit    | pulkitpulkitpulkit    | pulkitpulkitpulkit    |
+--------------+-----------------------+-----------------------+-----------------------+
{noformat}

Has anyone seen this before?

One thing to note: if I remove the distinct 'harshit' as "test_column" part, the issue is not seen:
{noformat}
0: jdbc:phoenix:> select trim(nam), trim(nam), trim(nam) from test_select;
+------------+------------+------------+
| TRIM(NAM)  | TRIM(NAM)  | TRIM(NAM)  |
+------------+------------+------------+
| pulkit     | pulkit     | pulkit     |
+------------+------------+------------+
{noformat}
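
The transcripts above seem to follow a simple pattern, sketched here in Python purely as an illustration. This is not Phoenix code; the function name and parameters are hypothetical, and the rule is extrapolated from only the cases shown (two and three occurrences of trim(nam) with the distinct literal, three occurrences without it). It describes the symptom, not the cause.

```python
def observed_trim_cell(value: str, occurrences: int, distinct_with_literal: bool) -> str:
    """Cell content seen in each TRIM(NAM) column of the transcripts above.

    value                 -- the stored NAM value, e.g. 'pulkit'
    occurrences           -- how many times trim(nam) appears in the select list
    distinct_with_literal -- True when the query starts with
                             distinct 'harshit' as "test_column"
    """
    trimmed = value.strip()
    if distinct_with_literal:
        # Symptom from the report: the value shows up once per occurrence.
        return trimmed * occurrences
    # Without the distinct literal, the output is the plain trimmed value.
    return trimmed


# The cases taken from the transcripts.
print(observed_trim_cell("pulkit", 2, True))
print(observed_trim_cell("pulkit", 3, True))
print(observed_trim_cell("pulkit", 3, False))
```

The expected result in every case is the plain trimmed value, so any output where the first two calls differ from the third corresponds to the bug described here.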



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
