carbondata-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From ravipesala <>
Subject [GitHub] incubator-carbondata issue #644: [CARBONDATA-757]Big decimal optimization
Date Fri, 10 Mar 2017 15:58:46 GMT
Github user ravipesala commented on the issue:
    Test results witj 100 million data 
    CREATE TABLE perftesta (c1 string,c2 string,c3 string,c4 string,c5 string,c6 bigint,c7
decimal(7,2),c8 int,c9 decimal(7,2),c10 decimal(15,2)) STORED BY 'carbondata'
    Q1 -> SELECT count(c1),count(c2),count(c3),count(c4),count(c5),count(c6),count(c7),count(c8),count(c9),count(c10)
FROM perftesta99;
    Q2 -> SELECT sum(c7), sum(c8), sum(9), sum(c10) FROM perftesta99 WHERE c2="P2_75" and
    Q3 -> SELECT c2, c5, count(distinct c1), sum(c7) FROM perftesta99 WHERE c4="P4_4" and
c5="P5_7" GROUP BY c2, c5;
    **Master Code**
    Load time  -> 576 seconds
    Data size after load  -> 1800MB
    Query(first_reading, second_reading)
    Q1(25.27, 21.794)
    Q2(27.296, 28.21)
    Q3(7.383, 5.103)
    **This PR Code**
    Load time -> 431 seconds
    Data size after load  -> 1720MB
    Query(first_reading, second_reading)
    Q2(24.102, 23.322)

If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at or file a JIRA ticket
with INFRA.

View raw message