drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rahul Challapalli (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-1652) flatten causing the sqlline to hang indefinitely when dealing with 0.1 GB data
Date Fri, 07 Nov 2014 01:37:34 GMT
Rahul Challapalli created DRILL-1652:
----------------------------------------

             Summary: flatten causing the sqlline to hang indefinitely when dealing with 0.1
GB data
                 Key: DRILL-1652
                 URL: https://issues.apache.org/jira/browse/DRILL-1652
             Project: Apache Drill
          Issue Type: Bug
          Components: Functions - Drill, Storage - JSON
            Reporter: Rahul Challapalli


git.commit.id.abbrev=60aa446

I ran the below test against the private branch of Jason which has some patches for bugs related
to flatten which are not yet merged into the master.

I used the below json record and copied it over 100000 times

{code}
{
    "uid" : 1,
    "events" : [
        { "evnt_id":"e1", "campaign_id":"c1", "event_name":"e1_name", "event_time":1000000},
        { "evnt_id":"e2", "campaign_id":"c1", "event_name":"e2_name", "event_time":2000000},
        { "evnt_id":"e3", "campaign_id":"c1", "event_name":"e3_name", "event_time":3000000},
        { "evnt_id":"e4", "campaign_id":"c1", "event_name":"e4_name", "event_time":4000000},
        { "evnt_id":"e5", "campaign_id":"c2", "event_name":"e5_name", "event_time":5000000},
        { "evnt_id":"e6", "campaign_id":"c1", "event_name":"e6_name", "event_time":6000000},
        { "evnt_id":"e7", "campaign_id":"c1", "event_name":"e7_name", "event_time":7000000},
        { "evnt_id":"e8", "campaign_id":"c2", "event_name":"e8_name", "event_time":8000000},
        { "evnt_id":"e9", "campaign_id":"c2", "event_name":"e9_name", "event_time":9000000}
    ],
   "transactions" : [
        { "trans_id":"t1", "amount":100, "trans_time":7777777},
        { "trans_id":"t2", "amount":100, "trans_time":8888888}
    ]
}
{code}

The below query which applies a flatten function hangs after returning a few thousand records
{code}
 select uid, flatten(transactions) from `json_kvgenflatten/transactions.json`;
{code}

Select * from the file also fails after displaying a bunch of records
{code}
select * from `json_kvgenflatten/transactions.json`;
{code}

The below query works
{code}
select uid from `json_kvgenflatten/transactions.json`;
{code}

I attached the error logs for the 2 failing queries



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message