drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Khurram Faraaz (JIRA)" <j...@apache.org>
Subject [jira] [Created] (DRILL-3633) FIRST_VALUE , LAST_VALUE functions take too long to complete
Date Wed, 12 Aug 2015 18:51:45 GMT
Khurram Faraaz created DRILL-3633:
-------------------------------------

             Summary: FIRST_VALUE , LAST_VALUE functions take too long to complete
                 Key: DRILL-3633
                 URL: https://issues.apache.org/jira/browse/DRILL-3633
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Flow
    Affects Versions: 1.2.0
         Environment: private-branch https://github.com/adeneche/incubator-drill/tree/new-window-funcs
            Reporter: Khurram Faraaz
            Assignee: Chris Westin


Query that uses FIRST_VALUE function takes twelve minutes to complete on developers private
branch.

{code}
select first_value(key1) over(partition by key2 order by key1) firstValue from `twoKeyJsn.json`;

...

26,212,355 rows selected (720.229 seconds)
0: jdbc:drill:schema=dfs.tmp> 
{code}

{code}
select last_value(key1) over(partition by key2 order by key1) firstValue from `twoKeyJsn.json`;

...

+------------------+
26,212,355 rows selected (239.109 seconds)
{code}

number of rows in the JSON file
{code}
0: jdbc:drill:schema=dfs.tmp> select count(*) from `twoKeyJsn.json`;
+-----------+
|  EXPR$0   |
+-----------+
| 26212355  |
+-----------+
1 row selected (13.949 seconds)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message