Return-Path: X-Original-To: apmail-drill-issues-archive@minotaur.apache.org Delivered-To: apmail-drill-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F09F1177F6 for ; Wed, 13 May 2015 20:21:05 +0000 (UTC) Received: (qmail 74656 invoked by uid 500); 13 May 2015 20:21:05 -0000 Delivered-To: apmail-drill-issues-archive@drill.apache.org Received: (qmail 74633 invoked by uid 500); 13 May 2015 20:21:05 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 74620 invoked by uid 99); 13 May 2015 20:21:05 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 May 2015 20:21:05 +0000 Date: Wed, 13 May 2015 20:21:05 +0000 (UTC) From: "Deneche A. Hakim (JIRA)" To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (DRILL-2281) Drill never returns when we use aggregate functions after a join with an order by MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/DRILL-2281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Deneche A. Hakim updated DRILL-2281: ------------------------------------ Fix Version/s: (was: 1.0.0) 1.1.0 > Drill never returns when we use aggregate functions after a join with an order by > --------------------------------------------------------------------------------- > > Key: DRILL-2281 > URL: https://issues.apache.org/jira/browse/DRILL-2281 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Relational Operators > Reporter: Rahul Challapalli > Assignee: Deneche A. Hakim > Fix For: 1.1.0 > > Attachments: data.json > > > git.commit.id.abbrev=6676f2d > The below query never returns : (Order by seems to be the culprit) > {code} > create view v1 as select uid, flatten(events) event from `data.json`; > create view v2 as select uid, flatten(transactions) transaction from `data.json`; > select v1.uid, MAX(v2.transaction.amount), MIN(v1.event.event_time) from v1 inner join v2 on v1.uid = v2.uid where v2.transaction.trans_time < 0 group by v1.uid order by v1.uid; > {code} > There seems to be constant activity in the drillbit.log file. The below message is continuously displayed in the log file > {code} > 2015-02-20 23:35:04,450 [2b183b65-4551-bb9a-35ca-b71b9eedc4d6:frag:1:2] INFO o.a.d.exec.vector.BaseValueVector - Realloc vector null. [32768] -> [65536] > 2015-02-20 23:35:04,451 [2b183b65-4551-bb9a-35ca-b71b9eedc4d6:frag:1:2] INFO o.a.d.exec.vector.BaseValueVector - Realloc vector null. [32768] -> [65536] > 2015-02-20 23:35:04,451 [2b183b65-4551-bb9a-35ca-b71b9eedc4d6:frag:1:2] INFO o.a.d.exec.vector.BaseValueVector - Realloc vector null. [32768] -> [65536] > {code} > Drill returns correct data when we remove one of the agg functions or use multiple aggs from the same side of the join. The below queries work : > {code} > select v1.uid, MAX(v2.transaction.amount) from v1 inner join v2 on v1.uid = v2.uid where v2.transaction.trans_time < 0 group by v1.uid order by v1.uid; > select v1.uid, MAX(v2.transaction.amount), MAX(v2.transaction.amount) from v1 inner join v2 on v1.uid = v2.uid where v2.transaction.trans_time < 0 group by v1.uid order by v1.uid; > {code} > Attached the dataset which contains 2 records. I copied over the same 2 records 50000 times and ran the queries on the data set. Let me know if you need anything else. -- This message was sent by Atlassian JIRA (v6.3.4#6332)