Return-Path: X-Original-To: apmail-drill-issues-archive@minotaur.apache.org Delivered-To: apmail-drill-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id CF26B1853F for ; Thu, 30 Apr 2015 21:05:06 +0000 (UTC) Received: (qmail 38728 invoked by uid 500); 30 Apr 2015 21:05:06 -0000 Delivered-To: apmail-drill-issues-archive@drill.apache.org Received: (qmail 38702 invoked by uid 500); 30 Apr 2015 21:05:06 -0000 Mailing-List: contact issues-help@drill.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@drill.apache.org Delivered-To: mailing list issues@drill.apache.org Received: (qmail 38692 invoked by uid 99); 30 Apr 2015 21:05:06 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Apr 2015 21:05:06 +0000 Date: Thu, 30 Apr 2015 21:05:06 +0000 (UTC) From: "Daniel Barclay (Drill) (JIRA)" To: issues@drill.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522268#comment-14522268 ] Daniel Barclay (Drill) commented on DRILL-2141: ----------------------------------------------- This doesn't seem reproducible without further information. What exactly did "./nfl" in the query refer to when the query was run? That is, what exactly is the relationship of attached file FlumeData.1422748800086 to that reference? ( Having a copy of the attached JSON file at /tmp/nfs/FlumeData.1422748800086.json (with nothing else in nfl/) and using "from `dfs.tmp`.`nfl`" in the query did not yield an error. Having a copy of that file at /tmp/nfs/FlumeData.1422748800086 (with nothing else in nfl/) and using "from `dfs.tmp`.`nfl`" in the query expectedly yields a "table not found" error. Having a copy of that file at /tmp/nfl.json and using "from `dfs.tmp`.`nfl.json`" in the query did not yield an error. ) > Data type error in group by and order by for JSON > ------------------------------------------------- > > Key: DRILL-2141 > URL: https://issues.apache.org/jira/browse/DRILL-2141 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Data Types > Affects Versions: 0.7.0 > Reporter: Andries Engelbrecht > Assignee: Daniel Barclay (Drill) > Fix For: 1.0.0 > > Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log > > > When doing group by and oder by on complex nested JSON getting Data type errors. > Query: > select t.retweeted_status.`user`.name as name, count(t.retweeted_status.id) as rt_count from `./nfl` t where t.retweeted_status.`user`.name is not null group by t.retweeted_status.`user`.name order by count(t.retweeted_status.id) desc limit 10; > Screen output: > Query failed: Query failed: Failure while running fragment., Failure while reading vector. Expected vector class of org.apache.drill.exec.vector.NullableIntVector but was holding vector class org.apache.drill.exec.vector.NullableVarCharVector. [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ] > [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ] > java.lang.RuntimeException: java.sql.SQLException: Failure while executing query. > at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514) > at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148) > at sqlline.SqlLine.print(SqlLine.java:1809) > at sqlline.SqlLine$Commands.execute(SqlLine.java:3766) > at sqlline.SqlLine$Commands.sql(SqlLine.java:3663) > at sqlline.SqlLine.dispatch(SqlLine.java:889) > at sqlline.SqlLine.begin(SqlLine.java:763) > at sqlline.SqlLine.start(SqlLine.java:498) > at sqlline.SqlLine.main(SqlLine.java:460) > Drill log attached -- This message was sent by Atlassian JIRA (v6.3.4#6332)