drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Barclay (Drill) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-2141) Data type error in group by and order by for JSON
Date Thu, 30 Apr 2015 21:05:06 GMT

    [ https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522268#comment-14522268

Daniel Barclay (Drill) commented on DRILL-2141:

This doesn't seem reproducible without further information.

What exactly did "./nfl" in the query refer to when the query was run?  That is, what exactly
is the relationship of attached file FlumeData.1422748800086 to that reference? 

Having a copy of the attached JSON file at /tmp/nfs/FlumeData.1422748800086.json (with nothing
else in nfl/) and using "from `dfs.tmp`.`nfl`" in the query did not yield an error.

Having a copy of that file at /tmp/nfs/FlumeData.1422748800086 (with nothing else in nfl/)
and using "from `dfs.tmp`.`nfl`" in the query expectedly yields a "table not found" error.

Having a copy of that file at /tmp/nfl.json and using "from `dfs.tmp`.`nfl.json`" in the query
did not yield an error.

> Data type error in group by and order by for JSON
> -------------------------------------------------
>                 Key: DRILL-2141
>                 URL: https://issues.apache.org/jira/browse/DRILL-2141
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Data Types
>    Affects Versions: 0.7.0
>            Reporter: Andries Engelbrecht
>            Assignee: Daniel Barclay (Drill)
>             Fix For: 1.0.0
>         Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log
> When doing group by and oder by on complex nested JSON getting Data type errors.
> Query:
> select t.retweeted_status.`user`.name as name, count(t.retweeted_status.id) as rt_count
from `./nfl` t where t.retweeted_status.`user`.name is not null group by t.retweeted_status.`user`.name
order by count(t.retweeted_status.id) desc limit 10;
> Screen output:
> Query failed: Query failed: Failure while running fragment., Failure while reading vector.
 Expected vector class of org.apache.drill.exec.vector.NullableIntVector but was holding vector
class org.apache.drill.exec.vector.NullableVarCharVector. [ c6ea670f-5fa0-491c-acfb-5ccd128ec324
on drilldemo:31010 ]
> [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
> java.lang.RuntimeException: java.sql.SQLException: Failure while executing query.
> 	at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
> 	at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
> 	at sqlline.SqlLine.print(SqlLine.java:1809)
> 	at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
> 	at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
> 	at sqlline.SqlLine.dispatch(SqlLine.java:889)
> 	at sqlline.SqlLine.begin(SqlLine.java:763)
> 	at sqlline.SqlLine.start(SqlLine.java:498)
> 	at sqlline.SqlLine.main(SqlLine.java:460)
> Drill log attached

This message was sent by Atlassian JIRA

View raw message