drill-issues mailing list archives

From "Jinfeng Ni (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-5769) IndexOutOfBoundsException when querying JSON files
Date Thu, 07 Sep 2017 23:35:02 GMT

    [ https://issues.apache.org/jira/browse/DRILL-5769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16157860#comment-16157860 ]

Jinfeng Ni commented on DRILL-5769:
-----------------------------------

I can reproduce this problem with the following options set (the Parquet-related options are
not needed, since the query reads only JSON):

{code}
drill.exec.functions.cast_empty_string_to_null
store.json.all_text_mode

select t.id
. . . . . . . . . . . > from dfs.`/drill/testdata/drill5769/???.json` t
. . . . . . . . . . . > where t.assetData.debt.couponPaymentFeature.interestBasis = '5';
Error: SYSTEM ERROR: IndexOutOfBoundsException: index: 1024, length: 1 (expected: range(0, 1024))
{code}
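
As a minimal sketch of the setup, assuming the two options are enabled at session level and set to TRUE as described in the issue's Environment field (the exact commands used are not shown in this thread), they could be turned on before running the query:

{code}
-- Assumed session-level setup; the thread lists only the option names.
ALTER SESSION SET `drill.exec.functions.cast_empty_string_to_null` = true;
ALTER SESSION SET `store.json.all_text_mode` = true;
{code}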

> IndexOutOfBoundsException when querying JSON files
> --------------------------------------------------
>
>                 Key: DRILL-5769
>                 URL: https://issues.apache.org/jira/browse/DRILL-5769
>             Project: Apache Drill
>          Issue Type: Bug
>          Components:  Server, Storage - JSON
>    Affects Versions: 1.10.0
>         Environment: *jdk_8u45_x64*
> *single drillbit running on zookeeper*
> *Following options set to TRUE:*
> drill.exec.functions.cast_empty_string_to_null
> store.json.all_text_mode
> store.parquet.enable_dictionary_encoding
> store.parquet.use_new_reader
>            Reporter: David Lee
>            Assignee: Jinfeng Ni
>             Fix For: 1.10.0, 1.11.0, 1.12.0
>
>         Attachments: 001.json, 100.json, 111.json
>
>
> *Running the following SQL on these three JSON files fails:*
> 001.json 100.json 111.json
> select t.id
> from dfs.`/tmp/???.json` t
> where t.assetData.debt.couponPaymentFeature.interestBasis = '5'
> *Error:*
> org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: IndexOutOfBoundsException: index: 1024, length: 1 (expected: range(0, 1024)) Fragment 0:0 [Error Id: xxxx.xxxx...
> *However, running the same SQL on two out of three files works:*
> select t.id
> from dfs.`/tmp/1??.json` t
> where t.assetData.debt.couponPaymentFeature.interestBasis = '5'
> select t.id
> from dfs.`/tmp/?1?.json` t
> where t.assetData.debt.couponPaymentFeature.interestBasis = '5'
> select t.id
> from dfs.`/tmp/??1.json` t
> where t.assetData.debt.couponPaymentFeature.interestBasis = '5'
> *Changing the selected column from t.id to t.* also works: *
> select *
> from dfs.`/tmp/???.json` t
> where t.assetData.debt.couponPaymentFeature.interestBasis = '5'



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
