drill-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Khurram Faraaz (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-2608) Union all query fails when json.all_text_mode=false
Date Fri, 27 Mar 2015 20:11:54 GMT

    [ https://issues.apache.org/jira/browse/DRILL-2608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14384522#comment-14384522
] 

Khurram Faraaz commented on DRILL-2608:
---------------------------------------

Here are first five lines from each of the JSON data files, that I have used in the query.

{code}
[root@centos-01 json_Data]# head -n 5 charData.json 
{"key":"itzVxYBb"}
{"key":"HQXshVBh"}
{"key":"pbNvkcsX"}
{"key":"sjHkVcvo"}
{"key":"LMqAMKzp"}
[root@centos-01 json_Data]# head -n 5 vrChrData.json 
{"key":"L18ko0NFyt68DzJRLPSHtlYZOgK8ijmyydsDrflzAAzs5l07yZ62ybMcs  jb87BH9ilEp3zIRkhGCaDXDohioXEAIqRiks7YUHEFWAzNTyU9EAe1jK58SphDwRwjfBb313APmflL3UVa6PczpmxgTgZl3nvwdmjz26YfWQuKDByzhtMPOTszq4iEiuG6H5HsbJ8C1M1"}
{"key":"twPWTXQeoVYD3eagLt4RQOqXN7ywh4eedDsvq1BKd97DQBJT53C2WNf5WYw4EQ6QPRDi6zkR rvFTrdRafhR6Czx2t7KC4cCLrzeogwA2Hzyhzi13ydt6RufPG3NqLGADhnfuvzNf0uhZdLQZp9bmzTFOfRzIdpsCr52TXvMe2r1pfqXDsU3k
VRey4FZ9FG9rKso9CP"}
{"key":"JTq tdzIDR K74AqIOx32Ypflr95W3euftPjFR9q7dVKE6VVSbCGqnnKJn4gbW5wJmq6M3WGlbqDJ5TUlQVsRPJKmWAOpvwX2FkgxLCKvcH0r5SleQQLiLfluXdsSLL2QlX3ioINbMV1qaLpeKSLloPKAY7y1DVykG5olGWvyXDAOPVLDRUnwdHNlo
XzbbU6mnbhgiD"}
{"key":"u 8DLWmYBoI1813EoQszdVie4uSrlRO6bDGhlH2YkMfgXvReS7rrrFj0FhlJvIb euPIbeFxSQFyI9ijYnsFh0t4mcD47K2WXDZdZ
CMBbD9w5ivsozkpmO37bFPSlsIsu1l2P1oRhL eP0pg KKXaRurTfpWHyA5n3hz09i2R1WIomZPHd5a34vVpRGbdmwWvnPQaCB"}
{"key":"EtZT CQCjc9g6oAtU8MnBSabqcIcU0s0X0xbOo8j28 UH0jjrJtYY8qMAnA99iN19QHFNIz8USZEMGuKqrCVPFJGQUcrexnoV1jY0JxNlCR6nlavWnpgFjkDABIe
fPIT1LbrM9eRzBdHSUNX13uXRnreI4VevMOyRbWRxCyJebFKn6oEP9eCii9W6AqFOtVlmZZ6CqC"}
[root@centos-01 json_Data]# head -n 5 dateData.json 
{"key":"2009-03-03"}
{"key":"2001-08-27"}
{"key":"2011-07-26"}
{"key":"1970-09-02"}
{"key":"1983-04-24"}
[root@centos-01 json_Data]# head -n 5 doubleData.json 
{"key":0.676345455975}
{"key":579877771.632}
{"key":0.442048767318}
{"key":165181895.498}
{"key":0.0635879843698}
[root@centos-01 json_Data]# head -n 5 intData.json 
{"key":-291344610}
{"key":10331709374297784}
{"key":-1184860307}
{"key":67862516816374816}
{"key":-1921037123}
[root@centos-01 json_Data]# head -n 5 timeStmpData.json 
{"key":"2015-03-26 19:04:55.489"}
{"key":"2015-03-26 19:04:55.489"}
{"key":"2015-03-26 19:04:55.490"}
{"key":"2015-03-26 19:04:55.490"}
{"key":"2015-03-26 19:04:55.539"}
{code}

> Union all query fails when json.all_text_mode=false
> ---------------------------------------------------
>
>                 Key: DRILL-2608
>                 URL: https://issues.apache.org/jira/browse/DRILL-2608
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 0.9.0
>         Environment: | 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: Exit early
from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 EDT | Unknown     | 26.03.2015
@ 16:53:21 EDT |
>            Reporter: Khurram Faraaz
>            Assignee: Sean Hsuan-Yi Chu
>
> Union all query over JSON data file fails when store.json.all_text_mode is set to false,
and same query returns correct results when store.json.all_text_mode is set to true. Each
JSON data file had only one type of object {"key":<value>}, and the values in each of
the JSON data files were of same datatype. Test was executed on a 4 node cluster.
> {code}
> 0: jdbc:drill:> select key from `charData.json` union all select key from `dateData.json`
union all select key from `doubleData.json` union all select key from `intData.json` union
all select key from `timeStmpData.json` union all select key from `vrChrData.json`;
> Query failed: RemoteRpcException: Failure while running fragment., For input string:
"itzVxYBb" [ f1f81073-161c-4f24-89e5-37379413b01b on centos-04.qa.lab:31010 ]
> [ f1f81073-161c-4f24-89e5-37379413b01b on centos-04.qa.lab:31010 ]
> Error: exception while executing query: Failure while executing query. (state=,code=0)
> {code}
> Then I set alter session set `store.json.all_text_mode`=true;
> After setting son.all_text_mode to true, union all query returned correct results.
> {code}
> 0: jdbc:drill:> select key from `charData.json` union all select key from `dateData.json`
union all select key from `doubleData.json` union all select key from `intData.json` union
all select key from `timeStmpData.json` union all select key from `vrChrData.json`;
> ...
> +------------+
> 7,194 rows selected (0.462 seconds)
> {code}
> Resetting it back to false gives the same Exception
> {code}
> 0: jdbc:drill:> alter session set `store.json.all_text_mode`=false;
> +------------+------------+
> |     ok     |  summary   |
> +------------+------------+
> | true       | store.json.all_text_mode updated. |
> +------------+------------+
> 1 row selected (0.049 seconds)
> 0: jdbc:drill:> select key from `charData.json` union all select key from `dateData.json`
union all select key from `doubleData.json` union all select key from `intData.json` union
all select key from `timeStmpData.json` union all select key from `vrChrData.json`;
> Query failed: RemoteRpcException: Failure while running fragment., For input string:
"itzVxYBb" [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> Error: exception while executing query: Failure while executing query. (state=,code=0)
> {code}
> Stack trace from drillbit.log
> {code}
> 2015-03-27 18:30:56,620 [2aea5e1e-88b9-3e4e-07b5-d7e46b29756f:frag:0:0] ERROR o.a.drill.exec.work.foreman.Foreman
- Error b9cb90bd-7d89-4061-8595-4c5ad983f3f3: RemoteRpcException: Failure while running fragment.,
For input string: "itzVxYBb" [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010
]
> [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> org.apache.drill.exec.rpc.RemoteRpcException: Failure while running fragment., For input
string: "itzVxYBb" [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
>         at org.apache.drill.exec.work.foreman.QueryManager.statusUpdate(QueryManager.java:163)
[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:281)
[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:114)
[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:110)
[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at org.apache.drill.exec.work.fragment.FragmentExecutor.internalFail(FragmentExecutor.java:230)
[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:182)
[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
[drill-common-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
[na:1.7.0_75]
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
[na:1.7.0_75]
>         at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message