hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jose Rozanec <jose.roza...@mercadolibre.com>
Subject Query fails if condition placed on Parquet struct field
Date Tue, 03 May 2016 19:45:33 GMT
Hello,

We are running queries on Hive against parquet files.
In the schema definition, we have a parquet struct called device with a
string field user_agent.

If we run query from Example 1, it returns results as expected.
If we run query from Example 2, execution fails and exits with error.

Did anyone face a similar case?

Thanks!

*Example 1:*
SELECT *device.user_agent* FROM sometable WHERE ds >= '2016-03-30 00' AND
ds <= '2016-03-30 01' LIMIT 1;

*Example 2:*
SELECT *device.user_agent* FROM sometable WHERE ds >= '2016-03-30 00' AND
ds <= '2016-03-30 01' AND *device.user_agent* LIKE 'Mozilla%'  LIMIT 1;


The error and trace we get is:

Exception from container-launch.
FAILED: Execution Error, return code 2 from
org.apache.hadoop.hive.ql.exec.mr.MapRedTask
Container exited with a non-zero exit code 1

*Stack trace: ExitCodeException exitCode=1:*
* at org.apache.hadoop.util.Shell.runCommand(Shell.java:545)*
* at org.apache.hadoop.util.Shell.run(Shell.java:456)*
* at
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:722)*
* at
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:212)*
* at
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302)*
* at
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)*
* at java.util.concurrent.FutureTask.run(FutureTask.java:262)*
* at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)*
* at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)*
* at java.lang.Thread.run(Thread.java:745)*

Mime
View raw message