impala-user mailing list archives

From Dejan Prokić <dej...@nordeus.com>
Subject Bug with complex types
Date Tue, 27 Dec 2016 09:58:37 GMT
Hi,

I noticed a bug when selecting data from a table with a complex type. This is
the definition of my table:

CREATE TABLE event (
  user_id BIGINT,
  event_id INT,
  event_type_id INT,
  ts_bigint BIGINT,
  processed_ts_bigint BIGINT,
  event_map MAP<STRING, STRING>,
  server_id INT
)
PARTITIONED BY (date_id INT)
STORED AS PARQUET;

I want to run this query:

WITH
battles AS (
SELECT
  *
FROM event e,
  (SELECT
    MAX(CASE WHEN key='won' THEN value END) AS won
  FROM e.event_map) m
WHERE e.date_id = 2420 AND e.event_id = 10012
)
SELECT won, count(*) FROM battles
GROUP BY won;

The map element with key 'won' only ever has the values 'true' and 'false'.
If the battles subquery has more than 5 million rows, the query returns
strange results; when it has fewer rows, the query returns correct data.
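
One possible cross-check is to flatten the map with a plain join instead of
the correlated aggregate subquery and compare the counts. This is only a
sketch, not a confirmed workaround: it assumes each event has at most one
'won' entry, and unlike the query above it drops events that have no 'won'
key instead of counting them under NULL.

```sql
-- Cross-check sketch: join each event row to its map entries directly.
-- Assumption: at most one 'won' entry per event, so no aggregate is needed.
-- Events without a 'won' key are excluded here (no NULL group).
SELECT
  m.value AS won,
  count(*) AS cnt
FROM event e, e.event_map m
WHERE e.date_id = 2420
  AND e.event_id = 10012
  AND m.key = 'won'
GROUP BY m.value;
```

If the 'true' and 'false' counts from this form differ from those of the
original query on the same partition, that would suggest the problem lies in
how the correlated subquery is evaluated rather than in the data.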

Are you aware of this issue? I didn't attach my data because it is too large
to send by email (20 GB).

Thanks

*Dejan Prokić* | Data Engineer | Nordeus
