hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mostafa Mokhtar (JIRA)" <>
Subject [jira] [Created] (HIVE-10484) Vectorization : Big Table Retained Mapping duplicate column
Date Fri, 24 Apr 2015 22:29:38 GMT
Mostafa Mokhtar created HIVE-10484:

             Summary: Vectorization : Big Table Retained Mapping duplicate column
                 Key: HIVE-10484
             Project: Hive
          Issue Type: Bug
          Components: Tez, Vectorization
    Affects Versions: 1.2.0
            Reporter: Mostafa Mokhtar
            Assignee: Matt McCline
             Fix For: 1.2.0

With vectorization and tez enabled TPC-DS Q70 fails with 
Caused by: java.lang.RuntimeException: Big Table Retained Mapping duplicate column 6 in ordered
column map {6=(value column: 6, type name: int), 21=(value column: 21, type name: float),
22=(value column: 22, type name: int)} when adding value column 6, type int
	at org.apache.hadoop.hive.ql.exec.vector.VectorColumnOrderedMap.add(
	at org.apache.hadoop.hive.ql.exec.vector.VectorColumnOutputMapping.add(
	at org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.determineCommonInfo(
	at org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.<init>(
	at org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinGenerateResultOperator.<init>(
	at org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinInnerGenerateResultOperator.<init>(
	at org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinInnerLongOperator.<init>(
	... 49 more

 select s_state
               from  (select s_state as s_state, sum(ss_net_profit),
                             rank() over ( partition by s_state order by sum(ss_net_profit)
desc) as ranking
                      from   store_sales, store, date_dim
                      where  d_month_seq between 1193 and 1193+11
                            and date_dim.d_date_sk = store_sales.ss_sold_date_sk
                            and store.s_store_sk  = store_sales.ss_store_sk
                      group by s_state
                     ) tmp1
               where ranking <= 5

This message was sent by Atlassian JIRA

View raw message