hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Siying Dong (JIRA)" <j...@apache.org>
Subject [jira] Created: (HIVE-1737) Two Bugs for Estimating Row Sizes in GroupByOperator
Date Thu, 21 Oct 2010 00:59:27 GMT
Two Bugs for Estimating Row Sizes in GroupByOperator
----------------------------------------------------

                 Key: HIVE-1737
                 URL: https://issues.apache.org/jira/browse/HIVE-1737
             Project: Hive
          Issue Type: Bug
            Reporter: Siying Dong
            Assignee: Siying Dong


Two bugs:
1. if UDAF uses string type, Group-by will break as it tries to insert an ArrayList<Object>
to a HashMap<KeyWrapper, AggregationBuffer[]>.
2. The code to sample size of keys only handles String type and Text type, while in most cases,
they are org.apache.hadoop.hive.serde2.lazy.LazyString, so that 0 is always used.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message