hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashish Thusoo (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-256) map side aggregation : number of output rows is same as number of input rows
Date Thu, 29 Jan 2009 04:43:59 GMT

    [ https://issues.apache.org/jira/browse/HIVE-256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12668339#action_12668339
] 

Ashish Thusoo commented on HIVE-256:
------------------------------------

>From what Namit told me, there were 3 bugs:

1. aggrPositions was not being initialized so we were not tracking the variable length fields
properly.
2. Instead of getFields he had to use getDeclaredFields as the former only gives public fields
where as most of our fields are private.
3. The numEntries stuff which would not let the code kick in if there were less thatn NUMROWESTIMATESIZE
of rows...

We had used 256 as a heuristic for unknown types...

> map side aggregation : number of output rows is same as number of input rows
> ----------------------------------------------------------------------------
>
>                 Key: HIVE-256
>                 URL: https://issues.apache.org/jira/browse/HIVE-256
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.2.0
>
>         Attachments: patch-256.1.txt
>
>
> map side aggregation : number of output rows is same as number of input rows

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message