hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Namit Jain (JIRA)" <>
Subject [jira] Commented: (HIVE-474) Support for distinct selection on two or more columns
Date Thu, 21 Oct 2010 19:40:21 GMT


Namit Jain commented on HIVE-474:

1. add initEvaluators() in instead of
2. ReduceSinkDesc: use numKeys and getNumKeys() or change numKeys to numDistributionKeys -
   You may run into problems with serialization/deserialization
3. Add some comments in initEvaluatorsAndReturnStruct in ReduceSinkOperator
   -- explain that it is same as parent in case of no union for groupby
4. Can you more comments in GroupByOperator and SemanticAnalyzer also ?
   It looks OK, but it will help if there are more comments.

> Support for distinct selection on two or more columns
> -----------------------------------------------------
>                 Key: HIVE-474
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Alexis Rondeau
>            Assignee: Amareshwari Sriramadasu
>         Attachments: hive-474.0.4.2rc.patch, patch-474.txt
> The ability to select distinct several, individual columns as by example: 
> select count(distinct user), count(distinct session) from actions;   
> Currently returns the following failure: 
> FAILED: Error in semantic analysis: line 2:7 DISTINCT on Different Columns not Supported

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message