hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Namit Jain (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-647) SORT BY with GROUP ignored without LIMIT
Date Fri, 17 Jul 2009 22:58:15 GMT

    [ https://issues.apache.org/jira/browse/HIVE-647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732775#action_12732775
] 

Namit Jain commented on HIVE-647:
---------------------------------

Are you sure - I just tried the same query and it works for me - are you using trunk ?


Also, can you look at the plan file and search for numReducers 
(the plan file for the second job)

The plan file can be found by: hive.exec.plan from the tracker

> SORT BY with GROUP ignored without LIMIT
> ----------------------------------------
>
>                 Key: HIVE-647
>                 URL: https://issues.apache.org/jira/browse/HIVE-647
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Bill Graham
>
> For queries with GROUP BY and SORT BY, the sort is not handled properly when a LIMIT
is not supplied. If I run the following two queries, the first returns properly sorted results.
The second does not.
> SELECT user, SUM(numRequests) AS num FROM MyTable GROUP BY user SORT BY num DESC LIMIT
50;
> SELECT user, SUM(numRequests) AS num FROM MyTable GROUP BY user SORT BY num DESC;
> Explain is different for the two queries as well. The first uses 3 M/R jobs and the second
only uses 2, which might be part of the problem.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message