hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Harish Butani" <rhbut...@gmail.com>
Subject Review Request 22752: HIVE-7063: Optimize for the Top N within a Group use case
Date Wed, 18 Jun 2014 20:40:42 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/22752/
-----------------------------------------------------------

Review request for hive and Ashutosh Chauhan.


Bugs: HIVE-7063
    https://issues.apache.org/jira/browse/HIVE-7063


Repository: hive-git


Description
-------

It is common to rank within a Group/Partition and then only return the Top N entries within
each Group.
With Streaming mode for Windowing, we should push the post filter on the rank into the Windowing
processing as a Limit expression.


Diffs
-----

  ql/src/java/org/apache/hadoop/hive/ql/exec/PTFTopNHash.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ReduceSinkOperator.java 03a64e8 
  ql/src/java/org/apache/hadoop/hive/ql/exec/TopNHash.java bc81467 
  ql/src/java/org/apache/hadoop/hive/ql/exec/vector/VectorReduceSinkOperator.java 11024da

  ql/src/java/org/apache/hadoop/hive/ql/plan/ReduceSinkDesc.java 8c1d336 
  ql/src/java/org/apache/hadoop/hive/ql/plan/ptf/WindowTableFunctionDef.java c547e62 
  ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 7aaf455 
  ql/src/java/org/apache/hadoop/hive/ql/udf/ptf/WindowingTableFunction.java 2290766 
  ql/src/test/queries/clientpositive/windowing_streaming.q PRE-CREATION 
  ql/src/test/results/clientpositive/windowing_streaming.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/22752/diff/


Testing
-------

added new .q tests


Thanks,

Harish Butani


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message