hive-dev mailing list archives

From "jiraposter@reviews.apache.org (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2068) Speed up query "select xx,xx from xxx LIMIT xxx" if no filtering or aggregation
Date Thu, 07 Apr 2011 19:20:05 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017092#comment-13017092 ]

jiraposter@reviews.apache.org commented on HIVE-2068:
-----------------------------------------------------


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/540/
-----------------------------------------------------------

Review request for hive and namit jain.


Summary
-------

For HIVE-2068


This addresses bug HIVE-2068.
    https://issues.apache.org/jira/browse/HIVE-2068


Diffs
-----

  trunk/cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 1086466 
  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1086466 
  trunk/conf/hive-default.xml 1086466 
  trunk/hwi/src/java/org/apache/hadoop/hive/hwi/HWISessionItem.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/CommandNeedRetryException.java PRE-CREATION
  trunk/ql/src/java/org/apache/hadoop/hive/ql/Context.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FetchTask.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/LimitOperator.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMapRedUtils.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/SamplePruner.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseContext.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/QBParseInfo.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/FetchWork.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/LimitDesc.java 1086466 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/processors/CommandProcessor.java 1086466 
  trunk/ql/src/test/org/apache/hadoop/hive/ql/QTestUtil.java 1086466 
  trunk/ql/src/test/queries/clientpositive/global_limit.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/global_limit.q.out PRE-CREATION 
  trunk/service/src/java/org/apache/hadoop/hive/service/HiveServer.java 1086466 

Diff: https://reviews.apache.org/r/540/diff


Testing
-------

Added a test (global_limit.q) to the test suite.


Thanks,

Siying



> Speed up query "select xx,xx from xxx LIMIT xxx" if no filtering or aggregation
> -------------------------------------------------------------------------------
>
>                 Key: HIVE-2068
>                 URL: https://issues.apache.org/jira/browse/HIVE-2068
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Siying Dong
>            Assignee: Siying Dong
>         Attachments: HIVE-2068.1.patch, HIVE-2068.2.patch, HIVE-2068.3.patch, HIVE-2068.4.patch
>
>
> Currently, "select xx,xx from xxx where ...(only partition conditions) LIMIT xxx" starts
> a MapReduce job whose input is the whole table or partition. The latency can be huge
> if the table or partition is big. We could reduce the number of input files to speed up such queries.
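
The idea in the description above can be sketched roughly as follows. This is an
illustrative sketch only, not the code from the patch: the class LimitInputPruner,
the method pickFiles, and the assumedMaxRowBytes parameter are hypothetical names.
It assumes an upper bound on row size is known, so that reading
limit * assumedMaxRowBytes bytes is expected to yield at least `limit` rows. A real
implementation would also need a fallback to the full input when the chosen files
turn out to hold too few rows.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of Hive's API.
class LimitInputPruner {

    /**
     * Picks a prefix of the input files whose combined size is expected to
     * cover `limit` rows, given an assumed upper bound on the size of one row.
     *
     * @param fileSizes          sizes in bytes of the candidate input files
     * @param limit              the LIMIT value of the query
     * @param assumedMaxRowBytes assumed (not measured) maximum size of a row
     * @return the sizes of the files chosen as the reduced input
     */
    static List<Long> pickFiles(List<Long> fileSizes, long limit, long assumedMaxRowBytes) {
        // multiplyExact throws ArithmeticException instead of silently overflowing.
        long bytesNeeded = Math.multiplyExact(limit, assumedMaxRowBytes);
        List<Long> chosen = new ArrayList<>();
        long total = 0;
        for (long size : fileSizes) {
            if (total >= bytesNeeded) {
                break; // enough bytes selected; skip the remaining files
            }
            chosen.add(size);
            total += size;
        }
        return chosen;
    }
}
```

For example, with four 400-byte files, LIMIT 5, and an assumed maximum row size of
100 bytes, pickFiles selects only the first two files (800 bytes >= 500 bytes needed)
instead of all four.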

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
