hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "pengcheng xiong" <>
Subject Review Request 25557: improve the speed of col stats update speed
Date Thu, 11 Sep 2014 21:13:48 GMT

This is an automatically generated e-mail. To reply, visit:

Review request for hive.

Repository: hive-git


Major improvement
(1) All the partition status update/insert is now done in one transaction.
(2) Rather than to use a query to update per col per partition (total query = #col * # part),
now we use 1 query to delete everything and then use 1 query to insert everything. The transaction
makes sure that this happens in ACID mode.


  common/src/java/org/apache/hadoop/hive/conf/ 9df6656 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 33745e4 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 59d5244 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 637a39a 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 5c5ed7f 
  metastore/src/test/org/apache/hadoop/hive/metastore/ 5905efe

  metastore/src/test/org/apache/hadoop/hive/metastore/ 88b0791

  ql/src/test/queries/clientpositive/analyze_tbl_part.q 9040bd4 
  ql/src/test/results/clientpositive/analyze_tbl_part.q.out 40b926c 




pengcheng xiong

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message