hive-dev mailing list archives

From "pengcheng xiong" <>
Subject Re: Review Request 25557: improve the speed of col stats update speed
Date Fri, 12 Sep 2014 03:53:46 GMT

(Updated Sept. 12, 2014, 3:53 a.m.)

Review request for hive.


rebase to trunk

Repository: hive-git


Major improvements:
(1) All partition stats updates/inserts are now done in one transaction.
(2) Rather than issuing one update query per column per partition (total queries = #cols * #parts),
we now use 1 query to delete everything and then 1 query to insert everything. The transaction
ensures that the delete and the insert are applied together, with ACID guarantees.
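
The delete-then-insert pattern described above can be sketched as follows. This is only an illustration, not the patch's code: it uses Python's sqlite3 module as a stand-in for the metastore database, and the table part_col_stats, its columns, and the function replace_partition_col_stats are hypothetical names chosen for the example.

```python
import sqlite3


def replace_partition_col_stats(conn, part_names, rows):
    """Replace column stats for the given partitions in one transaction.

    rows is a list of (part_name, col_name, num_nulls) tuples.
    All table and column names here are hypothetical.
    """
    part_marks = ",".join("?" for _ in part_names)
    row_marks = ",".join("(?, ?, ?)" for _ in rows)
    flat_params = [value for row in rows for value in row]
    # "with conn" commits if the body succeeds and rolls back if it raises,
    # so the delete and the insert take effect together or not at all.
    with conn:
        # One statement removes the old stats for every listed partition.
        conn.execute(
            f"DELETE FROM part_col_stats WHERE part_name IN ({part_marks})",
            list(part_names),
        )
        if rows:
            # One multi-row statement inserts all new stats. Very large
            # batches may need chunking because databases cap the number
            # of bound parameters per statement.
            conn.execute(
                "INSERT INTO part_col_stats (part_name, col_name, num_nulls) "
                f"VALUES {row_marks}",
                flat_params,
            )


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE part_col_stats ("
    "part_name TEXT, col_name TEXT, num_nulls INTEGER, "
    "PRIMARY KEY (part_name, col_name))"
)
conn.execute("INSERT INTO part_col_stats VALUES ('ds=1', 'a', 0)")
conn.commit()

new_rows = [("ds=1", "a", 5), ("ds=1", "b", 7), ("ds=2", "a", 1)]
replace_partition_col_stats(conn, ["ds=1", "ds=2"], new_rows)
```

With this shape the number of statements stays at two regardless of how many columns or partitions are involved, and a failure in the insert leaves the previous stats in place because the delete is rolled back with it.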

Diffs (updated)

  common/src/java/org/apache/hadoop/hive/conf/ 9df6656 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 33745e4 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 5a8591a 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 637a39a 
  metastore/src/java/org/apache/hadoop/hive/metastore/ 5c5ed7f 
  metastore/src/test/org/apache/hadoop/hive/metastore/ 5905efe
  metastore/src/test/org/apache/hadoop/hive/metastore/ 88b0791
  ql/src/test/queries/clientpositive/analyze_tbl_part.q 9040bd4 
  ql/src/test/results/clientpositive/analyze_tbl_part.q.out 40b926c 




pengcheng xiong
