hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "pengcheng xiong" <pxi...@hortonworks.com>
Subject Re: Review Request 25557: improve the speed of col stats update speed
Date Fri, 12 Sep 2014 03:53:46 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/25557/
-----------------------------------------------------------

(Updated Sept. 12, 2014, 3:53 a.m.)


Review request for hive.


Changes
-------

rebase to trunk


Repository: hive-git


Description
-------

Major improvement
(1) All the partition status update/insert is now done in one transaction.
(2) Rather than to use a query to update per col per partition (total query = #col * # part),
now we use 1 query to delete everything and then use 1 query to insert everything. The transaction
makes sure that this happens in ACID mode.


Diffs (updated)
-----

  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 9df6656 
  metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 33745e4 
  metastore/src/java/org/apache/hadoop/hive/metastore/MetaStoreDirectSql.java 5a8591a 
  metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 637a39a 
  metastore/src/java/org/apache/hadoop/hive/metastore/RawStore.java 5c5ed7f 
  metastore/src/test/org/apache/hadoop/hive/metastore/DummyRawStoreControlledCommit.java 5905efe

  metastore/src/test/org/apache/hadoop/hive/metastore/DummyRawStoreForJdoConnection.java 88b0791

  ql/src/test/queries/clientpositive/analyze_tbl_part.q 9040bd4 
  ql/src/test/results/clientpositive/analyze_tbl_part.q.out 40b926c 

Diff: https://reviews.apache.org/r/25557/diff/


Testing
-------


Thanks,

pengcheng xiong


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message