drill-dev mailing list archives

From "abdelhakim deneche" <adene...@gmail.com>
Subject Review Request 28417: DRILL-1742 Use Hive stats when planning queries on Hive data sources
Date Mon, 24 Nov 2014 22:33:00 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/28417/
-----------------------------------------------------------

Review request for drill.


Bugs: DRILL-1742
    https://issues.apache.org/jira/browse/DRILL-1742


Repository: drill-git


Description
-------

HiveScan.getSplits() already retrieves the table and partition metadata using MetaStoreUtils.
We compute the total number of rows from the numRows property and store it in the rowCount
attribute, which getScanStats() later returns.
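
For readers without the diff at hand, the row-count computation described above could look roughly like the sketch below. It is not the code from the patch: the class and method names are hypothetical, plain maps stand in for the Hive table and partition parameters, and the handling of missing statistics (returning -1 as "unknown") is an assumption. Only the numRows property name comes from the description.

```java
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not the code from the patch.
final class HiveRowCountSketch {
    // Property name taken from the description above.
    static final String NUM_ROWS = "numRows";
    // Assumption: -1 marks "statistics not available".
    static final long UNKNOWN = -1L;

    // Reads numRows from one parameter map; returns UNKNOWN if absent or invalid.
    static long parseNumRows(Map<String, String> params) {
        String value = (params == null) ? null : params.get(NUM_ROWS);
        if (value == null) {
            return UNKNOWN;
        }
        try {
            long n = Long.parseLong(value.trim());
            return n >= 0 ? n : UNKNOWN;
        } catch (NumberFormatException e) {
            return UNKNOWN;
        }
    }

    // Assumption: a partitioned table sums numRows over its partitions,
    // while a non-partitioned table uses the table-level property.
    static long totalRows(Map<String, String> tableParams,
                          List<Map<String, String>> partitionParams) {
        if (partitionParams == null || partitionParams.isEmpty()) {
            return parseNumRows(tableParams);
        }
        long total = 0;
        for (Map<String, String> partition : partitionParams) {
            long n = parseNumRows(partition);
            if (n == UNKNOWN) {
                // One partition without stats makes the total unreliable.
                return UNKNOWN;
            }
            total += n;
        }
        return total;
    }
}
```

In this sketch a caller would store the result in a field such as rowCount and fall back to some default estimate when the result is UNKNOWN; that fallback is likewise an assumption, not something stated in this request.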


Diffs
-----

  contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java ddbc100


Diff: https://reviews.apache.org/r/28417/diff/


Testing
-------

Created several partitioned and non-partitioned tables and loaded data into them in Hive.

While querying the tables, I checked the logs to make sure the correct number of rows was computed.


Thanks,

abdelhakim deneche

