Return-Path: X-Original-To: apmail-incubator-drill-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-drill-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2AB9C107F1 for ; Tue, 25 Nov 2014 00:12:56 +0000 (UTC) Received: (qmail 49568 invoked by uid 500); 25 Nov 2014 00:12:55 -0000 Delivered-To: apmail-incubator-drill-dev-archive@incubator.apache.org Received: (qmail 49513 invoked by uid 500); 25 Nov 2014 00:12:55 -0000 Mailing-List: contact drill-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: drill-dev@incubator.apache.org Delivered-To: mailing list drill-dev@incubator.apache.org Received: (qmail 49500 invoked by uid 99); 25 Nov 2014 00:12:55 -0000 Received: from reviews-vm.apache.org (HELO reviews.apache.org) (140.211.11.40) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Nov 2014 00:12:55 +0000 Received: from reviews.apache.org (localhost [127.0.0.1]) by reviews.apache.org (Postfix) with ESMTP id 245E36A5A5; Tue, 25 Nov 2014 00:12:54 +0000 (UTC) Content-Type: multipart/alternative; boundary="===============3064734855073476819==" MIME-Version: 1.0 Subject: Re: Review Request 28417: DRILL-1742 Use Hive stats when planning queries on Hive data sources From: "Aman Sinha" To: "Aman Sinha" , "abdelhakim deneche" , "drill" Date: Tue, 25 Nov 2014 00:12:54 -0000 Message-ID: <20141125001254.15977.55196@reviews.apache.org> X-ReviewBoard-URL: https://reviews.apache.org Auto-Submitted: auto-generated Sender: "Aman Sinha" X-ReviewGroup: drill-git X-ReviewRequest-URL: https://reviews.apache.org/r/28417/ X-Sender: "Aman Sinha" References: <20141124223300.15978.91592@reviews.apache.org> In-Reply-To: <20141124223300.15978.91592@reviews.apache.org> Reply-To: "Aman Sinha" X-ReviewRequest-Repository: drill-git --===============3064734855073476819== MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/28417/#review62903 ----------------------------------------------------------- contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java This value is not initialized and only assigned to in the copy constructor (other 2 constructors are not doing anything with it). contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java An assumption here is that the numRows property is always available...you should handle the scenario when it isn't. contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java Same as above. contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java Why does the estRowCount need to be here if it is not being used any more ? contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java Why is the property NO_EXACT_ROW_COUNT if we are getting the exact row count from Hive ? - Aman Sinha On Nov. 24, 2014, 10:33 p.m., abdelhakim deneche wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/28417/ > ----------------------------------------------------------- > > (Updated Nov. 24, 2014, 10:33 p.m.) > > > Review request for drill. > > > Bugs: DRILL-1742 > https://issues.apache.org/jira/browse/DRILL-1742 > > > Repository: drill-git > > > Description > ------- > > HiveScan.getSplits() already gets the table and partitions metadata using MetaStoreUtils. > We compute the total number of rows using the numRows property and store the computed number of rows in rowCount attribute which is later returned by getScanStats(). > > > Diffs > ----- > > contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveScan.java ddbc100 > > Diff: https://reviews.apache.org/r/28417/diff/ > > > Testing > ------- > > created several partitioned and non-partitioned tables, loaded data in hive. > > querying the tables I checked the logs to make sure the correct number of rows is computed. > > > Thanks, > > abdelhakim deneche > > --===============3064734855073476819==--