Date: Sun, 18 Jun 2017 13:57:00 +0000 (UTC)
From: "Harshvardhan Gupta (JIRA)"
To: derby-dev@db.apache.org
Subject: [jira] [Comment Edited] (DERBY-6940) Enhance derby statistics for more accurate selectivity estimates.

[ https://issues.apache.org/jira/browse/DERBY-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16053198#comment-16053198 ]

Harshvardhan Gupta edited comment on DERBY-6940 at 6/18/17 1:56 PM:
--------------------------------------------------------------------

Attaching an initial patch which adds the following statistics information:

1) The minimum value of each column appearing in an index.
2) The maximum value of each column appearing in an index.
3) The null count for each column appearing in an index.

I am currently storing the column-level values for minValue and maxValue as DataValueDescriptor objects. I am not entirely sure if this is the best way to store them for later use in comparisons when determining selectivity estimates.

To invoke the extraStatistics procedure, call SYSCS_UTIL.SYSCS_UPDATE_STATISTICS(). Currently the extraStatistics procedure is not invoked by the CREATE INDEX statement; we will need to write a variant for CREATE INDEX statements so that the extra statistics are collected at index creation time as well.
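
As an illustration of how the three statistics listed above (minimum, maximum, null count) could feed a selectivity estimate, here is a minimal, self-contained sketch. It is not taken from the attached patch and does not use Derby's DataValueDescriptor or optimizer classes. The class name ColumnStats, its methods, the restriction to Integer columns, and the assumption that values are spread uniformly between the minimum and the maximum are all hypothetical choices made for this example only.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical per-column summary; NOT Derby's actual statistics classes. */
final class ColumnStats {
    final Integer min;     // smallest non-null value, or null if there is none
    final Integer max;     // largest non-null value, or null if there is none
    final long nullCount;  // number of NULLs seen in the column
    final long rowCount;   // total number of rows scanned

    private ColumnStats(Integer min, Integer max, long nullCount, long rowCount) {
        this.min = min;
        this.max = max;
        this.nullCount = nullCount;
        this.rowCount = rowCount;
    }

    /** Single pass over the column values, as a statistics refresh might do. */
    static ColumnStats collect(List<Integer> values) {
        Integer min = null;
        Integer max = null;
        long nulls = 0;
        for (Integer v : values) {
            if (v == null) {
                nulls++;
                continue;
            }
            if (min == null || v < min) min = v;
            if (max == null || v > max) max = v;
        }
        return new ColumnStats(min, max, nulls, values.size());
    }

    /** Estimated fraction of rows satisfying "col IS NULL". */
    double isNullSelectivity() {
        return rowCount == 0 ? 0.0 : (double) nullCount / rowCount;
    }

    /**
     * Estimated fraction of rows satisfying "col < bound".
     * Assumes (for illustration only) a uniform spread between min and max.
     */
    double lessThanSelectivity(int bound) {
        if (rowCount == 0 || min == null) return 0.0;
        double nonNullFraction = (double) (rowCount - nullCount) / rowCount;
        if (bound <= min) return 0.0;             // no non-null value qualifies
        if (bound > max) return nonNullFraction;  // every non-null value qualifies
        // Here min < bound <= max, so the denominator is strictly positive.
        return nonNullFraction * (((double) bound - min) / ((double) max - min));
    }

    public static void main(String[] args) {
        ColumnStats s = ColumnStats.collect(Arrays.asList(10, null, 20, 30, null, 50));
        System.out.println("min=" + s.min + " max=" + s.max + " nulls=" + s.nullCount);
        System.out.println("sel(col IS NULL) = " + s.isNullSelectivity());
        System.out.println("sel(col < 30)    = " + s.lessThanSelectivity(30));
    }
}
```

The sketch keeps NULLs out of the min/max computation and scales range estimates by the non-null fraction, which is one reason a per-column null count is useful alongside the two bounds.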

was (Author: harshvardhan145):
    Attaching an initial patch which adds the following statistics information:

1) The minimum value of each column appearing in an index.
2) The maximum value of each column appearing in an index.
3) The null count for each column appearing in an index.

I am currently storing the column-level values as DataValueDescriptor objects. I am not entirely sure if this is the best way to store them for later use in comparisons when determining selectivity estimates.

To invoke the extraStatistics procedure, call SYSCS_UTIL.SYSCS_UPDATE_STATISTICS(). Currently the extraStatistics procedure is not invoked by the CREATE INDEX statement; we will need to write a variant for CREATE INDEX statements so that the extra statistics are collected at index creation time as well.

> Enhance derby statistics for more accurate selectivity estimates.
> -----------------------------------------------------------------
>
>                 Key: DERBY-6940
>                 URL: https://issues.apache.org/jira/browse/DERBY-6940
>             Project: Derby
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Harshvardhan Gupta
>            Assignee: Harshvardhan Gupta
>            Priority: Minor
>         Attachments: derby-6940.diff
>
>
> Derby should collect extra statistics at index build time and at statistics refresh time, which will help the optimizer make more precise selectivity estimates and choose better execution paths.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)