Return-Path: X-Original-To: apmail-asterixdb-dev-archive@minotaur.apache.org Delivered-To: apmail-asterixdb-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C236418F6E for ; Fri, 18 Dec 2015 18:18:22 +0000 (UTC) Received: (qmail 31123 invoked by uid 500); 18 Dec 2015 18:18:22 -0000 Delivered-To: apmail-asterixdb-dev-archive@asterixdb.apache.org Received: (qmail 31065 invoked by uid 500); 18 Dec 2015 18:18:22 -0000 Mailing-List: contact dev-help@asterixdb.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@asterixdb.incubator.apache.org Delivered-To: mailing list dev@asterixdb.incubator.apache.org Received: (qmail 31053 invoked by uid 99); 18 Dec 2015 18:18:22 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Dec 2015 18:18:22 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 01C5EC483C for ; Fri, 18 Dec 2015 18:18:22 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.899 X-Spam-Level: ** X-Spam-Status: No, score=2.899 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id XHClSiPF65pU for ; Fri, 18 Dec 2015 18:18:12 +0000 (UTC) Received: from mail-pf0-f170.google.com (mail-pf0-f170.google.com [209.85.192.170]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 0C235201A4 for ; Fri, 18 Dec 2015 18:18:12 +0000 (UTC) Received: by mail-pf0-f170.google.com with SMTP id n128so34984927pfn.0 for ; Fri, 18 Dec 2015 10:18:11 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=sender:from:content-type:message-id:mime-version:subject:date :references:to:in-reply-to; bh=O6j7Gyjqn7pvfhMRXSWUQjO6ALCPeTP6c6Mi/2Ko6lA=; b=wc34zE7fsymJLBdLG3eAQIy8JYhQSfDenBo5QTVZT3A8SECokQfBR6JhBwgKyi2CHj L66KCk+NIZdJthThosO/NMgcAaVDUIukpO9v2L0ojFZbxejlffMjwDKOtsdXpwQThhoy 8228OvzuqxNaVIOo1sKt41zOrqoNP6Hqzpcjs6Zmdd1heju14wruSonP3K+paURDr22V EyeyPQNuuO+lazJqcUJkZoRv8+VGg7wf/yj5UNoiIb9+rD6Zf/Ht/rey130UlNCrLpmr ufBA2dInT2Vgr/eZhag9D7rMuE4rDqD4lmVzI1xVn9tDI44v8+AWOZbpRKzNFOhGfZDP fSfQ== X-Received: by 10.98.8.212 with SMTP id 81mr7310577pfi.165.1450462690727; Fri, 18 Dec 2015 10:18:10 -0800 (PST) Received: from dhcp-053161.ics.uci.edu (dhcp-053161.ics.uci.edu. [128.195.53.161]) by smtp.gmail.com with ESMTPSA id r90sm19283801pfa.12.2015.12.18.10.18.09 for (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Fri, 18 Dec 2015 10:18:10 -0800 (PST) Sender: Ildar Absalyamov From: Ildar Absalyamov Content-Type: multipart/alternative; boundary="Apple-Mail=_9EDABA8A-68E7-499D-8024-94841655F447" Message-Id: <56A49DCD-A24A-47DC-AFA7-604A6BFE1B1F@gmail.com> Mime-Version: 1.0 (Mac OS X Mail 9.1 \(3096.5\)) Subject: Re: Histograms and statistics on Asterix Date: Fri, 18 Dec 2015 10:18:09 -0800 References: <5672FBF7.6020504@gmail.com> To: dev@asterixdb.incubator.apache.org In-Reply-To: X-Mailer: Apple Mail (2.3096.5) --Apple-Mail=_9EDABA8A-68E7-499D-8024-94841655F447 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii Hi Wail, I do have some doc which can give you some idea of how stats collection = will work = https://docs.google.com/document/d/12aP8Pzp68b_HxJe-svFmG9dHFX6zliGw-ErNSd= _Av3I/edit?usp=3Dsharing = . But it does not describe the statistical = synopsis format. We were planning to use wavelets instead to histograms = for these LSM-based statistics. Let me know if you want to know more on = how wavelets work and why they should be used. > On Dec 18, 2015, at 07:23, Wail Alkowaileet = wrote: >=20 > Good to know. > Is there any type of design document ? >=20 > P.S. Sattam told me that there was some sort of a comparative study = shows > the difference in performance of open vs. closed types. Where can I = find it? >=20 > Thanks! >=20 > On Thu, Dec 17, 2015 at 9:16 PM, Mike Carey wrote: >=20 >> Ildar is working on that (camping on the LSM lifecycle) and Wenhai is >> doing some work on histograms that's intended for in-flight use = during >> query processing (to do dynamic partitioning, e.g., during parallel = sorts >> and joins). >>=20 >>=20 >> On 12/17/15 1:53 AM, Wail Alkowaileet wrote: >>=20 >>> Hi Devs, >>>=20 >>> I want to ask if there's any on going work on building histograms = and >>> stats >>> for Asterix ? >>>=20 >>=20 >>=20 >=20 >=20 > --=20 >=20 > *Regards,* > Wail Alkowaileet Best regards, Ildar --Apple-Mail=_9EDABA8A-68E7-499D-8024-94841655F447--