Return-Path: X-Original-To: apmail-apex-dev-archive@minotaur.apache.org Delivered-To: apmail-apex-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 55B0D1832D for ; Wed, 9 Dec 2015 08:18:17 +0000 (UTC) Received: (qmail 80001 invoked by uid 500); 9 Dec 2015 08:18:13 -0000 Delivered-To: apmail-apex-dev-archive@apex.apache.org Received: (qmail 79932 invoked by uid 500); 9 Dec 2015 08:18:13 -0000 Mailing-List: contact dev-help@apex.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@apex.incubator.apache.org Delivered-To: mailing list dev@apex.incubator.apache.org Received: (qmail 79919 invoked by uid 99); 9 Dec 2015 08:18:13 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Dec 2015 08:18:13 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 19C10C9936 for ; Wed, 9 Dec 2015 08:18:13 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.001 X-Spam-Level: *** X-Spam-Status: No, score=3.001 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=3, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=datatorrent-com.20150623.gappssmtp.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id okygPW3-K-Pz for ; Wed, 9 Dec 2015 08:18:00 +0000 (UTC) Received: from mail-wm0-f41.google.com (mail-wm0-f41.google.com [74.125.82.41]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id AED8B24C0D for ; Wed, 9 Dec 2015 08:17:59 +0000 (UTC) Received: by wmvv187 with SMTP id v187so248434432wmv.1 for ; Wed, 09 Dec 2015 00:12:52 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=datatorrent-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=O5Vh3RlnDo1O2A3sGgFuOH33QxZqG9vbTuHxK6DI8pY=; b=iq/RJBYPreUPZ6iW+LnFad+7IqS15eK2hxSjo5bQTvThGgz0AH84QCp9x+03AhCllW 4OOhxW7qdJvPk+KiMWPrChIMzJedOLCcjyHyOsbG1ka8iOwnjnHqmcAGOtE0DdcFotlu caPBzCIrSVCypm6O/2vZGy8i54Inwo6jnr0mi9fX+r6RG6NIdCoaYM21SRVCCDZW5bEi mRDgcGw8pxzdNh443c5KCAL4C19c1pu+CrRXPX9dR6wL/s+5CT3nxYRSnhJnjoReLmYY lZB0LAgNBP09jXQVSsDi0q38tnndT3CpDmbDWD4dqmT3AUMK7B5z2Tv8cOCzxAUWars/ MQcg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=O5Vh3RlnDo1O2A3sGgFuOH33QxZqG9vbTuHxK6DI8pY=; b=XIx/NPmT0cAXI00LLfUSpg9cy3Sf6yKMhTuSn/Z6qdAH/qsCAZCXTCDFuaSH9yg6pg Ee8X00lejl3naceFuo+LLR+iR8UlIRR0Y/ONVYGPli0zMzUgfXX7D8X3x9VuhtlYb/iT jWwo+i2tYy1TpSYHNgnknhE6/GXyYyrDUesT2TqxrUaRlf/tvvTWupo8Jh5+OTsOrN0G o/tXhRF3Ea57dwYRZEWhmci82Hyw8cZrForL6TYdq8i6IFS9Kdb3rKeJ+VIBqkM+ySlf m0UjfvxMiExCI9+OaqyXAz04Lb0b3ijcYloEHVHx8MKEykcM6b7Xek02DzHGaQNr26Oq YEMA== X-Gm-Message-State: ALoCoQk2rEaLF+XIcGWfythXhoW11kyGebeDdKxYspAvAy4Famz9XhegbdLUvm0EFtvnxSfddhhTf1Vc4Mx3/PWZuVqd7Tj6+2TnLE5fCx6mUJxabrySv2U= MIME-Version: 1.0 X-Received: by 10.194.174.73 with SMTP id bq9mr4356203wjc.115.1449648771962; Wed, 09 Dec 2015 00:12:51 -0800 (PST) Received: by 10.28.127.209 with HTTP; Wed, 9 Dec 2015 00:12:51 -0800 (PST) In-Reply-To: References: Date: Wed, 9 Dec 2015 00:12:51 -0800 Message-ID: Subject: Re: BloomFilter in Malhar From: Chandni Singh To: dev@apex.incubator.apache.org Content-Type: multipart/alternative; boundary=089e013d1a5e1a32b8052672a810 --089e013d1a5e1a32b8052672a810 Content-Type: text/plain; charset=UTF-8 Chaitanya, I believe you have an implementation of BloomFilter in your folk. Do you think that can be added to Malhar? Chandni On Tue, Dec 8, 2015 at 9:02 PM, David Yan wrote: > Bloom Filter, MinHash, and HyperLogLog are some of the commonly used > algorithms in Big Data. I think having them in the Malhar library would be > a good idea. > > There's a ticket for HyperLogLog created long time ago: > https://malhar.atlassian.net/browse/MLHR-1822 > > On Tue, Dec 8, 2015 at 5:42 PM, Chandni Singh > wrote: > > > Hi, > > > > We need to add a BloomFilter implementation in Malhar. ManagedState has a > > use for it and I am pretty sure we will come up more and more use cases > > that will need it. Tim's suggestion on Spill-able/Spooled data structures > > may use it too. > > > > Chandni > > > --089e013d1a5e1a32b8052672a810--