apex-dev mailing list archives

From David Yan <da...@datatorrent.com>
Subject Re: BloomFilter in Malhar
Date Wed, 09 Dec 2015 05:02:38 GMT
Bloom Filter, MinHash, and HyperLogLog are some of the commonly used
algorithms in Big Data.  I think having them in the Malhar library would be
a good idea.

There's a ticket for HyperLogLog that was created a long time ago:

On Tue, Dec 8, 2015 at 5:42 PM, Chandni Singh <chandni@datatorrent.com> wrote:

> Hi,
> We need to add a BloomFilter implementation in Malhar. ManagedState has a
> use for it and I am pretty sure we will come up with more and more use cases
> that will need it. Tim's suggestion on Spill-able/Spooled data structures
> may use it too.
> Chandni
