lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From jim ferenczi <jim.feren...@gmail.com>
Subject Re: Bloom filter
Date Wed, 30 Jul 2014 06:55:01 GMT
Hi Per,
First of all the BloomFilter implementation in Lucene is not exactly a
bloom filter. It uses only one hash function and you cannot set the false
positive ratio beforehand. ElasticSearch has its own bloom filter
implementation (using "guava like" BloomFilter), you should take a look at
their implementation if you really need this feature.
What is your use-case ? If your index fits in RAM the bloom filter won't
help (and it may have a negative impact if you have a lot of segments). In
fact the only use case where the bloom filter can help is when your term
dictionary does not fit in RAM which is rarely the case.

Regards,
Jim



2014-07-28 16:13 GMT+02:00 Per Steffensen <steff@designware.dk>:

> Yes I found that one, along with SOLR-3950. Well at least it seems like
> the support is there in Lucene. I will figure out myself how to make it
> work via Solr, the way I need it to work. My use-case is not as specified
> in SOLR-1375, but the solution might be the same. Any input is of course
> still very much appreciated.
>
> Regards, Per Steffensen
>
>
> On 28/07/14 15:42, Lukas Drbal wrote:
>
>> Hi Per,
>>
>> link to jira - https://issues.apache.org/jira/browse/SOLR-1375 Unresolved
>> ;-)
>>
>> L.
>>
>>
>> On Mon, Jul 28, 2014 at 1:17 PM, Per Steffensen <steff@designware.dk>
>> wrote:
>>
>>  Hi
>>>
>>> Where can I find documentation on how to use Bloom filters in Solr (4.4).
>>> http://wiki.apache.org/solr/BloomIndexComponent seems to be outdated -
>>> there is no BloomIndexComponent included in 4.4 code.
>>>
>>> Regards, Per Steffensen
>>>
>>>
>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message