lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ken Krugler <kkrugler_li...@transpac.com>
Subject Re: N-dimensional Point Indexing
Date Thu, 18 Oct 2018 00:32:24 GMT
I’ve been looking at directly storing feature vectors and providing scoring/filtering support.

This is for vectors consisting of (typically 300 - 2048) floats or doubles.

It’s following the same pattern as geospatial support - so a new field type and query/parser,
plus plumbing to hook it into Solr.

Before I go much further, is there anything like this already done, or in the works?

Thanks,

— Ken


> On Feb 26, 2018, at 4:24 PM, Luís Filipe Nassif <lfcnassif@gmail.com> wrote:
> 
> Thank you, Adrian.
> 
> Em 26 de fev de 2018 21:19, "Adrien Grand" <jpountz@gmail.com> escreveu:
> 
>> Yes it is.
>> 
>> Le mar. 27 févr. 2018 à 00:03, Luís Filipe Nassif <lfcnassif@gmail.com>
a
>> écrit :
>> 
>>> Hi Lucene community,
>>> 
>>> Is BinaryPoint limited up to 8 dimensions?
>>> 
>>> Thanks,
>>> Luis
>>> 
>>> Em 6 de fev de 2018 16:07, "Luís Filipe Nassif" <lfcnassif@gmail.com>
>>> escreveu:
>>> 
>>> Is it limited up to 8 dimensions as described at
>>> https://www.elastic.co/blog/lucene-points-6.0?
>>> 
>>> 2018-02-06 15:35 GMT-02:00 Luís Filipe Nassif <lfcnassif@gmail.com>:
>>> 
>>>> Sorry, I was looking at the wrong place. Should I use BinaryPoint (
>>>> https://lucene.apache.org/core/6_0_0/core/org/apache/lucene
>>>> /document/BinaryPoint.html) ?
>>>> 
>>>> 2018-02-06 14:17 GMT-02:00 Luís Filipe Nassif <lfcnassif@gmail.com>:
>>>> 
>>>>> Hi all,
>>>>> 
>>>>> Lucene is able to index generic n-dimensional points for efficient
>>>>> similarity or nearest neightbors search? I have looked at spatial
>>> package
>>>>> in the past but seems it is specific to geo points? The use case is to
>>>>> index image feature vectors to search for similar images in a corpus.
>>>>> 
>>>>> Currently we are using lucene to text search and we would like to not
>>>>> have to manage two different index structures, synchronize commits, so
>>> on.
>>>>> 
>>>>> Thank you,
>>>>> Luis Nassif

--------------------------
Ken Krugler
+1 530-210-6378
http://www.scaleunlimited.com
Custom big data solutions & training
Flink, Solr, Hadoop, Cascading & Cassandra


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message