Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 63F93186BE for ; Thu, 6 Aug 2015 08:49:13 +0000 (UTC) Received: (qmail 50987 invoked by uid 500); 6 Aug 2015 08:49:11 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 50928 invoked by uid 500); 6 Aug 2015 08:49:11 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 50915 invoked by uid 99); 6 Aug 2015 08:49:11 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Aug 2015 08:49:11 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 155251A98D0 for ; Thu, 6 Aug 2015 08:49:11 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.9 X-Spam-Level: ** X-Spam-Status: No, score=2.9 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id sv_t-FOFGCMm for ; Thu, 6 Aug 2015 08:49:06 +0000 (UTC) Received: from mail-wi0-f179.google.com (mail-wi0-f179.google.com [209.85.212.179]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id CC87820594 for ; Thu, 6 Aug 2015 08:49:05 +0000 (UTC) Received: by wicgj17 with SMTP id gj17so13124752wic.1 for ; Thu, 06 Aug 2015 01:48:58 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=KuBme/R/caUg9fL0QpPiaP24OAUTRcd8dGuhJBT00y0=; b=SvT+ujakesJKdA9k9hC20HCuqdP5GgKUWgp8CmNkLOuEaolnnGXGcQKXjo1UY2UAl1 h9A/d2sx1ltwUY5zj55czyRlVKeDfJBdXCVfmIWLUYFcxhYYzkAeopa0KH4L40buYm09 azCn55GLLY1GF0HFxjbK3BLYJakKqGEuUS9ji+RrCAjfDxihJuDKBGOU0YV/bflKX10J 7jYkb1otmALrgolTAl7qyy3PW2ARbuFd8Hb3TifW4icv26S6PHpZ4+ptkg2MQNVEYr+n 2Hje9wcyMeEyyUDNvGJtr5N9Y7bEyRfDxqYnCXpn3TJ+tE47JuXiqHbMrOgEdaV3Mw2t PyTw== MIME-Version: 1.0 X-Received: by 10.194.175.200 with SMTP id cc8mr1161685wjc.87.1438850938163; Thu, 06 Aug 2015 01:48:58 -0700 (PDT) Received: by 10.28.178.129 with HTTP; Thu, 6 Aug 2015 01:48:58 -0700 (PDT) In-Reply-To: References: Date: Thu, 6 Aug 2015 12:48:58 +0400 Message-ID: Subject: Re: How to index & search arrays of double? From: Phaneendra N To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=089e013d04dc0dd7ec051ca097b8 --089e013d04dc0dd7ec051ca097b8 Content-Type: text/plain; charset=UTF-8 Hello Stan, Great question. I come across with one such implementation based on lucene. Its called LIRE . This is an open source project. http://www.lire-project.net/ You might get some ideas there. Please let me know if you find answers to your specific questions there. I'm curious. Thanks Phaneendra On Thu, Aug 6, 2015 at 12:39 PM, Estanislao Oubel < estanislao.oubel@gmail.com> wrote: > Hello everybody, > > I'm currently investigating methods for content-based image retrieval. In > this context, I would like to index documents containing arrays of doubles > and then perform an approximate search based on these arrays. For example, > I would like to insert in the index three documents (d1,d2,d3) containing a > field called feature1, a vector of doubles of dimension 3: > > d1_feature1 = [0.5 1.8 2.4]. > d2_feature1 = [30.1 0 9.1]. > d3_feature1 = [0.6 5.8 2.0]. > > Now, I would like that lucene gives me d1 when I search a document > containing [0.51 1.79 2.41] (because d1 is the closest one according to a > distance L1 for example). > > Is it possible to do this type of things with lucene? More specifically: > 1. Does lucene support arrays of doubles as field type? > 2. Is it possible to search documents based on custom distances between > these arrays? > > If so, can you provide some clues about how to implement it? (fields types > and classes to use, or an example) > > Thanks! > > Stan > --089e013d04dc0dd7ec051ca097b8--