lucene-java-user mailing list archives

From Erick Erickson <erickerick...@gmail.com>
Subject Re: Extracting contact data
Date Wed, 13 Jan 2010 17:05:58 GMT
Before answering: how do you measure "proximity"? You can make
Lucene work with locations readily enough, though (there's an
example in Lucene in Action).

HTH
Erick

On Wed, Jan 13, 2010 at 11:39 AM, Ortelli, Gian Luca <
gianluca.ortelli@truvo.com> wrote:

> Hi community,
>
>
>
> I have a general understanding of Lucene concepts, and I'm wondering if
> it's the right tool for my job:
>
>
>
> - I need to extract data such as time intervals ("8am - 12pm") and street
> addresses from a set of files. The common issue with these data units is
> that they contain spaces and are not always definable through regexes.
>
>
>
> - the extraction must take "proximity" into consideration: for
> example, a mail address which is close to the word "Contacts" will
> receive a higher rank, since I'm looking for contact data.
>
>
>
> Do you think I can get any advantage from building a solution on Lucene?
>
>
>
>  Gianluca
>
>
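
One possible answer to the "how do you measure proximity" question is token distance: count the whitespace-separated tokens between a candidate and the nearest occurrence of the keyword, and rank closer candidates higher. The sketch below is plain Java and does not use the Lucene API. It assumes "mail address" means e-mail address, and the class name `ProximityRank`, the loose `EMAIL` pattern, and the `1 / (1 + distance)` scoring formula are all hypothetical choices made for illustration. Lucene's own proximity features (phrase slop, span queries) rest on a similar notion of token positions.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class ProximityRank {
    // Hypothetical, deliberately loose e-mail pattern; illustration only.
    private static final Pattern EMAIL =
            Pattern.compile("[\\w.+-]+@[\\w-]+(\\.[\\w-]+)+");

    // Assumed scoring: 1 / (1 + token distance to the nearest keyword),
    // or 0 when the keyword does not occur at all.
    static double score(String[] tokens, int candidateIndex, String keyword) {
        int best = -1;
        for (int i = 0; i < tokens.length; i++) {
            // Strip leading/trailing punctuation so "Contacts:" matches "Contacts".
            String bare = tokens[i].replaceAll("^\\W+|\\W+$", "");
            if (bare.equalsIgnoreCase(keyword)) {
                int distance = Math.abs(i - candidateIndex);
                if (best < 0 || distance < best) {
                    best = distance;
                }
            }
        }
        return best < 0 ? 0.0 : 1.0 / (1 + best);
    }

    // Finds e-mail-like tokens and returns them sorted by descending score.
    static List<Map.Entry<String, Double>> rank(String text, String keyword) {
        String[] tokens = text.trim().split("\\s+");
        List<Map.Entry<String, Double>> out = new ArrayList<>();
        for (int i = 0; i < tokens.length; i++) {
            Matcher m = EMAIL.matcher(tokens[i]);
            if (m.find()) {
                out.add(new AbstractMap.SimpleEntry<>(
                        m.group(), score(tokens, i, keyword)));
            }
        }
        out.sort((a, b) -> Double.compare(b.getValue(), a.getValue()));
        return out;
    }

    public static void main(String[] args) {
        String text = "Contacts: sales@example.com Our office was founded "
                + "long ago by webmaster@example.org";
        for (Map.Entry<String, Double> e : rank(text, "Contacts")) {
            System.out.println(e.getKey() + " " + e.getValue());
        }
    }
}
```

Token distance is only one definition. Character offsets, same-line or same-paragraph membership, or document layout would give different rankings, which is why the definition needs to be settled first.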
