lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: Extracting contact data
Date Thu, 14 Jan 2010 13:18:27 GMT
Ooooh boy, glad I asked the question because I was thinking in terms
of real-world locations of the addresses <G>, so nothing I would
have written would have had any relevance whatsoever......

Erick

On Wed, Jan 13, 2010 at 12:05 PM, Erick Erickson <erickerickson@gmail.com>wrote:

> Before answering, how to you measure "proximity"? You can make
> Lucene work with locations (there's an example in Lucene In Action)
> readily enough though....
>
> HTH
> Erick
>
>
> On Wed, Jan 13, 2010 at 11:39 AM, Ortelli, Gian Luca <
> gianluca.ortelli@truvo.com> wrote:
>
>> Hi community,
>>
>>
>>
>> I have a general understanding of Lucene concepts, and I'm wondering if
>> it's the right tool for my job:
>>
>>
>>
>> - I need to extract data like e.g. time intervals ("8am - 12pm"), street
>> addresses from a set of files. The common issue with this data unit is
>> that they contain spaces and are not always definable through regexes.
>>
>>
>>
>> - the extraction must take into consideration the "proximity": for
>> example, a mail address which is close to the work "Contacts" will
>> receive a higher rank, since I'm looking for contact data.
>>
>>
>>
>> Do you think I can get any advantage from building a solution on Lucene?
>>
>>
>>
>>  Gianluca
>>
>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message