lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <>
Subject Re: Lucene rich-text search with returned hyperlinks
Date Mon, 11 May 2009 04:19:18 GMT
Yes.  This can be done using Lucene.

But, this is subject to a few liberal interpretations of what you asked
for.  To wit, I am assuming that you want to find interesting documents from
a bunch of documents, not just search a single document for matches.

The span queries that another poster mentioned would be good as would sloppy
phrase queries.

Depending on which European languages you need to handle, there may be some
work you need to do to deal with morphological analysis.  Lucene has
reasonable support for English and somewhat more rudimentary support for a
few other European languages.  Support for Asian languages is very basic at

On Sun, May 10, 2009 at 7:43 PM, apgw <> wrote:

> I am new to Lucene. Is this the right utility to use for the following use
> case:
> 1) Find a search term - eg. 'lithium battery' in some technical rich-text
> data (can be in any european language), 4K - 64K size, and return the exact
> position in the text so that the occurrence can be turned into a hyperlink
> within the text, and the full text returned to the user with the embedded
> hyperlinks which he can select if he is interested.
> 2) Also find and hyperlink "lithium batteries", or "lithium hydride
> batteries" (with lower ranking) and so on.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message