lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Polites" <>
Subject Re: search document for keywords and keyphrases
Date Fri, 11 Aug 2006 10:11:31 GMT
Yes you could use lucene for this, but it may be overkill for your
requirement.  If I understand you correctly, all you need to is find
documents which match "any" of the words in your list?  Do you need to rank
the results?   If not, it's probably easier just to create your own inverted
index of the documents you need to search.  If you actually need to rank
results, then lucene is probably easier as it does this for you.

On 8/11/06, Eugeny N Dzhurinsky <> wrote:
> Hello!
> I have an assigment, which will require to search documents for keywords
> or
> keyphrases.
> For instance, I have a database of keywords/keyphrases, which might
> contain
> several millions items. Now I need to find if document contains any of the
> keywords/phrases listed in that database.
> I was thinking on implementing finite-state machine, and use b-trees, so I
> will iterate document char by char and go down the tree unless I find some
> word or phrase which matches character sequence.
> I think Lucene is doing in the same way when performs searching, so may be
> I
> can use Lucene?
> --
> Eugene N Dzhurinsky
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message