lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "German Kondolf" <german.kond...@gmail.com>
Subject Re: Controlled Indexing -New Feature
Date Sat, 29 Nov 2008 00:38:34 GMT
You could use a "reverse" stop-word filter.

The straight "StopFilter" actually removes the keywords that match with a
given Set of words, you could do the reverse logic of that an remove ALL
keywords that doesn't match that Set.

Take a look at StopFilter and StandardAnalyzer ;)

On Fri, Nov 28, 2008 at 4:38 PM, Amir Hossein Jadidinejad <
amir.jadidi@yahoo.com> wrote:

> Hi,
> I'm going to index some documents only with known phrases. Let me describe:
> Suppose that I have a controlled vocabulary(a list of some known
> phrases/words). I intend to
> index ONLY these phrases within my documents and have a retrieval model
> (for example simple VS-TF.IDF). In way, each index item is one of the
> predefined
> phrase.
> Is it possible to handle this situation with Lucene?
> What is the best way?!
> Greatly appreciate any comments or answers.
> Kind regards,
>
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message