lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amir Hossein Jadidinejad <amir.jad...@yahoo.com>
Subject Re: Controlled Indexing -New Feature
Date Sat, 29 Nov 2008 07:03:10 GMT
Yes, It's good to filter words but suppose that my controlled vocabulary contains synonyms
and etc.Exactly, I want to supervise on indexing process.

Thank you very much.


________________________________
From: German Kondolf <german.kondolf@gmail.com>
To: java-user@lucene.apache.org
Sent: Friday, November 28, 2008 7:38:34 PM
Subject: Re: Controlled Indexing -New Feature

You could use a "reverse" stop-word filter.

The straight "StopFilter" actually removes the keywords that match with a
given Set of words, you could do the reverse logic of that an remove ALL
keywords that doesn't match that Set.

Take a look at StopFilter and StandardAnalyzer ;)

On Fri, Nov 28, 2008 at 4:38 PM, Amir Hossein Jadidinejad <
amir.jadidi@yahoo.com> wrote:

> Hi,
> I'm going to index some documents only with known phrases. Let me describe:
> Suppose that I have a controlled vocabulary(a list of some known
> phrases/words). I intend to
> index ONLY these phrases within my documents and have a retrieval model
> (for example simple VS-TF.IDF). In way, each index item is one of the
> predefined
> phrase.
> Is it possible to handle this situation with Lucene?
> What is the best way?!
> Greatly appreciate any comments or answers.
> Kind regards,
>
>
>
>



      
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message