lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Benoit Mercier <benoit.merc...@member.fsf.org>
Subject Re: "or" as a search term : SOLVED
Date Fri, 14 Jan 2011 03:21:20 GMT
Thank you Robert and Hongyinjie for your support.

I managed to solved my problem.  It was simply a wrong application 
configuration.  I am using PerFieldAnalyzerWrapper, with analysers 
injected via Spring.  A stupid line inversion in a Spring application 
context file that didn't hurd during several years !

Conclusion: Lucene 3.0.3 can index and query without any problem 
reserved words like OR, AND or NOT  if they are not part of the analyzer 
stopWords.

Best regards,

Benoit


On 2011-01-13 00:50, hongyinjie wrote:
> use "TokenStream" to print out the Tokens after using FrenchAnalyzer; test it work well,
and the result is just you want?
>
> You can use another tools "Luke" to see the "Lucene index File", is there any token of
"or"(French) ?
> And you can query using the "Luke"...
>
>
>
> Good Luck
>
>
>
> 2011-01-13
>
>
>
> ---------------------------------
> Yinjie Hong
> Ph. D. Student
> College of Computer Science, Zhejiang University
>
> Tel:       86-571-87952026
> E-Mail: hongyj(at)zju(dot)edu(dot)cn
> Office:  Room 400, Teaching Building #11, Yuquan Campus, Zhejiang University
>
>
>
>
>
>
> Thank you for your reply.
>
> I am using my own FrenchAnalyzer for lexical analysis.  It extends
> org.apache.lucene.analysis.Analyzer and my stopwords set is empty.
>
> Benoit
>
> On 2011-01-12 23:05, Robert Muir wrote:
>> On Wed, Jan 12, 2011 at 10:38 PM, Benoit Mercier
>> <benoit.mercier@member.fsf.org>   wrote:
>>> Hi,
>>>
>>> I am happily using Lucene for several years to offer French lexical analysis
>>> tools to university researchers.   Today, one of them decided to analyze the
>>> use of the French word "or" (meaning "gold" in French) in one of my corpus
>>> powered by Lucene...  And, as you probably already guessed, no results...
>>>
>> What analyzer are you using?
>>
>> By default, StandardAnalyzer and StopAnalyzer uses a set of english
>> stopwords. For french, this list is probably not appropriate.
>> If you look at the javadocs, you can pass in your own set of
>> stopwords... for lexical analysis maybe this should be an empty set.
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message