lucene-java-user mailing list archives

From "hongyinjie"
Subject Re: "or" as a search term
Date Thu, 13 Jan 2011 05:50:59 GMT
Use a TokenStream to print the tokens your FrenchAnalyzer produces. That will tell you
whether the analyzer works correctly and whether "or" comes out the way you expect.

You can also use another tool, Luke, to inspect the Lucene index files and check whether
the index contains a token for the French word "or". You can run queries from within Luke
as well.
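
As a rough illustration of the stop word issue Robert describes in the quoted message
below, here is a plain-Java sketch with no Lucene dependency. The class StopWordCheck,
the tokens method and the ENGLISH_SAMPLE set are hypothetical names made up for this
example. The set holds only a handful of words of the kind found in an English stop
list, and the tokenization is a crude stand-in for a real analyzer, so treat it as a
way to see the effect rather than as what Lucene does internally.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class StopWordCheck {

    // Hypothetical sample: a few words typical of an English stop list.
    // Note that it contains "or".
    static final Set<String> ENGLISH_SAMPLE =
            new HashSet<>(Arrays.asList("a", "and", "the", "of", "or", "to"));

    // Lowercase the text, split on anything that is not a letter or digit,
    // and drop every token found in stopWords.
    static List<String> tokens(String text, Set<String> stopWords) {
        List<String> out = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty() && !stopWords.contains(t)) {
                out.add(t);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String text = "Le prix de l'or monte";

        // With English stop words the French word "or" is silently dropped.
        System.out.println(tokens(text, ENGLISH_SAMPLE));
        // prints [le, prix, de, l, monte]

        // With an empty stop word set it is kept.
        System.out.println(tokens(text, Collections.<String>emptySet()));
        // prints [le, prix, de, l, or, monte]
    }
}
```

If the token list printed from your real analyzer looks like the first line, a stop
filter is still active somewhere in the analysis chain, either at index time or at
query time.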

Good Luck


Yinjie Hong
Ph. D. Student
College of Computer Science, Zhejiang University
Tel:       86-571-87952026
E-Mail: hongyj(at)zju(dot)edu(dot)cn
Office:  Room 400, Teaching Building #11, Yuquan Campus, Zhejiang University

Thank you for your reply. 

I am using my own FrenchAnalyzer for lexical analysis.  It extends  
org.apache.lucene.analysis.Analyzer and my stopwords set is empty. 


On 2011-01-12 23:05, Robert Muir wrote: 
> On Wed, Jan 12, 2011 at 10:38 PM, Benoit Mercier wrote: 
>> Hi, 
>> I have been happily using Lucene for several years to offer French lexical analysis 
>> tools to university researchers.   Today, one of them decided to analyze the 
>> use of the French word "or" (meaning "gold" in French) in one of my corpora 
>> powered by Lucene...  And, as you probably already guessed, no results... 
> What analyzer are you using? 
> By default, StandardAnalyzer and StopAnalyzer use a set of English 
> stopwords. For French, this list is probably not appropriate. 
> If you look at the javadocs, you can pass in your own set of 
> stopwords... for lexical analysis maybe this should be an empty set. 
> --------------------------------------------------------------------- 
> To unsubscribe, e-mail: 
> For additional commands, e-mail: 
