lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Steven Rowe <sar...@syr.edu>
Subject Re: using a french specific analyser without stemming
Date Mon, 04 Jun 2007 15:52:51 GMT
Bonjour Mathieu,

Mathieu Lecarme wrote:
> For a project with a lot ofLucene search (via Compass), I had some
> troubles with stemming. Stemming is nice for enlarge search range, but
> make completion strange.
> So FrenchAnalyzer was not usable. A simpler StandardAnalyzer makes the
> job right, except for some french speciality, like elision. In french
> "the plane" is translated by "l'avion" and not "le avion", and the
> StandardTokenizer, used by StandardFilter can't tokenize it right. So, I
> make a specific filter (ElisionFilter), how can I give it to Lucene?
> With a Jira ticket, with the mailing list?

Here's a good place to start:

   http://wiki.apache.org/jakarta-lucene/HowToContribute

FYI, 99% of modifications/additions to Lucene begin life as JIRA issues.

-- 
Steve Rowe
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message