lucene-java-user mailing list archives

From "Ernesto De Santis" <ernesto.desan...@colaborativa.net>
Subject Re: spanish stemmer
Date Mon, 23 Aug 2004 19:54:08 GMT
Hello Grant

Thanks for your response.

I have a basic understanding of analyzers. The problem is that I think
words ending in 'bol' need to be stripped.
The stemmer currently does this:

original        ->        generated word
tornillos       ->        tornill

I need:

basquetbol  ->        basquet
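
One way to get this behaviour without changing the stemmer itself is to strip the suffix before the token reaches the stemmer. The sketch below is only an illustration under stated assumptions: the class name BolSuffixStripper and the exception list are hypothetical, it uses plain Java only, and it is not part of Lucene or the Snowball jar. In a real analyzer the same logic would live in a custom token filter placed ahead of the stemming step.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

/** Hypothetical pre-stemming helper; not part of Lucene or Snowball. */
final class BolSuffixStripper {

    // Words ending in "bol" that must be left untouched.
    // Assumed list for illustration; "futbol" comes from the thread ('fut' is not a word).
    private static final Set<String> KEEP =
            new HashSet<>(Arrays.asList("futbol"));

    private static final String SUFFIX = "bol";

    /** Lower-cases the word and removes a trailing "bol" unless the word is an exception. */
    static String strip(String word) {
        String w = word.toLowerCase(Locale.ROOT);
        boolean hasRemainder = w.length() > SUFFIX.length();
        if (hasRemainder && w.endsWith(SUFFIX) && !KEEP.contains(w)) {
            return w.substring(0, w.length() - SUFFIX.length());
        }
        return w;
    }

    public static void main(String[] args) {
        for (String w : new String[] {"basquetbol", "voleybol", "futbol", "tornillos"}) {
            System.out.println(w + " -> " + strip(w));
        }
    }
}
```

With this in front of the stemmer, basquetbol and basquet would reach it as the same token, while futbol is passed through unchanged. The exception list would need to be extended with any other Spanish words that end in 'bol' and should not be shortened.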

Bye, Ernesto.


----- Original Message ----- 
From: "Grant Ingersoll" <GSIngers@syr.edu>
To: <lucene-user@jakarta.apache.org>
Sent: Monday, August 23, 2004 4:09 PM
Subject: Re: spanish stemmer


Ernesto,


http://snowball.tartarus.org/texts/introduction.html might help w/ your
understanding.  The link provides basic info on why stemmers are valuable
(not necessarily any insight on how the Spanish version works).  Of course,
they don't solve every problem and in some cases may make things worse.

A stemmer is not required to return a whole word.

Hope this helps.

>>> ernesto.desantis@colaborativa.net 8/23/2004 9:29:30 AM >>>
Hello

I use the Snowball jar to implement my SpanishAnalyzer. I found that
words ending in 'bol' are not stripped.
For example:

In Spanish, to say basketball, you can say basquet or basquetbol. But for
SpanishStemmer they are different words.
The same goes for voley and voleybol.

This is not the case with futbol (football): we do not say fut for futbol,
and 'fut' doesn't exist in Spanish.

Do you think I am correct?

Can you change this?

Ernesto.




---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


