lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anders Nielsen" <and...@visator.dk>
Subject RE: Token retrieval question
Date Fri, 12 Oct 2001 08:50:05 GMT
Can't you just keep 2 fields, one with the stemmed version of the text used
for indexing purposes (index but not stored) and a second field with the
original text (un-indexed but stored). Then when you know you got a match on
the nth term in the stemmed version, you can use the same Analyzer but
without the stemming on the stored text field, and take the nth term from
that?

The only trouble I can see with that is if the stemmer either skips terms or
makes two terms into one.

regards,
Anders Nielsen

-----Original Message-----
From: Alex Murzaku [mailto:murzaku@earthlink.net]
Sent: 12. oktober 2001 03:44
To: lucene-dev@jakarta.apache.org
Subject: RE: Token retrieval question


Mime
View raw message