lucene-java-user mailing list archives

From Morus Walter <morus.wal...@tanto.de>
Subject Re: How extract a Field.Text(String, String) field to process it with a Stylesheet?
Date Fri, 15 Oct 2004 12:40:23 GMT
Otis Gospodnetic writes:
> That's likely because you used an Analyzer that stripped the XML (<, >,
> etc.) from the original text.  If you want to preserve the original
> text, use an Analyzer that doesn't throw your XML away.  You can write
> your own Analyzer that doesn't discard anything, for instance.
> 
An analyzer doesn't change the stored content, only the indexed tokens.
So if something threw away the tags (or just the special characters), it
must have happened before Field.Text(String, String) was called.
That wouldn't be surprising, of course, since indexing XML often means
extracting the text from an XML document and indexing only that text.
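
To illustrate the stored/indexed distinction, here is a rough toy model in
plain Java. It does not use Lucene at all: the class StoredVsIndexed, the
analyze() method and ToyField are hypothetical names made up for this sketch,
and the "analyzer" is just a lowercasing split on non-alphanumeric characters.
It only shows the idea that analysis feeds the token list while the stored
value is kept verbatim.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class StoredVsIndexed {

    // Hypothetical stand-in for an analyzer: lowercases the text and splits
    // on anything that is not a letter or digit, so '<', '>' and '/' never
    // end up in a token.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Hypothetical stand-in for a field that is both stored and indexed.
    static final class ToyField {
        final String stored;        // kept exactly as passed in
        final List<String> indexed; // output of the analyzer

        ToyField(String value) {
            this.stored = value;
            this.indexed = analyze(value);
        }
    }

    public static void main(String[] args) {
        ToyField f = new ToyField("<title>Hello World</title>");
        System.out.println("stored:  " + f.stored);
        System.out.println("indexed: " + f.indexed);
    }
}
```

In this model the tags survive in the stored value no matter what analyze()
does. If they are already missing from the stored value, they were removed
from the string before the field was ever created.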

Morus
