lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject Re: How extract a Field.Text(String, String) field to process it with a Stylesheet?
Date Fri, 15 Oct 2004 13:33:48 GMT
That's true, sorry for the confusion.  The original text is stored
verbatim.

Otis

--- Morus Walter <morus.walter@tanto.de> wrote:

> Otis Gospodnetic writes:
> > That's likely because you used an Analyzer that stripped the XML
> (<, >,
> > etc.) from the original text.  If you want to preserve the original
> > text, use an Analyzer that doesn't throw your XML away.  You can
> write
> > your own Analyzer that doesn't discard anything, for instance.
> > 
> An analyzer doesn't change the stored content. Only the indexed
> tokens.
> So if something threw away the tags (or just the spectial characters)
> it
> must have been before Field.Text(String, String) was called.
> This of course wouldn't be surprising, since indexing xml often means
> to extract the text from an xml document and index that text.
> 
> Morus
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message