lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Daniel Naber <daniel.na...@t-online.de>
Subject Re: demo IndexHTML parser breaks unicode?
Date Fri, 24 Sep 2004 19:17:28 GMT
On Friday 24 September 2004 19:58, Fred Toth wrote:

> I've got unicode in my source HTML. In particular, within meta tags,
> and it's getting broken by the indexer. Note that I'm not trying to
> query on any of this, just store and retrieve document titles with
> unicode characters.

Please try again with the code from CVS, Christoph Goller committed a fix 
for this problem (at least I think it was this problem) 1-3 weeks ago.

Regards
 Daniel

-- 
http://www.danielnaber.de

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message