lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Keith Gunn <kg...@csd.abdn.ac.uk>
Subject problems with HTML Parser
Date Wed, 14 Aug 2002 16:46:52 GMT
Has anyone noticed that the HTML Parser that comes with
Lucene joins terms together when parsing a file.
I used to think it was my PDFParser but after fixing that
I found out it was the HMTLParser.

I managed to find a replacement parser that doesn't join terms.

Just wondered if anyone had come across this problem??




--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message