lucene-java-user mailing list archives

From Maurits van Wijland <m.vanwijl...@quicknet.nl>
Subject Re: problems with HTML Parser
Date Thu, 15 Aug 2002 01:56:28 GMT
Keith,

I haven't noticed the problem with the parser, but you caught my
attention by mentioning that you have a PDFParser!

Would you be able to contribute this PDFParser?

Maurits.
----- Original Message -----
From: "Keith Gunn" <kgunn@csd.abdn.ac.uk>
To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Sent: Wednesday, August 14, 2002 9:46 AM
Subject: problems with HTML Parser


> Has anyone noticed that the HTML Parser that comes with
> Lucene joins terms together when parsing a file?
> I used to think it was my PDFParser, but after fixing that
> I found out it was the HTMLParser.
>
> I managed to find a replacement parser that doesn't join terms.
>
> Just wondered if anyone had come across this problem.
>
>
>
>
> --
> To unsubscribe, e-mail: <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>
>



