lucene-dev mailing list archives

From bugzi...@apache.org
Subject DO NOT REPLY [Bug 19253] - [PATCH] HTML parser should treat <td> as a word break element
Date Wed, 26 Nov 2003 17:01:46 GMT
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=19253>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=19253

[PATCH] HTML parser should treat <td> as a word break element

------- Additional Comments From konradk@ca.ibm.com  2003-11-26 17:01 -------
Thanks for fixing this bug.

The problem also occurs with closing tags.  Could a small change be made so
that the set in the Tags class also contains "</h1" - "</h5", "</p" ... and
therefore treats closing tags as word breaks by default?  Should I open a
separate bug?
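
A rough sketch of what the request above might look like, assuming the
parser looks up lower-cased tag prefixes such as "<td" in a set.  The class
name WordBreakTags, the field WS_TAGS, the method isWordBreak, and the
element list are all hypothetical; this is not the actual Tags class from
the Lucene demo.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper for illustration only; not the Lucene demo Tags class.
final class WordBreakTags {

    // Element names assumed to separate words. The list is illustrative.
    private static final String[] ELEMENTS = {
        "h1", "h2", "h3", "h4", "h5", "p", "td", "br", "hr", "div", "li"
    };

    // Holds both the opening ("<td") and closing ("</td") prefix of each element.
    static final Set<String> WS_TAGS = build();

    private static Set<String> build() {
        Set<String> tags = new HashSet<>();
        for (String name : ELEMENTS) {
            tags.add("<" + name);   // opening tag prefix
            tags.add("</" + name);  // closing tag prefix, the case raised in this comment
        }
        return Collections.unmodifiableSet(tags);
    }

    // Returns true if the given tag prefix should be treated as a word break.
    static boolean isWordBreak(String tagPrefix) {
        return WS_TAGS.contains(tagPrefix.toLowerCase(Locale.ROOT));
    }
}
```

Generating the closing prefixes from the same list of element names keeps
the two forms in sync, so a newly added element gets both entries.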

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org

