commons-user mailing list archives

From "James Strachan" <james_strac...@yahoo.co.uk>
Subject Re: digester + DOM
Date Wed, 26 Feb 2003 15:57:12 GMT
Rather than using JTidy to parse the HTML (which builds a DOM), you could use
NekoHTML, which is a SAX parser that can handle HTML. Then you don't need to
use a DOM at all.

NekoHTML plugs right into Digester, allowing you to fire Digester rules
straight from the SAX events produced while parsing the HTML.

http://www.apache.org/~andyc/neko/doc/html/
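
To illustrate what "firing rules from SAX events" means, here is a minimal
sketch that uses only the JDK. It is not Digester's API: the names
SaxRuleSketch, LinkHandler and extractLinks are hypothetical, the single
hard-coded "rule" stands in for a Digester pattern, and the JDK's own SAX
parser (fed well-formed XHTML) stands in for NekoHTML. With NekoHTML on the
classpath, the XMLReader below would presumably be NekoHTML's SAX parser
instead.

```java
import java.io.StringReader;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

class SaxRuleSketch {

    /** Tracks the current element path and fires one rule when it matches. */
    static class LinkHandler extends DefaultHandler {
        // Hypothetical pattern, written in the slash-separated style Digester uses.
        private static final String PATTERN = "html/body/p/a";

        private final Deque<String> path = new ArrayDeque<>();
        final List<String> hrefs = new ArrayList<>();

        @Override
        public void startElement(String uri, String localName,
                                 String qName, Attributes atts) {
            // Lower-cased so the rule does not depend on how the parser
            // reports element-name case.
            path.addLast(qName.toLowerCase());
            if (PATTERN.equals(String.join("/", path))) {
                String href = atts.getValue("href");
                if (href != null) {
                    hrefs.add(href); // the "rule body": collect the link target
                }
            }
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            path.removeLast();
        }
    }

    /** Parses the markup as a SAX event stream; no DOM tree is ever built. */
    static List<String> extractLinks(String xhtml) {
        try {
            // Stand-in parser: the JDK's XMLReader, which needs well-formed input.
            XMLReader reader =
                SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            LinkHandler handler = new LinkHandler();
            reader.setContentHandler(handler);
            reader.parse(new InputSource(new StringReader(xhtml)));
            return handler.hrefs;
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse input", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body>"
            + "<p><a href=\"a.html\">A</a></p>"
            + "<div><a href=\"skip.html\">S</a></div>"
            + "<p><a href=\"b.html\">B</a></p>"
            + "</body></html>";
        System.out.println(extractLinks(page)); // prints [a.html, b.html]
    }
}
```

The link inside the div is skipped because its path is html/body/div/a, which
does not match the pattern. Digester does the same kind of path matching, but
lets you register many patterns and attach object-creation and property-setting
rules to each.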

James
-------
http://radio.weblogs.com/0112098/
----- Original Message -----
From: "Balazs Somogyi" <balazs.somogyi@FATHOMTECHNOLOGY.com>
To: <commons-user@jakarta.apache.org>
Sent: Wednesday, February 26, 2003 2:34 PM
Subject: digester + DOM


Hi,

Is it possible to feed Digester an already parsed XML document (actually
XHTML)?
I'm using JTidy to parse HTML and would like to extract some of its
elements, but I don't want to traverse the tree manually.

Thanks in advance for your help,
Balazs
