Return-Path: Delivered-To: apmail-xml-cocoon-dev-archive@xml.apache.org Received: (qmail 18722 invoked by uid 500); 14 May 2002 17:41:00 -0000 Mailing-List: contact cocoon-dev-help@xml.apache.org; run by ezmlm Precedence: bulk list-help: list-unsubscribe: list-post: Reply-To: cocoon-dev@xml.apache.org Delivered-To: mailing list cocoon-dev@xml.apache.org Received: (qmail 18711 invoked from network); 14 May 2002 17:40:59 -0000 Message-ID: <00cb01c1fb6e$10c5fe30$670004c0@PC103> Reply-To: "Nicola Ken Barozzi" From: "Nicola Ken Barozzi" To: References: <1489301c1fb41$21666810$9d0f078d@cyborg> Subject: Re: NekoHTML in Cocoon.... Date: Tue, 14 May 2002 19:37:48 +0200 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2600.0000 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000 X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Yes, it should be faster. Also, I had to "patch" JTidy because it wasn't cleaning HTML correctly with duplicate attributes. I submitted a bug report to them, and they told me to wait the next C version to be ported to Java. This says it all... -- Nicola Ken Barozzi nicolaken@apache.org - verba volant, scripta manent - (discussions get forgotten, just code remains) --------------------------------------------------------------------- ----- Original Message ----- From: "J�rn Heid" To: Sent: Tuesday, May 14, 2002 2:16 PM Subject: AW: NekoHTML in Cocoon.... I haven't tested Tidy yet. From Sourceforge I got "JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, JTidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM parser for real-world HTML.". So is Tidy just DOM based? If so, and Necko supports SAX, Necko should be faster... -----Urspr�ngliche Nachricht----- Von: Reinhard P�tz [mailto:reinhard_poetz@gmx.net] Gesendet: Dienstag, 14. Mai 2002 14:07 An: cocoon-dev@xml.apache.org Betreff: RE: NekoHTML in Cocoon.... What are there any differences to HTML Tidy (speed, functionality)? Reinhard > -----Original Message----- > From: J�rn Heid [mailto:heid@fh-heilbronn.de] > Sent: Tuesday, May 14, 2002 1:27 PM > To: cocoon-dev@xml.apache.org > Subject: NekoHTML in Cocoon.... > > > > Necko is an HTML parser based on Xerces who can parse 'normal' HTML (not > XHTML) and prodcues pure XML (SAX). > http://www.apache.org/~andyc/nekohtml/doc/index.html > > Just thinking if it could be usefull... (I haven't tried it yet ;). > > One possibible use case would be the ability for Cocoon developers to > use old html files and change them with XSLT. E.g. including news pages > not based on XML in Cocoon, filtering information from external pages > and so on. > > What do you think? I will ask Andy Clark for permission if you think it > would be usefull for Cocoon. > > JOERN > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org > For additional commands, email: cocoon-dev-help@xml.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org For additional commands, email: cocoon-dev-help@xml.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org For additional commands, email: cocoon-dev-help@xml.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: cocoon-dev-unsubscribe@xml.apache.org For additional commands, email: cocoon-dev-help@xml.apache.org