camel-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ramon Buckland" <ra...@thebuckland.com>
Subject Re: [jira] Commented: (CAMEL-1184) Add a new Dataformat - tidyMarkup - which allows us to unmarshal bad HTML to good (XML) Html.
Date Thu, 11 Dec 2008 14:22:40 GMT
BTW James,

Good call in suggesting tidyMarkup (instead of my original wellFormedHtml)
As it turns out, a nasty sample file I found (with TagSoup) was not what one
could consider HTML at all

http://home.ccil.org/~cowan/XML/tagsoup/extreme.html

This comes out to well formed XML just fine. (I wasn't surprised of course,
but the name then suited the results :-)

(needless to say I added this to the test case)
r.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message