abdera-dev mailing list archives

From "Steven Isaacson" <spisa...@gmail.com>
Subject Re: Removing invalid characters from XML
Date Thu, 05 Oct 2006 19:58:36 GMT
I will do some more in-depth testing to see exactly what is occurring, but it
looks like a ^L (form feed, U+000C) control character in an HTML file caused
my XML document to not be well-formed.
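
For reference, here is a minimal sketch of the kind of filter in question. It
assumes that "invalid" means any code point outside the XML 1.0 Char
production (tab, LF, CR, U+0020-U+D7FF, U+E000-U+FFFD, U+10000-U+10FFFF),
which excludes ^L. The class and method names are hypothetical; this is not
code from Abdera, Axiom or StAX.

```java
// Hypothetical helper, not part of Abdera or its dependencies.
final class XmlCharFilter {
    private XmlCharFilter() {}

    // True if the code point is allowed by the XML 1.0 "Char" production.
    static boolean isValidXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Returns a copy of the input with every disallowed code point dropped.
    // Unpaired surrogates are dropped too, since they fall outside the ranges.
    static String stripInvalidXmlChars(String in) {
        if (in == null) {
            return null;
        }
        StringBuilder out = new StringBuilder(in.length());
        for (int i = 0; i < in.length(); ) {
            int cp = in.codePointAt(i);
            if (isValidXmlChar(cp)) {
                out.appendCodePoint(cp);
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }
}
```

The idea would be to run the HTML through stripInvalidXmlChars(...) before
handing it to entry.setContent(...), leaving the escaping of < and > to the
serializer. Dropping the characters is one choice; replacing them with a space
may be preferable if they act as separators in the source text.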

-steve

On 10/5/06, James M Snell <jasnell@gmail.com> wrote:
>
> Axiom and StAX should be doing all the escaping you need.  For instance,
> if you specify entry.setContent("<foo>"); it should escape the < and >
> appropriately.  I haven't tested it on any control characters yet.
>
> - James
>
> Steven Isaacson wrote:
> > Is there any utility code provided to remove invalid characters from text,
> > HTML or XHTML in the content of an entry? Either in Abdera or in a
> > dependency. I know it isn't that hard, but it would be nice if I didn't have
> > to write it yet again by myself.
> >
> > Currently I have a quick method to escape the HTML, but that doesn't help
> > when control characters are present in the HTML.
> >
> > Thanks,
> > -steve
> >
>
