Return-Path: Delivered-To: apmail-incubator-abdera-dev-archive@locus.apache.org Received: (qmail 23079 invoked from network); 5 Oct 2006 19:58:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 5 Oct 2006 19:58:49 -0000 Received: (qmail 94928 invoked by uid 500); 5 Oct 2006 19:58:48 -0000 Delivered-To: apmail-incubator-abdera-dev-archive@incubator.apache.org Received: (qmail 94905 invoked by uid 500); 5 Oct 2006 19:58:48 -0000 Mailing-List: contact abdera-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: abdera-dev@incubator.apache.org Delivered-To: mailing list abdera-dev@incubator.apache.org Received: (qmail 94896 invoked by uid 99); 5 Oct 2006 19:58:48 -0000 Received: from idunn.apache.osuosl.org (HELO idunn.apache.osuosl.org) (140.211.166.84) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 Oct 2006 12:58:48 -0700 Authentication-Results: idunn.apache.osuosl.org header.from=spisaacs@gmail.com; domainkeys=good X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=DNS_FROM_RFC_ABUSE,HTML_MESSAGE DomainKey-Status: good X-DomainKeys: Ecelerity dk_validate implementing draft-delany-domainkeys-base-01 Received: from [66.249.92.170] ([66.249.92.170:59964] helo=ug-out-1314.google.com) by idunn.apache.osuosl.org (ecelerity 2.1.1.8 r(12930)) with ESMTP id BD/79-04543-3F365254 for ; Thu, 05 Oct 2006 12:58:44 -0700 Received: by ug-out-1314.google.com with SMTP id y2so322031uge for ; Thu, 05 Oct 2006 12:58:37 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=kHI/imJfQQHyTfS3JaCCt+2NpNKlq5gFpMHvW1TSZVEA0RarIKduI1h1tZGhGDSL9SeoMUDb3RrUo9kwnzCcbJAhZOP/1kdrrzUyVBshEx8G6YJ7cckKnM8vbt9PgJBC7pPs4P82Lgk4Frftqv6uedZWH1t0ukCmbi8PSiVQygg= Received: by 10.78.201.15 with SMTP id y15mr853770huf; Thu, 05 Oct 2006 12:58:36 -0700 (PDT) Received: by 10.78.122.14 with HTTP; Thu, 5 Oct 2006 12:58:36 -0700 (PDT) Message-ID: <4933a880610051258t3dd5a881le20f913276126fe9@mail.gmail.com> Date: Thu, 5 Oct 2006 15:58:36 -0400 From: "Steven Isaacson" To: abdera-dev@incubator.apache.org Subject: Re: Removing invalid characters from XML In-Reply-To: <4525617F.1080401@gmail.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_28194_21222471.1160078316579" References: <4933a880610051244p5ae9e4a0y4f43b8838864bf7f@mail.gmail.com> <4525617F.1080401@gmail.com> X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_28194_21222471.1160078316579 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline I will do some more in depth testing to see exactly what is occuring but it looks like a ^L control character that was in an HTML file made my XML document to not be well formed. -steve On 10/5/06, James M Snell wrote: > > Axiom and Stax should be doing all the escaping you need. For instance, > if you specify entry.setContent(""); it should escape the < and > > appropriately. I haven't tested it on any control characters yet. > > - James > > Steven Isaacson wrote: > > Is there a utility code provided to remove invalid characters from text, > > html or xhtml in the content of an entry? Either in Abdera or in a > > dependency. I know it isn't that hard but it would be nice if I didn't > have > > to write it yet again by myself. > > > > Currently I wrote a quick method to esacape the HTML but that doesn't > help > > when control characters are present in the HTML. > > > > Thanks, > > -steve > > > ------=_Part_28194_21222471.1160078316579--