Return-Path: Delivered-To: apmail-incubator-abdera-user-archive@locus.apache.org Received: (qmail 64933 invoked from network); 7 Aug 2007 17:50:46 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 7 Aug 2007 17:50:46 -0000 Received: (qmail 30227 invoked by uid 500); 7 Aug 2007 17:50:45 -0000 Delivered-To: apmail-incubator-abdera-user-archive@incubator.apache.org Received: (qmail 30220 invoked by uid 500); 7 Aug 2007 17:50:45 -0000 Mailing-List: contact abdera-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: abdera-user@incubator.apache.org Delivered-To: mailing list abdera-user@incubator.apache.org Received: (qmail 30202 invoked by uid 99); 7 Aug 2007 17:50:45 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Aug 2007 10:50:45 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jasnell@gmail.com designates 66.249.82.235 as permitted sender) Received: from [66.249.82.235] (HELO wx-out-0506.google.com) (66.249.82.235) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Aug 2007 17:50:39 +0000 Received: by wx-out-0506.google.com with SMTP id h30so1599377wxd for ; Tue, 07 Aug 2007 10:50:18 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:user-agent:mime-version:to:subject:references:in-reply-to:content-type:content-transfer-encoding; b=B5ofcnTqnREEGpeRhi9ImXeblaUw69R0rPBF4TlHS3iz6CPuGRE6QqXDxv7FCwP8ZrsNZSivaD7Sk72hkHa+n/ne5UoDgpMLcfkVwt/Pv6Z77tQyKa5yyHhiXPoQMN+HhSr5OarnUJcK6AOR2iizvWXoBnRr101c3Z8puueOaok= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:user-agent:mime-version:to:subject:references:in-reply-to:content-type:content-transfer-encoding; b=a97WFUTW6mRJrd9j/S31hC3lDhX6zIISS7o0Ixx2+XcE+diePlQ1+5LlQ9/quzA4EvYI1XYWt++55Nl8hF6eP6L4+9hfvEdrednYQzFg27ExiHMCMmFTzIBC6/KErbZuPVyLs8ADhiJ6uIht/LuzORsjmTxxdAd5N7fKPgbhzic= Received: by 10.90.69.8 with SMTP id r8mr6327677aga.1186509018527; Tue, 07 Aug 2007 10:50:18 -0700 (PDT) Received: from ?192.168.1.103? ( [67.181.218.96]) by mx.google.com with ESMTPS id l43sm8244847wrl.2007.08.07.10.50.17 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 07 Aug 2007 10:50:18 -0700 (PDT) Message-ID: <46B8B0D6.1010300@gmail.com> Date: Tue, 07 Aug 2007 10:50:14 -0700 From: James M Snell User-Agent: Thunderbird 2.0.0.0 (X11/20070326) MIME-Version: 1.0 To: abdera-user@incubator.apache.org Subject: Re: xml restricted characters References: <29a761a00708071046y113792d2i7ca125f271d37e27@mail.gmail.com> In-Reply-To: <29a761a00708071046y113792d2i7ca125f271d37e27@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org The trunk now includes support for filtering out restricted characters when parsing. When enabled in ParserOptions, those characters are ignored completely and never appear within the parsed document. If you parse a doc with restricted chars then reserialize, they should not be present at all. There currently is no mechanism for filtering out restricted chars on serialization due to a limitation in the Axiom serialization code. I'm looking to resolve that later. In the meantime, there is a reader implementation that can do the filtering for you if you'd like. Look in org.apache.abdera.util.* for the reader. - James Brian Moseley wrote: > whose responsibility is it to account for entry content that contains > characters restricted in xml? > > i'm generating an entry with content of type "text/json". the content > contains a control character. the feed is serialized and written to > the output stream with no problems. > > my expectation is that an exception would be thrown by the axiom xml > serializer when it encounters a restricted character. should i instead > be detecting them in my content before setting it on the entry? is > there a utility for filtering them out? xml doesn't define any > escaping rules for restricted characters does it? >