Return-Path: Delivered-To: apmail-incubator-uima-user-archive@locus.apache.org Received: (qmail 67001 invoked from network); 7 Mar 2008 15:57:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 7 Mar 2008 15:57:20 -0000 Received: (qmail 53161 invoked by uid 500); 7 Mar 2008 15:57:16 -0000 Delivered-To: apmail-incubator-uima-user-archive@incubator.apache.org Received: (qmail 53146 invoked by uid 500); 7 Mar 2008 15:57:16 -0000 Mailing-List: contact uima-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: uima-user@incubator.apache.org Delivered-To: mailing list uima-user@incubator.apache.org Received: (qmail 53137 invoked by uid 99); 7 Mar 2008 15:57:16 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2008 07:57:16 -0800 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of burnlewis@gmail.com designates 209.85.200.171 as permitted sender) Received: from [209.85.200.171] (HELO wf-out-1314.google.com) (209.85.200.171) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Mar 2008 15:56:29 +0000 Received: by wf-out-1314.google.com with SMTP id 23so761252wfg.21 for ; Fri, 07 Mar 2008 07:56:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; bh=iZBEc5T7wX29RWB5wX9k0pz2x864pa3kmiim/6UQWu4=; b=BgGbzjTBGnSXzvUGBUVH0lzJjputmIF7Hrx7urXjgcDOR1PrYYDeAgl8EIhHTBDaJ9/LMbUgi7IGjXbBKGoUedtnqpBqwupXTACjSo7uypynk4JA0ba2vWNJg9rZZ46hsEKzXjK568dY37yh88dDSDvacU1TbUKaYLZwlPdQT2s= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=EzU4zLL9stX4oKqJpJkpBKS8qYlhWPjklBmgRQv229WknKYpXtAqOJAhR/lDkzCDX/8UpMbjeV08exABOvgtksdhmkqiLaIlR1Tj++4LZcPihwEgla4Fad+BjWB/euxYufostHRWmsuMZgsRVIqWLcY56Nt3REN1kSGPJ1jVvyY= Received: by 10.143.162.8 with SMTP id p8mr716458wfo.49.1204905408782; Fri, 07 Mar 2008 07:56:48 -0800 (PST) Received: by 10.142.142.9 with HTTP; Fri, 7 Mar 2008 07:56:48 -0800 (PST) Message-ID: <12012a0a0803070756t7e70b3eem8cb1c127c6d5748d@mail.gmail.com> Date: Fri, 7 Mar 2008 10:56:48 -0500 From: "Burn Lewis" To: uima-user@incubator.apache.org Subject: Re: Bewildered In-Reply-To: <47D04FA6.8040304@aptima.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_21270_30844899.1204905408782" References: <3623.1204750283@aptima.com> <47CFE781.2050008@schor.com> <47D00931.7060301@aptima.com> <205239E4006D14469A2D6CEC72AE925423D494@EXV01001.GlobalSP.local> <47D04FA6.8040304@aptima.com> X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_21270_30844899.1204905408782 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline XML 1.0 does not accept all Unicode characters .... the legal ones are: #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] So if you wish to serialize a CAS to a file or to a remote service you'll have to avoid the 29 legal (but useless?) low value ones. UIMA could replace or escape them but both have possibly undesirable side-effects (lost information & non-standard XML.) At the least this restriction should be documented. ------=_Part_21270_30844899.1204905408782--