Return-Path: X-Original-To: apmail-uima-user-archive@www.apache.org Delivered-To: apmail-uima-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D7D1310352 for ; Fri, 28 Mar 2014 12:35:27 +0000 (UTC) Received: (qmail 51805 invoked by uid 500); 28 Mar 2014 12:35:26 -0000 Delivered-To: apmail-uima-user-archive@uima.apache.org Received: (qmail 51629 invoked by uid 500); 28 Mar 2014 12:35:26 -0000 Mailing-List: contact user-help@uima.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@uima.apache.org Delivered-To: mailing list user@uima.apache.org Received: (qmail 51606 invoked by uid 99); 28 Mar 2014 12:35:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 28 Mar 2014 12:35:24 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy includes SPF record at spf.trusted-forwarder.org) Received: from [108.166.43.65] (HELO smtp65.ord1c.emailsrvr.com) (108.166.43.65) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 28 Mar 2014 12:35:20 +0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp1.relay.ord1c.emailsrvr.com (SMTP Server) with ESMTP id 037971493A0; Fri, 28 Mar 2014 08:34:58 -0400 (EDT) X-Virus-Scanned: OK Received: by smtp1.relay.ord1c.emailsrvr.com (Authenticated sender: reshu.agarwal-AT-orkash.com) with ESMTPSA id 1D86C1483A6 for ; Fri, 28 Mar 2014 08:34:56 -0400 (EDT) Message-ID: <53356CA5.2040506@orkash.com> Date: Fri, 28 Mar 2014 18:05:49 +0530 From: "reshu.agarwal" Organization: Orkash Services Pvt Ltd User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130803 Thunderbird/17.0.8 MIME-Version: 1.0 To: user@uima.apache.org Subject: Re: status Lost=1 in DUCC References: <532ACDA2.6020101@orkash.com> <53329E76.3010709@orkash.com> <5332D412.7040306@orkash.com> <5333B7C4.3000705@orkash.com> <5334F3D7.3080508@orkash.com> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org On 03/28/2014 05:54 PM, Lou DeGenaro wrote: > Hi Reshu, > > Very good. It would be helpful if you could supply a small sample data > comprising "invalid XML characters" as a test case, to motivate DUCC to > detect and handle this situation more elegantly in terms of allowing the > user to recognize what's wrong. > > Lou. > > > On Fri, Mar 28, 2014 at 12:00 AM, reshu.agarwal wrote: > >> On 03/27/2014 08:13 PM, Lou DeGenaro wrote: >> >>> he data being sent are "values" rather than "keys" in your >>> CAS? If so, this is not really a "best practice" for DUCC use. >>> >> Hi Lou, >> >> This is not the problem of how I send the data. My document contains some >> invalid XML characters. So, problem resolved after I applied filter for >> that. >> >> Reshu. >> Ya Sure, Here is a sample document: "About the Human Rights House Network ( www.humanrightshouse.org ) The Human Rights House Network (HRHN) unites 87 human rights NGOs joining forces in 18 independent Human Rights Houses in 15 countries in Western Balkans, Eastern Europe and South Caucasus, East and Horn of Africa, and Western Europe. HRHN???s aim is to protect, empower and support human rights organisations locally and unite them in an international network of Human Rights Houses. The Human Rights House Foundation (HRHF), based in Oslo (Norway) with an office in Geneva (Switzerland), is HRHN???s secretariat. HRHF is international partner of the South Caucasus Network of Human Rights Defenders and the emerging Balkan Network of Human Rights Defenders. HRHF has consultative status with the United Nations and HRHN has participatory status with the Council of Europe. All applicants are requested to e-mail a motivation letter and curriculum vitae to: Anna Innocenti, International Advocacy Officer at the Human Rights House Foundation (HRHF), at ae;e;a.innf;centi@humae;rightshouse.f;rg ." Specific this line contains some invalid characters: "All applicants are requested to e-mail a motivation letter and curriculum vitae to: Anna Innocenti, International Advocacy Officer at the Human Rights House Foundation (HRHF), at ae;e;a.innf;centi@humae;rightshouse.f;rg ."" And we can find out the problem by trying the same document in UIMA AS. And this problem of invalid character was also in object other then document text which is passed in CAS. -- Thanks, Reshu Agarwal