Subject: Problem extracting UTF-8 encoded data
From: "Graham Holden"
To: "Apache AXIS C User List"
Date: Wed, 5 Oct 2011 13:24:04 +0100

Hello,

I've written a service in Axis2/C (version 1.6.0) that seems to work fine with normal ASCII data, but doesn't cope with UTF-8 encoded data.

A mailing-list post from 2009 (http://marc.info/?l=axis-user&m=126001317008901&w=2) says that this should work if the request conforms to SOAP specs -- I think mine does; the relevant TCPMON snippet is shown below.

The offending line, about half-way through, is:

    Archived File Name ├╣

The two "offending" characters are 0xC3 (195) and 0xB9 (185) which, I believe, are the UTF-8 encoding of U+00F9 (ù - lowercase 'u' with grave), and viewing the payload as an XML file in Internet Explorer seems to confirm this.

Without these two characters, everything works fine, but with them, the node is being returned (by "axiom_node_get_next_sibling()") with empty contents ("").

Do I need to do anything to "turn on" UTF-8 encoding? Is this a problem with Axis?

Thanks in advance for any help people can give.

Graham Holden.

------ ------ ------ TCPMON Begin ------ ------ -----
SENDING DATA..
/* sending time = 9:57:14*/
/* message uuid = 92bf09dc-1f65-44d9-854c-ef341ba2d4b4*/
---------------------
POST /axis2/services/archive HTTP/1.1
SOAPAction: "http://localhost:9099/axis2/services/archive"
Content-Type: text/xml; charset=utf-8
Content-Length: 1456
User-Agent: httpPost
Host: localhost:9099
Cache-Control: no-cache

127.0.0.1 31417 administrator administrator check V1X3_V1XML
XML File Archive
ARCH_USER Deposited By string 32
ARCH_DATE Deposit Date date 0
BLOB BLOB ID blob 0
FILENAME Archived File Name ├╣ string 255
DOC_KEY Document Key string 255
DOC_ID Unique Reference string 50
REV_NUMBER Version integer 0
------ ------ ------ TCPMON Begin ------ ------ -----
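
The message reasons that the bytes 0xC3 0xB9 are the UTF-8 encoding of U+00F9. That arithmetic can be checked independently of Axis2/C with a minimal sketch like the one below. The helper name decode_utf8_2byte is hypothetical and is not part of Axis2/C or AXIOM; it handles only two-byte sequences, which is all this case needs.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not an Axis2/C function): decode a single
 * two-byte UTF-8 sequence of the form 110xxxxx 10xxxxxx.
 * Returns the code point, or -1 if the input is not a valid
 * two-byte sequence. */
static long decode_utf8_2byte(const unsigned char *bytes, size_t len)
{
    long cp;

    assert(bytes != NULL);
    if (len < 2)
        return -1;
    if ((bytes[0] & 0xE0) != 0xC0)   /* lead byte must be 110xxxxx */
        return -1;
    if ((bytes[1] & 0xC0) != 0x80)   /* continuation must be 10xxxxxx */
        return -1;

    /* Five payload bits from the lead byte, six from the continuation. */
    cp = ((long)(bytes[0] & 0x1F) << 6) | (long)(bytes[1] & 0x3F);

    if (cp < 0x80)                   /* reject overlong encodings */
        return -1;
    return cp;
}
```

Called with the two bytes 0xC3 and 0xB9, the helper returns 0xF9, which agrees with the reading that the request body carries a well-formed UTF-8 "ù". It says nothing about where the text is lost on the service side.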