axis-c-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "nadir amra (JIRA)" <axis-c-...@ws.apache.org>
Subject [jira] Commented: (AXISCPP-964) SOAP request/response not UTF-8 encoded (but claims to be)
Date Fri, 13 Feb 2009 23:40:59 GMT

    [ https://issues.apache.org/jira/browse/AXISCPP-964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12673421#action_12673421
] 

nadir amra commented on AXISCPP-964:
------------------------------------

And yes another option is to have user specify path to iconv dll/shared library - and if found,
use that.  Otherwise, revert to using encoding in config file, and if not specified, try to
obtain appropriate encoding for locale and finally revert to a default. 

> SOAP request/response not UTF-8 encoded (but claims to be)
> ----------------------------------------------------------
>
>                 Key: AXISCPP-964
>                 URL: https://issues.apache.org/jira/browse/AXISCPP-964
>             Project: Axis-C++
>          Issue Type: Bug
>          Components: SOAP
>    Affects Versions: current (nightly)
>         Environment: All platforms, except OS/400
>            Reporter: Henrik Nordberg
>
> (See the end of this description for a one-liner that works around this problem for most
cases.)
> SoapSerializer.cpp, line 379 says
> serialize( "<?xml version='1.0' encoding='utf-8' ?>", NULL);
> that is that the SOAP response is UTF-8 encoded. But this is only true for OS/400 as
can be seen in HTTPTransport.cpp, lines 311-
> #ifndef __OS400__
>         *m_pActiveChannel << this->getHTTPHeaders ();
>         *m_pActiveChannel << this->m_strBytesToSend.c_str ();
> #else
>         // Ebcdic (OS/400) systems need to convert the data to UTF-8. Note that free()
is 
>         // correctly used and should not be changed to delete().		
>         const char *buf = this->getHTTPHeaders ();
>         utf8Buf = toUTF8((char *)buf, strlen(buf)+1);
>         *m_pActiveChannel << utf8Buf;
>         free(utf8Buf);
>         utf8Buf = NULL;
>         utf8Buf = toUTF8((char *)this->m_strBytesToSend.c_str(), this->m_strBytesToSend.length()+1);
>         *m_pActiveChannel << utf8Buf;
>         free(utf8Buf);
>         utf8Buf = NULL;
> #endif
> This leads to clients trying to decode the response as UTF-8, and will have errors whenever
the response contains non-ASCII characters (i.e., > 127).
> Axis Java, for example, will prduce this error upon decoding: 
> "java.io.UTFDataFormatException: Invalid byte 2 of 3-byte UTF-8 sequence."
> A simple workaround is to change SoapSerializer.cpp, line 379:
> from
> serialize( "<?xml version='1.0' encoding='utf-8' ?>", NULL);
> to
> serialize( "<?xml version='1.0' encoding='ISO-8859-1' ?>", NULL);
> The real fix, however, is to encode the response with UTF-8 for all platforms (not just
OS/400).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message