abdera-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jv ning (JIRA)" <j...@apache.org>
Subject [jira] Updated: (ABDERA-222) Parse failures reading utf-8 xml files that have attribute values that contain non US-ASCII valid utf-8 characters
Date Thu, 26 Mar 2009 02:24:51 GMT

     [ https://issues.apache.org/jira/browse/ABDERA-222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

jv ning updated ABDERA-222:

    Attachment: ChunkedTransferFailure.java

This code, when run against abdera 4.0 using HttpClient 3.1 demonstrates the chunked transfer
multi-byte failures

There are two examples in the code, 
one that places a multibyte character at position 0 in a chunk, the byte array rawChunkWithMultiByteAtStart
and one that does not place a multbyte character at position 0 of any chunk.

> Parse failures reading utf-8 xml files that have attribute values that contain non US-ASCII
valid utf-8 characters
> ------------------------------------------------------------------------------------------------------------------
>                 Key: ABDERA-222
>                 URL: https://issues.apache.org/jira/browse/ABDERA-222
>             Project: Abdera
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>         Environment: solarix x86_64, MaxOS Leopard x86_64, linux x86_64
>            Reporter: jv ning
>         Attachments: ChunkedTransferFailure.java
> When parsing XML files that are items fetched by http-client 3.1 
> The same items parse correctly, if written to a byte array and then a ByteArrayInputStream
on the byte array, is passed to parse.
> parser.parse(response.getResponseBodyAsStream());
> Caused by: com.ctc.wstx.exc.WstxUnexpectedCharException: Illegal character (NULL, unicode
0) encountered: not valid in any content
>  at [row,col {unknown-source}]: [3,56]
>         at com.ctc.wstx.sr.StreamScanner.constructNullCharException(StreamScanner.java:615)
>         at com.ctc.wstx.sr.StreamScanner.throwInvalidSpace(StreamScanner.java:644)
>         at com.ctc.wstx.sr.BasicStreamReader.readTextPrimary(BasicStreamReader.java:4554)
>         at com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2886)
>         at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1019)
>         at org.apache.abdera.parser.stax.FOMBuilder.getNextElementToParse(FOMBuilder.java:163)
>         at org.apache.abdera.parser.stax.FOMBuilder.next(FOMBuilder.java:187) 

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message