camel-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aki Yoshida (JIRA)" <>
Subject [jira] [Created] (CAMEL-7584) XML-Aware Tokenizer failing with utf-8 multibyte characters
Date Mon, 07 Jul 2014 16:18:33 GMT
Aki Yoshida created CAMEL-7584:

             Summary: XML-Aware Tokenizer failing with utf-8 multibyte characters
                 Key: CAMEL-7584
             Project: Camel
          Issue Type: Bug
          Components: camel-core
            Reporter: Aki Yoshida
            Assignee: Aki Yoshida
             Fix For: 2.14.0

There is some issue in the underlining Stax reader's  getLocation().getCharOffset() when the
input data is an InputStream to the stax reader.

This issue was brought up in the woodstox community. But I believe fixing it seems to be non
trivial as woodstox internally uses char/Reader and keeps the offset value to the character
sequence and not to the original input stream.

We change the tokenzer to pass to the woodstox parser instead of passing

This message was sent by Atlassian JIRA

View raw message