hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aaron T. Myers (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDFS-4235) when outputting XML, OfflineEditsViewer can't handle some edits containing non-ASCII strings
Date Fri, 22 Feb 2013 23:04:14 GMT

     [ https://issues.apache.org/jira/browse/HDFS-4235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Aaron T. Myers updated HDFS-4235:
---------------------------------

     Target Version/s: 2.0.4-beta  (was: 2.0.3-alpha)
    Affects Version/s: 2.0.3-alpha
               Status: Open  (was: Patch Available)

Canceling patch until feedback is addressed. 
                
> when outputting XML, OfflineEditsViewer can't handle some edits containing non-ASCII
strings
> --------------------------------------------------------------------------------------------
>
>                 Key: HDFS-4235
>                 URL: https://issues.apache.org/jira/browse/HDFS-4235
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 2.0.3-alpha
>            Reporter: Colin Patrick McCabe
>            Assignee: Colin Patrick McCabe
>            Priority: Minor
>         Attachments: HDFS-4235.001.patch
>
>
> It seems that when outputting XML, OfflineEditsViewer can't handle some edits containing
non-ASCII strings.
> Example:
> {code}
> cmccabe@keter:/h> ./bin/hdfs oev -i ~/Downloads/current2/edits -o /tmp/u.xml     
                                   
> 17:11:24,662 ERROR OfflineEditsBinaryLoader:82 - Got IOException at position 10593
> Encountered exception. Exiting: SAX error: The character '�' is an invalid XML character
> java.io.IOException: SAX error: The character '�' is an invalid XML character
>         at org.apache.hadoop.hdfs.tools.offlineEditsViewer.XmlEditsVisitor.visitOp(XmlEditsVisitor.java:119)
>         at org.apache.hadoop.hdfs.tools.offlineEditsViewer.OfflineEditsBinaryLoader.loadEdits(OfflineEditsBinaryLoader.java:78)
>         at org.apache.hadoop.hdfs.tools.offlineEditsViewer.OfflineEditsViewer.go(OfflineEditsViewer.java:142)
>         at org.apache.hadoop.hdfs.tools.offlineEditsViewer.OfflineEditsViewer.run(OfflineEditsViewer.java:228)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
>         at org.apache.hadoop.hdfs.tools.offlineEditsViewer.OfflineEditsViewer.main(OfflineEditsViewer.java:237)
> {code}
> Probably, we forgot to properly escape and/or re-encode a filename before putting it
into the XML.  The other processors (stats, binary) don't have this problem, so it is purely
an XML encoding issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message