poi-dev mailing list archives

From bugzi...@apache.org
Subject [Bug 57008] Writing _x0427_ to a string cell changes the string to some strange UTF-8 character
Date Mon, 30 Mar 2015 14:51:26 GMT
https://bz.apache.org/bugzilla/show_bug.cgi?id=57008

Dominik Stadler <dominik.stadler@gmx.at> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |WONTFIX

--- Comment #8 from Dominik Stadler <dominik.stadler@gmx.at> ---
Thanks, that made it easier to dig into this. Unfortunately, it seems this is
currently done "by design" in POI: it implements the encoding that Excel files
define, so that we write and read files consistently with what the Microsoft
OOXML spec states.

The unescaping is done in XSSFRichTextString.utfDecode(). However, it seems
this was put in on purpose, so I am reluctant to "undo" it here, as others
likely depend on it being in place...

Furthermore, as we have not received any related bug report for a long time and
the code has been in place since 2010, I don't think many people write such a
_xXXXX_ text "by chance" into their files. Unfortunately, it seems to have hit
you here.

Overall, I think your best bet currently is to escape the leading underscore as
_x005F_ in your code, via some helper function, before you populate the cell.
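
A minimal sketch of such a helper, assuming the escape form is _xHHHH_ with
exactly four hex digits and that replacing the leading underscore with _x005F_
is enough for the text to survive decoding. The class name XstringEscaper and
the method escape() are hypothetical and not part of POI:

```java
import java.util.regex.Pattern;

/** Hypothetical helper, not part of POI: protects literal _xHHHH_ text. */
public final class XstringEscaper {

    // An underscore that is immediately followed by "xHHHH_" (four hex digits).
    // The lookahead keeps the rest of the sequence out of the match, so only
    // the leading underscore is replaced.
    private static final Pattern ESCAPE_LOOKALIKE =
            Pattern.compile("_(?=x[0-9A-Fa-f]{4}_)");

    private XstringEscaper() {
        // static helper only
    }

    /**
     * Replaces the leading underscore of every literal _xHHHH_ sequence
     * with _x005F_, so "_x0427_" becomes "_x005F_x0427_".
     */
    public static String escape(String text) {
        if (text == null) {
            return null;
        }
        return ESCAPE_LOOKALIKE.matcher(text).replaceAll("_x005F_");
    }
}
```

The intended use would be something like
cell.setCellValue(XstringEscaper.escape(value)). Whether the value round-trips
unchanged depends on how the POI version in use decodes these sequences, so it
is worth checking against a file written and read back with that version.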
