incubator-ooo-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 119219] Saved RTF has issue in encoding Latin1 characters
Date Thu, 12 Apr 2012 23:15:23 GMT
https://issues.apache.org/ooo/show_bug.cgi?id=119219

--- Comment #3 from orcmid <orcmid@apache.org> 2012-04-12 23:15:23 UTC ---
Created attachment 77439
  --> https://issues.apache.org/ooo/attachment.cgi?id=77439
This RTF document demonstrates the defect reported in #119219

Although this file will download as plaintext, it is an RTF (a data format that
uses ASCII).  It can be opened correctly in OpenOffice Writer and in Microsoft
Word.  The defect is only visible when viewing the plaintext.  Just search for
"\u335" and you'll see the places where the incorrect single-byte code appears.
 (3f is the ASCII code for '?')

It was produced with Apache OpenOffice 3.4 r1309668 on Windows.

Now, I am not clear this is a bug.  The expected codes are greater than '7f,
the highest ASCII value.  The RTF prolog from AOO specifies that the RTF is
ansi coded.  It does not specify a code page to be used for single-byte codes
instead.  I also don't believe there is an option to select a code page as part
of exporting to RTF format.

Since OpenOffice operates in Unicode I can see why there is a disconnect with
Windows-1252.  OpenOffice export to RTF does not identify a code page in which
its non-ASCII characters will be expressed in a single-byte code.  That's an
interesting problem, since OpenOffice is a multi-platform product.

-- 
Configure bugmail: https://issues.apache.org/ooo/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

Mime
View raw message