poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject [Bug 56205] [PATCH] Upgrade OOXML schema to 3rd edition (transitional)
Date Tue, 04 Mar 2014 00:41:50 GMT
https://issues.apache.org/bugzilla/show_bug.cgi?id=56205

--- Comment #2 from Andreas Beeker <andreas.beeker@gmx.de> ---
I thought about comparing the schemas, but the changes are substantially, so
that doesn't work.

So the next try was to check the poi examples and their output, XSSF and XSLF
looked ok, but XWPF has some problems (e.g. the SimpleDocument example looks
not ok).
So the patch needs some rework, especially as most changes were in the XWPF
part.

Apart of the junit test I haven't checked input processing yet.

There are some infos about the compatibility on this office article [1] - note
the line: "... writes files conformant to ISO/IEC 29500 Transitional ...", but
when you look into the details (e.g. [2] as an example for the other affected
percent attributes), you see that although it's able to read an alternative
format, it writes the legacy format. 

As far as I have checked the changes for length/percent attributes, it depends
on POI if the resulting file can be read by versions < 2010, e.g. if
measurement units are used in length attributes, the file probably can't be
read anymore by versions < 2010. Therefore we would need to take care when
populating new attributes to stick with the legacy format, if possible.

The new "sharedTypes" namespace [3] seems to stay out of the resulting file.

So I guess in the end, it's a trade off 
- using a new schema and potentially using/introducing features which can be
only used in newer Office versions
- vs. having the greatest common format, i.e. a schema which only allows one
kind of attribute content

> If we use the newer schemas, how does that change what we output? Will it mean that the
files we generate stop being compatible with older office versions?
That depends, if we use the new features

> How about input? Will it mean we stop being able to read files generated by older versions
of POI, or older versions of office?
The 3rd transitional schema should be compatible to the 1st edition - but there
are certain features like VML, which are phased out.


[1] http://msdn.microsoft.com/en-US/library/office/gg607163(v=office.14).aspx
[2] http://msdn.microsoft.com/en-us/library/gg548598(v=office.12).aspx
[3] http://schemas.openxmlformats.org/officeDocument/2006/sharedTypes

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Mime
View raw message