openoffice-dev mailing list archives

From Jürgen Schmidt <jogischm...@gmail.com>
Subject Re: News about the new OOXML framework.
Date Tue, 03 Jun 2014 10:05:22 GMT
On 03/06/14 09:41, Andre Fischer wrote:
> I would like to give a short status update about the new OOXML framework.
> 
> - Created the new module main/ooxml
>   There are not yet any makefiles that build the contents of the ooxml/
> module nor link it into the build process of OpenOffice. However, you
> can use e.g. Eclipse to import the Java projects that are described below.
> 
> - Moved the old Office Open XML wiki pages out of the way and created two
> new ones:
>   = "OfficeOpenXML" contains an introduction to the OOXML file format,
> a status overview of the implementation progress and links to more
> detailed information.
>   = "OOXML" and "ooxml" (uppercase/lowercase) redirect to
> "OfficeOpenXML" so that there is no excuse to not find this page.
>   = "OOXML Framework" contains more detailed information about the new
> framework.

For convenience, the wiki pages can be found at

https://wiki.openoffice.org/wiki/OOXML

https://wiki.openoffice.org/wiki/OOXML_Framework

> 
> - Created a new Java project at ooxml/main/source/framework/SchemaParser
> that parses the XML schema files that come with the ECMA-376
> specification files.
>   Its purpose is to read the schema files and create a skeleton OOXML
> parser from it.  This skeleton can then be filled in with code for
> importing certain elements of OOXML documents.
> 
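As a rough illustration of the first step described above (reading a schema file and collecting what a skeleton parser would need to handle), here is a minimal sketch. It is not the SchemaParser's actual code: the class name `SchemaElementLister`, the method `elementNames`, and the tiny inline schema are all made up for this example, and it only uses the DOM parser that ships with the JDK.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of main/ooxml.
public class SchemaElementLister {
    // Namespace of XML Schema itself (xsd:element, xsd:complexType, ...).
    static final String XSD_NS = "http://www.w3.org/2001/XMLSchema";

    // Returns the names of all named element declarations, in document order.
    // References (xsd:element ref="...") carry no name attribute and are skipped.
    static List<String> elementNames(String xsd) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // required for getElementsByTagNameNS
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(
            new ByteArrayInputStream(xsd.getBytes(StandardCharsets.UTF_8)));

        NodeList nodes = doc.getElementsByTagNameNS(XSD_NS, "element");
        List<String> names = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            Element element = (Element) nodes.item(i);
            if (element.hasAttribute("name")) {
                names.add(element.getAttribute("name"));
            }
        }
        return names;
    }

    public static void main(String[] args) throws Exception {
        // Made-up miniature schema; the real ECMA-376 schemas are far larger.
        String xsd =
            "<xsd:schema xmlns:xsd=\"http://www.w3.org/2001/XMLSchema\">"
            + "<xsd:element name=\"document\">"
            + "<xsd:complexType><xsd:sequence>"
            + "<xsd:element ref=\"body\"/>"
            + "</xsd:sequence></xsd:complexType>"
            + "</xsd:element>"
            + "<xsd:element name=\"body\"/>"
            + "</xsd:schema>";

        // Emit one placeholder per declared element, i.e. a "skeleton".
        for (String name : elementNames(xsd)) {
            System.out.println("// TODO: import handler for <" + name + ">");
        }
    }
}
```

A real schema parser would also have to follow types, groups, attributes and imports between schema files; this sketch only shows the idea of deriving placeholders from element declarations.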
> - Created a new Java project at
> ooxml/main/source/framework/JavaOOXMLParser.  Its purpose is to test,
> debug, and experiment with the schema parser.  It is not
> intended to become a runtime component of OpenOffice.
> 
> 
> The SchemaParser is able to parse all files of the ECMA-376
> specification both in the old (1st edition of 2006) and new (4th edition
> of 2012) versions.  It looks like we need both since the new one is the
> current standard (equivalent to the ISO standard) while the old one is
> actually used.
> Not all details of the schema files are handled yet.
> 
> The JavaOOXMLParser, based on parser tables created by the SchemaParser,
> is already able to parse the large DOCX file of the 1st edition
> specification.  When pretty printed it is about 90 MB large.  It takes
> the parser about 90 s to read it.  Note that the parser is not optimized
> in any way (if it were then it would be optimized for readability, not
> for speed) and that it writes about 650 MB of log files in the process.
> 
> If anyone would like to play with the parsers, I will gladly provide
> more details.

Thanks for the update, I think it's good to know that you are already
able to read both versions as described above.

I believe most users of OOXML don't care about the spec and don't even
know that the files produced today are by default OOXML Transitional.
OOXML is far more complex than reading one spec ... but nobody cares
about the complexity; people simply want or expect 1:1 support ;-)

Juergen

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@openoffice.apache.org
For additional commands, e-mail: dev-help@openoffice.apache.org

