Return-Path: X-Original-To: apmail-openoffice-dev-archive@www.apache.org Delivered-To: apmail-openoffice-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 075D511C81 for ; Tue, 3 Jun 2014 11:41:34 +0000 (UTC) Received: (qmail 80205 invoked by uid 500); 3 Jun 2014 11:41:33 -0000 Delivered-To: apmail-openoffice-dev-archive@openoffice.apache.org Received: (qmail 80121 invoked by uid 500); 3 Jun 2014 11:41:33 -0000 Mailing-List: contact dev-help@openoffice.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@openoffice.apache.org Delivered-To: mailing list dev@openoffice.apache.org Received: (qmail 80110 invoked by uid 99); 3 Jun 2014 11:41:33 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jun 2014 11:41:33 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jogischmidt@gmail.com designates 209.85.212.181 as permitted sender) Received: from [209.85.212.181] (HELO mail-wi0-f181.google.com) (209.85.212.181) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jun 2014 11:41:28 +0000 Received: by mail-wi0-f181.google.com with SMTP id n15so6367422wiw.14 for ; Tue, 03 Jun 2014 04:41:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding; bh=T/HTehlfxMLf5EADizSCTG1lqfS33G/mwngoCgT7nl0=; b=vT8o85JWWbMpcJ1j5TOpBKf73pVS8L2qvTy+q2JUiKuy4PZFcxGoFsuj5kILJAhAq4 Oc3+94s1oVNp2/RfylovbxXdGfH9xtjvp6sQuVqF2q79oS4W12ut9GeTo0fvK5Wd5fhG R1/oxjo9I8etu5Qbux7s45WGKMiU1dJ0yq30WIz8/24cUYLnYQAj6pW0hq8CY2pNnjOM OlDgXWYjwVOu15AePvVpxmoE28lDkKHPrsa9HPNfaDGgK769cHE0bbq154My46iXu/dX Kw/Zlratf6mc0l3qwTK86Xc9bB9d3GpPQSIFTUxcmztHB7DjQQ6U5iq2WYYIT/oSmf6d bXrw== X-Received: by 10.194.238.65 with SMTP id vi1mr14744274wjc.84.1401795662793; Tue, 03 Jun 2014 04:41:02 -0700 (PDT) Received: from [9.155.131.75] (deibp9eh1--blueice2n2.emea.ibm.com. [195.212.29.172]) by mx.google.com with ESMTPSA id ba9sm40819932wib.24.2014.06.03.04.41.01 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Tue, 03 Jun 2014 04:41:02 -0700 (PDT) Message-ID: <538DB44E.6070607@gmail.com> Date: Tue, 03 Jun 2014 13:41:02 +0200 From: =?ISO-8859-1?Q?J=FCrgen_Schmidt?= User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:24.0) Gecko/20100101 Thunderbird/24.5.0 MIME-Version: 1.0 To: dev@openoffice.apache.org Subject: Re: News about the new OOXML framework. References: <538D7C2C.6080607@googlemail.com> <821DBECA-503B-4741-9550-D2DA1E1D56EB@apache.org> In-Reply-To: <821DBECA-503B-4741-9550-D2DA1E1D56EB@apache.org> X-Enigmail-Version: 1.6 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org On 03/06/14 13:20, Dave Fisher wrote: > Hi, > > Are you aware of Apache POI? > > Reads and writes most Ooxml in Java. yes, we know POI and Andrea pointed already on thread where this was discussed. When you look in the details Java is used for the SchemaParser and later on to generate a C++ parser. No plans to use Java during runtime in the office to parse OOXML. But Java is perfect and productive for the development of the tooling etc. Juergen > > Regards, > Dave > > Apache POI and OpenOffice PMCs > > Sent from my iPhone > >> On Jun 3, 2014, at 3:41 AM, Andre Fischer wrote: >> >> I would like to give a short status update about the new OOXML framework. >> >> - Created the new module main/ooxml >> There are not yet any makefiles that build the contents of the ooxml/ module nor link it into the build process of OpenOffice. However, you can use e.g. Eclipse to import the Java projects that are described below. >> >> - Moved the old Office Open XML wiki pages out of the way and create two new ones: >> = "OfficeOpenXML" contains an introduction into the OOXML file format, a status overview of the implementation progress and links to more detailed information. >> = "OOXML" and "ooxml" (uppercase/lowercase) redirect to "OfficeOpenXML" so that there is no excuse to not find this page. >> = "OOXML Framework" contains more detailed information about the new framework. >> >> - Created a new Java project at ooxml/main/source/framework/SchemaParser that parses the XML schema files that come with the ECMA-376 specification files. >> Its purpose is to read the schema files and create a skeleton OOXML parser from it. This skeleton can then be filled in with code for importing certain elements of OOXML documents. >> >> - Created a new Java project at ooxml/main/source/framework/JavaOOXMLParser. Its purpose is testing and debugging of and experimenting with the schema parser. It is not intended to become a runtime component of OpenOffice. >> >> >> The SchemaParser is able to parse all files of the ECMA-376 specification both in the old (1st edition of 2006) and new (4th edition of 2012) versions. It looks like we need both since the new one is the current standard (equivalent to the ISO standard) while the old on is actually used. >> Not all details of the schema files are handled yet. >> >> The JavaOOXMLParser, based on parser tables created by the SchemaParser, is already able to parse the large DOCX file of the 1st edition specification. When pretty printed it is about 90 MB large. It takes the parser about 90 s to read it. Note that the parser is not optimized in any way (if it where then it would be optimized for readability, not for speed) and that it writes about 650 MB of log files in the process. >> >> If anyone would like to play with the parsers, I will gladly provide more details. >> >> Best regards, >> Andre >> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: dev-unsubscribe@openoffice.apache.org >> For additional commands, e-mail: dev-help@openoffice.apache.org >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscribe@openoffice.apache.org > For additional commands, e-mail: dev-help@openoffice.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@openoffice.apache.org For additional commands, e-mail: dev-help@openoffice.apache.org