From: Peter Kelly
Subject: Re: ODF filter
Date: Thu, 8 Jan 2015 22:59:47 +0700
References: <07E07C76-27B2-4B85-A482-83D0C65E22AF@apache.org>
 <10FF0071-9636-432A-99B1-59C19EE8841B@comcast.net>
To: dev@corinthia.incubator.apache.org
In-Reply-To: <10FF0071-9636-432A-99B1-59C19EE8841B@comcast.net>

> On 8 Jan 2015, at 10:16 am, Dave Fisher wrote:
>
> Hi Peter,
>
> This is a helpful email; from your concrete discussion I can better understand the mapping between the abstract / HTML model and the concrete / DOCX, ODT.
>
> You mention differences in the style runs for Word and ODT, with which I am familiar from the OOXML side. Does the abstract model / HTML take a particular approach towards style runs? Is there a concrete version of the HTML model? Is there a specification or plan for the abstract model?

As a general principle, no: a given filter is expected to handle arbitrary HTML.

However, there is a function for “normalising” an HTML document, which changes nested sets of inline elements (span, b, i, etc.) into a flat sequence of runs (each represented as a span element). The Word filter uses this, due to Word’s flat model of inline runs.

ODF text documents, on the other hand, *do* support nested formatting runs, so when writing this filter it may make sense not to apply the normalisation process used in the Word filter. Normalisation should be skipped if there is information that could not be represented in HTML and would be lost by flattening the structure as we do for Word.

The topic of what internal representation we should use has come up a few times: whether we should stick with HTML, come up with our own entirely different model, or something else.
I personally think HTML is a good choice, but for those who have raised the issue of an alternate intermediate form, this might be a good time to start that discussion ;)

-- 
Dr Peter M. Kelly
pmkelly@apache.org

PGP key: http://www.kellypmk.net/pgp-key
(fingerprint 5435 6718 59F0 DD1F BFA0 5E46 2523 BAA1 44AE 2966)
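
As an illustration of the normalisation step described above, here is a minimal Python sketch of flattening nested inline elements into a flat sequence of runs. It is not Corinthia's actual function: the names RunFlattener, flatten_inline and runs_to_html, the set of inline tags, and the use of a class attribute to carry the formatting of each run are all assumptions made for the example.

```python
from html import escape
from html.parser import HTMLParser

# Assumed set of inline elements; a real filter would likely handle more.
INLINE_TAGS = {"span", "b", "i", "u", "em", "strong"}


class RunFlattener(HTMLParser):
    """Collect text as flat runs, each tagged with its enclosing inline elements."""

    def __init__(self):
        super().__init__()
        self.stack = []  # currently open inline tags, outermost first
        self.runs = []   # list of (formatting_tuple, text)

    def handle_starttag(self, tag, attrs):
        if tag in INLINE_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Close the innermost matching open tag; ignore stray end tags.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i] == tag:
                del self.stack[i]
                break

    def handle_data(self, data):
        # A plain span carries no formatting of its own in this sketch.
        fmt = tuple(sorted(set(self.stack) - {"span"}))
        if self.runs and self.runs[-1][0] == fmt:
            # Merge with the previous run when the formatting is identical.
            self.runs[-1] = (fmt, self.runs[-1][1] + data)
        else:
            self.runs.append((fmt, data))


def flatten_inline(fragment):
    """Return the flat list of (formatting, text) runs for an inline HTML fragment."""
    parser = RunFlattener()
    parser.feed(fragment)
    parser.close()
    return parser.runs


def runs_to_html(runs):
    """Serialise runs as sibling span elements, one per run."""
    parts = []
    for fmt, text in runs:
        attr = ' class="{}"'.format(" ".join(fmt)) if fmt else ""
        parts.append("<span{}>{}</span>".format(attr, escape(text)))
    return "".join(parts)
```

For example, flatten_inline("a<b>b<i>c</i></b>d") yields four runs, with the text "c" carrying both b and i. A filter for a format with nested runs, such as ODF, could skip this step and walk the original tree instead.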