Return-Path: Delivered-To: apmail-poi-dev-archive@www.apache.org Received: (qmail 54072 invoked from network); 6 May 2010 14:31:42 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 6 May 2010 14:31:42 -0000 Received: (qmail 8849 invoked by uid 500); 6 May 2010 14:31:42 -0000 Delivered-To: apmail-poi-dev-archive@poi.apache.org Received: (qmail 8822 invoked by uid 500); 6 May 2010 14:31:41 -0000 Mailing-List: contact dev-help@poi.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "POI Developers List" Delivered-To: mailing list dev@poi.apache.org Received: (qmail 8814 invoked by uid 99); 6 May 2010 14:31:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 May 2010 14:31:41 +0000 X-ASF-Spam-Status: No, hits=0.8 required=10.0 tests=AWL,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [64.190.72.22] (HELO athena.intekon.com) (64.190.72.22) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 May 2010 14:31:35 +0000 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1078) Subject: Re: DO NOT REPLY [Bug 49020] "org.xml.sax.SAXParseException: does not close tag
." when opening some Excel 2007 files From: Paul Spencer In-Reply-To: Date: Thu, 6 May 2010 10:31:07 -0400 Content-Transfer-Encoding: quoted-printable Message-Id: References: <20100331111454.CA87A234C4BE@brutus.apache.org> To: "POI Developers List" X-Mailer: Apple Mail (2.1078) Nick, Thank you for the fix. I will test with my files this week. Paul Spencer On May 5, 2010, at 1:52 PM, Nick Burch wrote: > On Wed, 31 Mar 2010, Paul Spencer wrote: >>> For the long term, you should report a bug to Microsoft about this. = They either need to sanitise the user input and sort out the tags (eg =
becomes
), or they need to give up and escape the whole tag = contents for the bits where iffy data could get added (eg put this = textbox within a CDATA section) >>=20 >> I will report the but to Microsoft, but that does not address = existing files. >=20 > Any luck getting them to agree with the fault? >=20 >>> Medium term, we should get a list of the problem bits that Excel = does wrong, >>> such as
(but perhaps others). Then, we need to write a XML = Input Wrapper >>> that cleans these up before they get passed to the XML Processor for = loading. >>> Something like this is quite nasty, though it's possible some other = project out >>> there has already done it, and we can just re-use what they do. >>=20 >> I like this as a solution. >=20 > Having just written code for this workaround, I really don't... It's = amazingly sick code! Seems to mostly work though, certainly for your = test file >=20 > Nick >=20 > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org > For additional commands, e-mail: dev-help@poi.apache.org >=20 --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org For additional commands, e-mail: dev-help@poi.apache.org