poi-user mailing list archives

From "Allison, Timothy B." <talli...@mitre.org>
Subject RE: POI not parsing these XLS file
Date Thu, 21 Apr 2016 16:38:30 GMT
https://msdn.microsoft.com/en-us/library/bb226687(v=office.11).aspx

Could this be pre-OOXML SpreadsheetML (the Excel 2003 XML format)?

Both 'file' and DROID identify it as plain XML.

I'll see if I can dig up some other files.
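
If the file does turn out to be SpreadsheetML 2003, a plain XML parser from the JDK may be enough to recover the rows and cells. The following is only a minimal sketch under that assumption: the class and method names are hypothetical (not part of POI or Tika), it relies on the Row, Cell and Data elements living in the urn:schemas-microsoft-com:office:spreadsheet namespace, and it ignores ss:Index offsets, merged cells and data types.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

// Hypothetical helper, not a POI or Tika class.
class SpreadsheetMlSketch {
    // Namespace of the Excel 2003 SpreadsheetML elements (Workbook, Row, Cell, Data).
    static final String SS_NS = "urn:schemas-microsoft-com:office:spreadsheet";

    // Returns one list of cell texts per Row element, in document order.
    // Rows from all worksheets are flattened into a single list.
    static List<List<String>> readRows(InputStream in) {
        Document doc;
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true); // needed for the *NS lookups below
            doc = factory.newDocumentBuilder().parse(in);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("Could not parse input as XML", e);
        }

        List<List<String>> rows = new ArrayList<>();
        NodeList rowNodes = doc.getElementsByTagNameNS(SS_NS, "Row");
        for (int r = 0; r < rowNodes.getLength(); r++) {
            Element row = (Element) rowNodes.item(r);
            List<String> cells = new ArrayList<>();
            NodeList cellNodes = row.getElementsByTagNameNS(SS_NS, "Cell");
            for (int c = 0; c < cellNodes.getLength(); c++) {
                Element cell = (Element) cellNodes.item(c);
                NodeList data = cell.getElementsByTagNameNS(SS_NS, "Data");
                // A Cell without a Data child is treated as an empty string.
                cells.add(data.getLength() > 0 ? data.item(0).getTextContent() : "");
            }
            rows.add(cells);
        }
        return rows;
    }
}
```

Calling SpreadsheetMlSketch.readRows(new FileInputStream(myFile)) would then give a list of rows to inspect. Because ss:Index is ignored here, sparse rows may come out with their cells shifted to the left.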

-----Original Message-----
From: Javen O'Neal [mailto:javenoneal@gmail.com] 
Sent: Thursday, April 21, 2016 11:43 AM
To: POI Users List <user@poi.apache.org>
Subject: Re: POI not parsing these XLS file

Interesting. The file has an .xls extension but is identified as application/xml. POI does not
support the Workbook XML format; OOXML is a different format.
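
One way to catch this case before handing the file to WorkbookFactory is to look at its first bytes. This is a rough sketch only, assuming an ASCII-compatible encoding and that the SpreadsheetML namespace or the Excel.Sheet processing instruction appears within the first kilobyte; the class and method names are hypothetical and not part of POI.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not a POI class.
class XlsSniffSketch {
    // Number of leading bytes inspected; an arbitrary choice for this sketch.
    static final int HEAD_SIZE = 1024;

    // Returns true if the start of the file looks like an Excel 2003 XML workbook.
    // A UTF-16 encoded file would not match, since the bytes are decoded as Latin-1.
    static boolean looksLikeSpreadsheetMl(byte[] fileBytes) {
        int len = Math.min(fileBytes.length, HEAD_SIZE);
        String head = new String(fileBytes, 0, len, StandardCharsets.ISO_8859_1);
        return head.contains("urn:schemas-microsoft-com:office:spreadsheet")
            || head.contains("progid=\"Excel.Sheet\"");
    }
}
```

With the bytes from java.nio.file.Files.readAllBytes(myFile.toPath()), a true result suggests routing the file to an XML parser rather than to POI.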
On Apr 21, 2016 8:37 AM, "Andrew Munn" <andrew@nmedia.net> wrote:

> I am using POI 3.15.
>
> I cannot get POI to parse these XLS files, which are generated by
> Bloomberg, into something useful.  I can use Tika to parse them into
> one long line of HTML, but then there are no cell breaks or line breaks.
>
> http://www.topazdevelopment.com/tmp/poi/2010-cal-eu.xls
>
> Tika identifies it as:
>
> Content-Length: 84473
> Content-Type: application/xml
> X-Parsed-By: org.apache.tika.parser.DefaultParser
> X-Parsed-By: org.apache.tika.parser.xml.DcXMLParser
> X-TIKA:digest:MD5: 132c9c6bb186b7dea86f7da08f18c672
> X-TIKA:digest:SHA256: badef53ab482d611d4345323c625ac048d76dd3af23a69a98ed86cbe8bf53304
> resourceName: 2010-cal-eu.xls
>
> If I do:
>
>   org.apache.poi.ss.usermodel.Workbook workbook = WorkbookFactory.create(myFile);
>
> I get:
>
> org.apache.poi.openxml4j.exceptions.InvalidFormatException: Your InputStream was neither an OLE2 stream, nor an OOXML stream
>
> I can open these files in Excel and LibreOffice.
>
> Am I missing something?
>
> Thanks!