poi-user mailing list archives

From Andrew Munn <and...@nmedia.net>
Subject Re: POI not parsing these XLS file
Date Thu, 21 Apr 2016 15:46:43 GMT
Looks like Gnumeric's ssconvert can read it and convert it to CSV:

ssconvert -E Gnumeric_Excel:excel 2010-cal-eu.xls -T Gnumeric_stf:stf_csv /tmp/1.csv
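
If the conversion has to be driven from Java rather than a shell, the same command can be launched with ProcessBuilder. This is only a minimal sketch, assuming ssconvert is installed and on the PATH; the class and method names (SsconvertRunner, buildCommand, convert) are made up for illustration and are not part of POI or Gnumeric.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.util.List;

class SsconvertRunner {

    // Builds the same argument list as the ssconvert command shown above.
    static List<String> buildCommand(Path xlsIn, Path csvOut) {
        return List.of(
                "ssconvert",
                "-E", "Gnumeric_Excel:excel",   // importer id, copied from the command above
                xlsIn.toString(),
                "-T", "Gnumeric_stf:stf_csv",   // exporter id, copied from the command above
                csvOut.toString());
    }

    // Runs ssconvert and returns its exit code (assumes ssconvert is on the PATH).
    static int convert(Path xlsIn, Path csvOut) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(xlsIn, csvOut))
                .inheritIO()   // forward ssconvert's own output and errors to this console
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: SsconvertRunner <input.xls> <output.csv>");
            return;
        }
        int exit = convert(Path.of(args[0]), Path.of(args[1]));
        System.out.println("ssconvert exit code: " + exit);
    }
}
```

The resulting CSV can then be read with any CSV reader, which sidesteps POI entirely for these files.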

-Andrew



On Thu, 21 Apr 2016, Javen O'Neal wrote:

> Interesting. Despite the .xls extension, Tika identifies the file as
> application/xml. POI does not support the Workbook XML format, which is
> different from OOXML.
> On Apr 21, 2016 8:37 AM, "Andrew Munn" <andrew@nmedia.net> wrote:
> 
> > I am using POI 3.15.
> >
> > I cannot get POI to parse these XLS files generated by Bloomberg
> > into something useful.  I can use Tika to parse them into one long
> > line of HTML, but then there are no cell breaks or line breaks.
> >
> > http://www.topazdevelopment.com/tmp/poi/2010-cal-eu.xls
> >
> > Tika identifies it as:
> >
> > Content-Length: 84473
> > Content-Type: application/xml
> > X-Parsed-By: org.apache.tika.parser.DefaultParser
> > X-Parsed-By: org.apache.tika.parser.xml.DcXMLParser
> > X-TIKA:digest:MD5: 132c9c6bb186b7dea86f7da08f18c672
> > X-TIKA:digest:SHA256: badef53ab482d611d4345323c625ac048d76dd3af23a69a98ed86cbe8bf53304
> > resourceName: 2010-cal-eu.xls
> >
> > If I do:
> >
> >   org.apache.poi.ss.usermodel.Workbook workbook = WorkbookFactory.create(myFile);
> >
> > I get:
> >
> > org.apache.poi.openxml4j.exceptions.InvalidFormatException: Your
> > InputStream was neither an OLE2 stream, nor an OOXML stream
> >
> > I can open these files in Excel and LibreOffice.
> >
> > Am I missing something?
> >
> > Thanks!
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
> > For additional commands, e-mail: user-help@poi.apache.org
> >
> >
> 
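
The "Workbook XML format" mentioned in the quoted reply is plausibly the Excel 2003 XML spreadsheet vocabulary (Workbook / Worksheet / Table / Row / Cell / Data elements in the urn:schemas-microsoft-com:office:spreadsheet namespace). That is an assumption; nothing in this thread confirms what the Bloomberg file actually contains. Under that assumption, the cells can be pulled out with the JDK's own XML parser, without POI. A rough sketch follows; the class and method names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

class SpreadsheetXmlReader {

    // Namespace of the Excel 2003 XML spreadsheet vocabulary.
    // ASSUMPTION: the file in question uses this vocabulary.
    static final String SS = "urn:schemas-microsoft-com:office:spreadsheet";

    // Returns every Row of every Worksheet, in document order, as a list of cell strings.
    // Simplification: sparse cells (ss:Index) and merged cells are not handled.
    static List<List<String>> readRows(InputStream in) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        // Refuse DOCTYPE declarations, a common precaution for XML from outside sources.
        factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
        Document doc = factory.newDocumentBuilder().parse(in);

        List<List<String>> rows = new ArrayList<>();
        NodeList rowNodes = doc.getElementsByTagNameNS(SS, "Row");
        for (int r = 0; r < rowNodes.getLength(); r++) {
            Element row = (Element) rowNodes.item(r);
            List<String> cells = new ArrayList<>();
            NodeList cellNodes = row.getElementsByTagNameNS(SS, "Cell");
            for (int c = 0; c < cellNodes.getLength(); c++) {
                Element cell = (Element) cellNodes.item(c);
                NodeList data = cell.getElementsByTagNameNS(SS, "Data");
                // A Cell without a Data child is treated as an empty string.
                cells.add(data.getLength() > 0 ? data.item(0).getTextContent() : "");
            }
            rows.add(cells);
        }
        return rows;
    }

    public static void main(String[] args) throws Exception {
        if (args.length == 1) {
            // Read a real file, printing one comma-joined line per row.
            try (InputStream in = Files.newInputStream(Path.of(args[0]))) {
                for (List<String> row : readRows(in)) {
                    System.out.println(String.join(",", row));
                }
            }
            return;
        }
        // No argument: run on a tiny inline sample instead.
        String sample = "<?xml version=\"1.0\"?>"
                + "<Workbook xmlns=\"" + SS + "\" xmlns:ss=\"" + SS + "\">"
                + "<Worksheet ss:Name=\"Sheet1\"><Table>"
                + "<Row><Cell><Data ss:Type=\"String\">a</Data></Cell>"
                + "<Cell><Data ss:Type=\"Number\">1</Data></Cell></Row>"
                + "</Table></Worksheet></Workbook>";
        InputStream in = new ByteArrayInputStream(sample.getBytes(StandardCharsets.UTF_8));
        System.out.println(readRows(in));
    }
}
```

If the root element of the file turns out to be something other than a Workbook in that namespace, readRows simply returns an empty list, which is a quick way to test the assumption.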


