lucene-java-user mailing list archives

From saisantoshi <saisantosh...@gmail.com>
Subject Re: Readers for extracting textual info from pd/doc/excel for indexing the actual content
Date Tue, 05 Feb 2013 21:17:47 GMT
I am looking at the formats supported by the newer version of Tika (1.3) and
am not sure which versions of Microsoft Office (97/2000/2010/2013) it
supports for each of the formats below.

http://tika.apache.org/1.3/formats.html#Microsoft_Office_document_formats


Microsoft Word (does it support both docx and doc formats?)
Microsoft Excel (xlsx and xls)
Microsoft PowerPoint (pptx and ppt)

I would appreciate it if you could point me to a link that lists all the
supported versions for the above.


