lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gopikrishnan Subramani" <gopi.subram...@gmail.com>
Subject Re: Indexing MS Powerpoint files with Lucene
Date Thu, 07 Sep 2006 09:50:54 GMT
Did you check POI javadocs? Look for
org.apache.poi.hslf.extractor.PowerPointExtractor. It's one of the most
straightforward classes from POI as far extracting text for indexing is
concerned.

-Gopi

On 9/7/06, Venkateshprasanna <prasannahmv@yahoo.co.in> wrote:
>
>
> Is there any filter available for extracting text from MS Powerpoint files
> and indexing them?
> The lucene website suggests the POI project, which, it seems does not
> support PPT files as of now.
>
> Regards,
> Venkateshprasanna
>
> --
> View this message in context:
> http://www.nabble.com/which-way-to-index-pdf%2Cword%2Cexcel-tf2224468.html#a6185039
> Sent from the Lucene - Java Users forum at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message