lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "yueyu lin" <popeye...@gmail.com>
Subject Re: which way to index pdf,word,excel
Date Wed, 06 Sep 2006 06:00:02 GMT
First, Lucene is just a index toolkit, you have to USE it to implement your
application.

If you want to index something, you must have knowledge how to extract
information from them and what kind of keys they need to be set.

Then you can do what you want to.
On 9/5/06, James liu <liuping.james@gmail.com> wrote:
>
> i wanna find frame which can index xml,word,excel,pdf,,,not one.
>
>
> 2006/9/6, Doron Cohen <DORONC@il.ibm.com>:
> >
> > Lucene FAQ - http://wiki.apache.org/jakarta-lucene/LuceneFAQ - has a few
> > entries just for this:
> >
> >   How can I index HTML documents?
> >   How can I index XML documents?
> >   How can I index OpenOffice.org files?
> >   How can I index MS-Word documents?
> >   How can I index MS-Excel documents?
> >   How can I index MS-Powerpoint documents?
> >   How can I index Email (from MS-Exchange or another IMAP server) ?
> >   How can I index RTF documents?
> >   How can I index PDF documents?
> >   How can I index JSP files?
> >
> >
> > "James liu" <liuping.james@gmail.com> wrote on 05/09/2006 19:14:24:
> >
> > > i find lius many question ,,,,so i wanna give up and find new.
> > >
> > > who recommend ?
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >
>
>


-- 
--
Yueyu Lin

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message