lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <otis_gospodne...@yahoo.com>
Subject RE: Integrating the PDF Extract With Lucene!!!!
Date Thu, 17 Oct 2002 14:37:14 GMT
You need to:
1. look at the Javadoc.  Link on the Lucene site.
2. go through the demo.  Link on the Lucene site.
3. look at the demo in the distribution.  Download Lucene and look for
anything named 'demo'.

It sounds like you haven't done any of that yet....

Otis

--- Vinod Bhagat <vbhagat@blastradius.com> wrote:
> Hi Otis
> 
>   May be i am asking for more.. is there some class that accepts this
> document field terminology, would you be in a position to name out
> the class
> name,
> 
>  and also i did not understand clearly what u mean by creating Field
> form
> the text  and than create document form these field....
> 
>  The Q are...
> 
>  1)  How to decide how much or what to be in a field....
> 
>  2) and than how much and what to be in document.
> 
>  Considering the PDF file might have 300+ pages.. than how much
> logical
> field and hence document can be created....
> 
>  I think i am acting like an idiot asking the above Q... may be
> things will
> be more cleared if i know about this demo class and the understanding
> of the
> Fields and document categorization.
> 
>  wait for your positive reply.
> 
> Cheers
> Vin.
> 
> -----Original Message-----
> From: Otis Gospodnetic [mailto:otis_gospodnetic@yahoo.com]
> Sent: Thursday, October 17, 2002 3:00 PM
> To: Lucene Users List
> Subject: Re: Integrating the PDF Extract With Lucene!!!!
> 
> 
> Once you extract the content of the PDF and have it in your String
> variables, you can create Fields with them, then create Documents
> with
> Fields, and finally add those Documents to IndexWriter, which indexes
> them.
> 
> Please look at the demo code that comes with Lucene first.
> 
> Otis
> 
> --- Vinod Bhagat <vbhagat@blastradius.com> wrote:
> > Hi Gurus
> > 
> >  I manage to get the content form the PDF file using the JPedal
> > libraries.
> > Now i need to use this content to Index inside Lucene, so that PDF
> > (binaries) files can be searched/indexed by Lucene. 
> > 
> > And i am new with Lucene. Can anyone share there experience of
> > indexing the
> > extracted content from PDF into Lucene. How to go about it, i have
> no
> > idea
> > at the moment?
> > 
> >  Wait for the positive and early response.
> > 
> >  Best Regards.
> > 
> >  Vin
> > 
> > --
> > To unsubscribe, e-mail:  
> > <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> > For additional commands, e-mail:
> > <mailto:lucene-user-help@jakarta.apache.org>
> > 
> 
> 
> __________________________________________________
> Do you Yahoo!?
> Faith Hill - Exclusive Performances, Videos & More
> http://faith.yahoo.com
> 
> --
> To unsubscribe, e-mail:
> <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail:
> <mailto:lucene-user-help@jakarta.apache.org>
> 
> --
> To unsubscribe, e-mail:  
> <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail:
> <mailto:lucene-user-help@jakarta.apache.org>
> 


__________________________________________________
Do you Yahoo!?
Faith Hill - Exclusive Performances, Videos & More
http://faith.yahoo.com

--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message