lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vinod Bhagat <vbha...@blastradius.com>
Subject RE: Integrating the PDF Extract With Lucene!!!!
Date Thu, 17 Oct 2002 13:04:11 GMT
Hi Otis

  May be i am asking for more.. is there some class that accepts this
document field terminology, would you be in a position to name out the class
name,

 and also i did not understand clearly what u mean by creating Field form
the text  and than create document form these field....

 The Q are...

 1)  How to decide how much or what to be in a field....

 2) and than how much and what to be in document.

 Considering the PDF file might have 300+ pages.. than how much logical
field and hence document can be created....

 I think i am acting like an idiot asking the above Q... may be things will
be more cleared if i know about this demo class and the understanding of the
Fields and document categorization.

 wait for your positive reply.

Cheers
Vin.

-----Original Message-----
From: Otis Gospodnetic [mailto:otis_gospodnetic@yahoo.com]
Sent: Thursday, October 17, 2002 3:00 PM
To: Lucene Users List
Subject: Re: Integrating the PDF Extract With Lucene!!!!


Once you extract the content of the PDF and have it in your String
variables, you can create Fields with them, then create Documents with
Fields, and finally add those Documents to IndexWriter, which indexes
them.

Please look at the demo code that comes with Lucene first.

Otis

--- Vinod Bhagat <vbhagat@blastradius.com> wrote:
> Hi Gurus
> 
>  I manage to get the content form the PDF file using the JPedal
> libraries.
> Now i need to use this content to Index inside Lucene, so that PDF
> (binaries) files can be searched/indexed by Lucene. 
> 
> And i am new with Lucene. Can anyone share there experience of
> indexing the
> extracted content from PDF into Lucene. How to go about it, i have no
> idea
> at the moment?
> 
>  Wait for the positive and early response.
> 
>  Best Regards.
> 
>  Vin
> 
> --
> To unsubscribe, e-mail:  
> <mailto:lucene-user-unsubscribe@jakarta.apache.org>
> For additional commands, e-mail:
> <mailto:lucene-user-help@jakarta.apache.org>
> 


__________________________________________________
Do you Yahoo!?
Faith Hill - Exclusive Performances, Videos & More
http://faith.yahoo.com

--
To unsubscribe, e-mail:
<mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail:
<mailto:lucene-user-help@jakarta.apache.org>

--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message