lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <>
Subject Re: Lucene parsing for PDF
Date Thu, 29 Dec 2005 10:21:20 GMT
Shyam - I moderated your message through, so please subscribe to the  
list to send to it in the future.

Please provide us with some details - a standalone RAMDirectory-using  
JUnit TestCase is the most ideal way to share an issue like this and  
have someone else take a look at it.  And frequently the act of  
distilling an issue down to a test case points out the error being  
made :)


On Dec 29, 2005, at 1:40 AM, Shyam Bhaskaran wrote:

> Hi,
> I am working on a search project using Lucene and currently I am  
> working on
> parsing PDF documents. I was successful in implementing my parser  
> using
> Lucene and PDFBox. I have a doubt on how to exclude or (maybe  
> delete) pages
> from the index. I am not sure how to do this.. I mean when exactly  
> it has to
> be done.. Looking at the Lucene book it tells about removing  
> documents using
> Lucene by id or by term, but I was not successful in implementing  
> this.. Can
> anyone help me with this...
> Regards,
> Shyam

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message