Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 94111 invoked from network); 29 Dec 2005 10:21:54 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 29 Dec 2005 10:21:54 -0000 Received: (qmail 91077 invoked by uid 500); 29 Dec 2005 10:21:51 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 90134 invoked by uid 500); 29 Dec 2005 10:21:49 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 90123 invoked by uid 99); 29 Dec 2005 10:21:49 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Dec 2005 02:21:48 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [69.55.225.129] (HELO ehatchersolutions.com) (69.55.225.129) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Dec 2005 02:21:48 -0800 Received: by ehatchersolutions.com (Postfix, from userid 504) id 3661E13E201D; Thu, 29 Dec 2005 05:21:26 -0500 (EST) Received: from [172.16.1.101] (va-71-48-129-167.dhcp.sprint-hsd.net [71.48.129.167]) by ehatchersolutions.com (Postfix) with ESMTP id 4F0C813E2005 for ; Thu, 29 Dec 2005 05:21:22 -0500 (EST) Mime-Version: 1.0 (Apple Message framework v746.2) In-Reply-To: <3beac0c40512282240h48f80d3bl89916c0a89e4272d@mail.gmail.com> References: <3beac0c40512282240h48f80d3bl89916c0a89e4272d@mail.gmail.com> Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: <6680BFDE-AF3F-421A-B61F-E429663573A7@ehatchersolutions.com> Content-Transfer-Encoding: 7bit From: Erik Hatcher Subject: Re: Lucene parsing for PDF Date: Thu, 29 Dec 2005 05:21:20 -0500 To: java-user@lucene.apache.org X-Mailer: Apple Mail (2.746.2) X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Shyam - I moderated your message through, so please subscribe to the list to send to it in the future. Please provide us with some details - a standalone RAMDirectory-using JUnit TestCase is the most ideal way to share an issue like this and have someone else take a look at it. And frequently the act of distilling an issue down to a test case points out the error being made :) Erik On Dec 29, 2005, at 1:40 AM, Shyam Bhaskaran wrote: > Hi, > > I am working on a search project using Lucene and currently I am > working on > parsing PDF documents. I was successful in implementing my parser > using > Lucene and PDFBox. I have a doubt on how to exclude or (maybe > delete) pages > from the index. I am not sure how to do this.. I mean when exactly > it has to > be done.. Looking at the Lucene book it tells about removing > documents using > Lucene by id or by term, but I was not successful in implementing > this.. Can > anyone help me with this... > > Regards, > Shyam --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org