From: Ulf Dittmer
Subject: Re: indexing pdfs
Date: Thu, 8 Mar 2007 11:12:46 +0100
To: java-user@lucene.apache.org

For DOC files you can use the Jakarta POI library.
Text extraction is outlined here:
http://jakarta.apache.org/poi/hwpf/quick-guide.html

Ulf

On 08.03.2007, at 10:37, ashwin kumar wrote:

> hi can some one help me by giving any sample programs for indexing
> pdfs and .doc files