Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2F14599E6 for ; Wed, 1 Feb 2012 13:57:38 +0000 (UTC) Received: (qmail 26236 invoked by uid 500); 1 Feb 2012 13:57:35 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 26187 invoked by uid 500); 1 Feb 2012 13:57:35 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 26179 invoked by uid 99); 1 Feb 2012 13:57:34 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Feb 2012 13:57:34 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of Prasad.Kokepudi@ness.com designates 196.12.44.204 as permitted sender) Received: from [196.12.44.204] (HELO MAIL8.NESS.COM) (196.12.44.204) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Feb 2012 13:57:28 +0000 Received: from inhydnt11.ness.com ([192.168.36.23]) by MAIL8.NESS.COM with ESMTP; 01 Feb 2012 19:27:06 +0530 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Subject: RE: lucene-3.0.3 Date: Wed, 1 Feb 2012 19:24:11 +0530 Message-ID: <3D7F018025EA1F429F25962058105DA707EC9C26@inhydnt11.ness.com> In-Reply-To: X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: lucene-3.0.3 Thread-Index: Aczg6NbvkWfgv+rgR+6Jzghg5mk5gQAAECQA References: <3D7F018025EA1F429F25962058105DA707EC9BCA@inhydnt11.ness.com> From: "Prasad KVSH" To: Cc: "Vijaya Kumar Tadavarthy" X-Virus-Checked: Checked by ClamAV on apache.org It will be great if you provide some working examples on this. We tried to deploy solr.war but getting exceptions. Thanks Prasad -----Original Message----- From: Ian Lea [mailto:ian.lea@gmail.com]=20 Sent: Wednesday, February 01, 2012 7:22 PM To: java-user@lucene.apache.org Subject: Re: lucene-3.0.3 You could also take a look at Solr. From http://lucene.apache.org/solr/features.html * Easy ways to pull in data from databases and XML files from local disk and HTTP sources * Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using Apache Tika Sounds just what you need. -- Ian. On Wed, Feb 1, 2012 at 1:34 PM, KARTHIK SHIVAKUMAR wrote: > Hi > >>>lucene-3.0.3 can be used for searching a text from > > Lucene 's primary job is to do a text search. > > May it be PDF/HTML/XML/MSword/PPT/XLS > > U have to have the code for plugin to do 2 things > > 1) Strip text from either of the Documents=20 > (PDF/HTML/XML/MSword/PPT/XLS) > 2) Index this processed text using Lucene > > The indexed process can be later used for Searching thru the required=20 > content. > > ;) > with regards > karthik > > > On Wed, Feb 1, 2012 at 6:37 PM, Prasad KVSH wrote: > >> Hi, >> >> >> >> lucene-3.0.3 can be used for searching a text from PDF, xlsx, docx,=20 >> doc, xls, msg, TXT files. For this we have any common function to=20 >> accomplish this. Please help me on this. >> >> >> >> Thanks >> >> Prasad >> >> >> >> > > > -- > *N.S.KARTHIK > R.M.S.COLONY > BEHIND BANK OF INDIA > R.M.V 2ND STAGE > BANGALORE > 560094* --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org