From: Erick Erickson
To: java-user@lucene.apache.org
Date: Wed, 1 Feb 2012 08:59:48 -0500
Subject: Re: lucene-3.0.3

What did you try and what exceptions did you get? You might review:
http://wiki.apache.org/solr/UsingMailingLists

Best
Erick

On Wed, Feb 1, 2012 at 8:54 AM, Prasad KVSH wrote:
> It will be great if you provide some working examples on this. We tried
> to deploy solr.war but getting exceptions.
>
> Thanks
> Prasad
>
> -----Original Message-----
> From: Ian Lea [mailto:ian.lea@gmail.com]
> Sent: Wednesday, February 01, 2012 7:22 PM
> To: java-user@lucene.apache.org
> Subject: Re: lucene-3.0.3
>
> You could also take a look at Solr. From
> http://lucene.apache.org/solr/features.html
>
> * Easy ways to pull in data from databases and XML files from local
>   disk and HTTP sources
>
> * Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using
>   Apache Tika
>
> Sounds just what you need.
>
> --
> Ian.
>
> On Wed, Feb 1, 2012 at 1:34 PM, KARTHIK SHIVAKUMAR wrote:
>> Hi
>>
>>>> lucene-3.0.3 can be used for searching a text from
>>
>> Lucene's primary job is to do a text search.
>>
>> May it be PDF/HTML/XML/MSword/PPT/XLS
>>
>> U have to have the code for plugin to do 2 things
>>
>> 1) Strip text from either of the Documents
>>    (PDF/HTML/XML/MSword/PPT/XLS)
>> 2) Index this processed text using Lucene
>>
>> The indexed process can be later used for Searching thru the required
>> content.
>>
>> ;)
>> with regards
>> karthik
>>
>> On Wed, Feb 1, 2012 at 6:37 PM, Prasad KVSH wrote:
>>
>>> Hi,
>>>
>>> lucene-3.0.3 can be used for searching a text from PDF, xlsx, docx,
>>> doc, xls, msg, TXT files. For this we have any common function to
>>> accomplish this. Please help me on this.
>>>
>>> Thanks
>>>
>>> Prasad
>>
>> --
>> *N.S.KARTHIK
>> R.M.S.COLONY
>> BEHIND BANK OF INDIA
>> R.M.V 2ND STAGE
>> BANGALORE
>> 560094*
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org