lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasad KVSH" <Prasad.Kokep...@ness.com>
Subject RE: lucene-3.0.3
Date Wed, 01 Feb 2012 16:53:43 GMT
Hi,
 
Please find our requirement and we trying to accomplish this.

Our client is looking for a Extended search engine like searching the given text inside the
documents like (PDF, Msg, Excel, XML, Word, TXT etc)  and return the list of file names where
it find the text. Using the return list we can populate them in User Interface after validating
with user access rights. Actually we have one image server in that there will be few folders
and sub folders, each folder will have may have 10,000 files.

so far we are search text for TXT files only using lucene-3.0.3. 

Thanks

Prasad


________________________________

From: KARTHIK SHIVAKUMAR [mailto:nskarthik.k@gmail.com]
Sent: Wed 2/1/2012 7:04 PM
To: java-user@lucene.apache.org
Subject: Re: lucene-3.0.3



Hi

>>lucene-3.0.3 can be used for searching a text from

Lucene 's primary job is to do a text search.

May it be PDF/HTML/XML/MSword/PPT/XLS

U have to have the code for plugin to do 2 things

1) Strip text from either of the Documents (PDF/HTML/XML/MSword/PPT/XLS)
2) Index this processed text using Lucene

The indexed process can be later used for Searching thru the required
content.

;)
with regards
karthik


On Wed, Feb 1, 2012 at 6:37 PM, Prasad KVSH <Prasad.Kokepudi@ness.com>wrote:

> Hi,
>
>
>
> lucene-3.0.3 can be used for searching a text from PDF, xlsx, docx, doc,
> xls, msg, TXT files. For this we have any common function to accomplish
> this. Please help me on this.
>
>
>
> Thanks
>
> Prasad
>
>
>
>


--
*N.S.KARTHIK
R.M.S.COLONY
BEHIND BANK OF INDIA
R.M.V 2ND STAGE
BANGALORE
560094*



Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message