lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rajesh Munavalli" <raje...@dessci.com>
Subject RE: search alogorithm in Lucene
Date Mon, 08 Aug 2005 17:23:46 GMT
Lucene considers text documents only. If you use the standard analyzer
all the contents in the document will be parsed the same way. To index
XML document you need to come up with your own Analyzer/Tokenizer which
separates XML tags and indexes accordingly. I guess you want to preserve
the meta-data contained in the XML document.

--
Rajesh Munavalli 

-----Original Message-----
From: Madhu Panitini [mailto:Madhu.Panitini@pass-consulting.com] 
Sent: Monday, August 08, 2005 12:17 PM
To: general@lucene.apache.org
Subject: RE: search alogorithm in Lucene

Hi one more question

Is there any format of text file that lucene eexpects some think like
addition of XML tags for the text document which is given for lucene
before indexing.

regards
madhu

-----Original Message-----
From: Madhu Panitini
Sent: Monday, August 08, 2005 7:02 PM
To: general@lucene.apache.org
Subject: search alogorithm in Lucene

Hi all,
I new to the lucene, but I am familiar with the IR. I want build IR
system in Java and I found Lucene, but some questions remained
unanswered for me after searching complete website. 

I have couple of questions regarding Lucene, 

1. What is the search algorithm(s)[VSM, ..] used or available in the
Lucene?

2. How term weight is calculated in Lucene, how many types of term
weight calculating formulas are implemented and what are they?

Regards
Madhu





Mime
View raw message