lucene-java-user mailing list archives

From Ian Lea
Subject Re: XML results ranking
Date Fri, 16 Jul 2010 08:40:14 GMT

If you google "Lucene xml" you'll find info, but I'll attempt to
answer your questions below.

> ...
> I wonder whether Lucene:
> (1) provides full-text search over content of XML elements ?

Yes.  If you index the content, Lucene will let you search over it.

> (2) provides substring search over values of attributes of XML elements ?

Yes, there is wildcard support.  Or use something like n-grams.
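
To make the n-gram idea concrete, here is a minimal sketch in plain Java,
assuming you index every overlapping 3-character slice of an attribute
value as its own term so that a substring query can match on exact terms
rather than wildcards. NGrams and charNGrams are hypothetical names, not
Lucene API; in practice an n-gram tokenizer in the analysis chain would
normally do this work at index time.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: splits a value into overlapping
// character n-grams so that each one could be indexed as a separate term.
class NGrams {
    static List<String> charNGrams(String value, int n) {
        if (n <= 0) {
            throw new IllegalArgumentException("n must be positive");
        }
        List<String> grams = new ArrayList<>();
        // Slide a window of width n across the value, one character at a time.
        for (int i = 0; i + n <= value.length(); i++) {
            grams.add(value.substring(i, i + n));
        }
        return grams;
    }

    public static void main(String[] args) {
        // prints [luc, uce, cen, ene]
        System.out.println(charNGrams("lucene", 3));
    }
}
```

The trade-off is index size: every value produces roughly one term per
character, in exchange for avoiding leading-wildcard queries.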

> (3) scores relevance of matching XML documents ?

Yes.

> (4) allows to identify (in matching document) XML elements with matched
> query terms and than navigate to parental/children nodes in XML structure ?

Not really.

> (5) provides a way to give more weight to some XML element types during
> relevance scoring ?

Yes.  See boosting.
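
One common way to apply a boost is at query time, using the `field:term^boost`
form of the classic query parser syntax. The sketch below only builds such a
query string in plain Java; it assumes each XML element type was indexed
into its own field, that the field names "title" and "body" exist in your
index, and that the term contains no characters that need escaping.
BoostedQuery is a hypothetical helper, not Lucene API.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: builds a query string that searches the same term in
// several fields, weighting each field with a ^boost suffix.
class BoostedQuery {
    static String build(String term, Map<String, Float> fieldBoosts) {
        StringBuilder query = new StringBuilder();
        for (Map.Entry<String, Float> e : fieldBoosts.entrySet()) {
            if (query.length() > 0) {
                query.append(' ');
            }
            // Assumption: term is a single word with no special characters.
            query.append(e.getKey()).append(':').append(term)
                 .append('^').append(e.getValue());
        }
        return query.toString();
    }

    public static void main(String[] args) {
        Map<String, Float> boosts = new LinkedHashMap<>();
        boosts.put("title", 2.0f); // matches in <title> count double
        boosts.put("body", 1.0f);
        // prints title:lucene^2.0 body:lucene^1.0
        System.out.println(build("lucene", boosts));
    }
}
```

The resulting string would then be handed to a query parser; boosts can
also be set at index time, which is not shown here.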

Lucene is a library that doesn't index XML directly, but you can write
code to parse your XML and feed it into Lucene, specifying which
fields you want indexed and which stored for later retrieval.
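
The parsing half of that could look roughly like the sketch below, which
uses only the JDK's DOM parser to flatten an XML document into
(field name, text) pairs. Using the element path as the field name, and
"@" for attributes, is an assumption made for illustration; XmlFlattener
is a hypothetical class. Each pair would then be added to a Lucene
document as a field, which is deliberately left out because the exact
field classes differ between Lucene versions.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NamedNodeMap;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper: turns XML into {fieldName, text} pairs ready for indexing.
class XmlFlattener {
    static List<String[]> flatten(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            List<String[]> fields = new ArrayList<>();
            walk(doc.getDocumentElement(), "", fields);
            return fields;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    private static void walk(Element el, String parentPath, List<String[]> out) {
        String path = parentPath.isEmpty() ? el.getTagName() : parentPath + "/" + el.getTagName();

        // Attributes become fields named path/@attribute.
        NamedNodeMap attrs = el.getAttributes();
        for (int i = 0; i < attrs.getLength(); i++) {
            Node attr = attrs.item(i);
            out.add(new String[] {path + "/@" + attr.getNodeName(), attr.getNodeValue()});
        }

        // Text directly inside this element becomes a field named after the path.
        StringBuilder text = new StringBuilder();
        NodeList children = el.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child.getNodeType() == Node.TEXT_NODE
                    || child.getNodeType() == Node.CDATA_SECTION_NODE) {
                text.append(child.getNodeValue());
            }
        }
        String trimmed = text.toString().trim();
        if (!trimmed.isEmpty()) {
            out.add(new String[] {path, trimmed});
        }

        // Recurse into child elements.
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child.getNodeType() == Node.ELEMENT_NODE) {
                walk((Element) child, path, out);
            }
        }
    }

    public static void main(String[] args) {
        String xml = "<book id=\"42\"><title>Lucene in Action</title></book>";
        for (String[] field : flatten(xml)) {
            System.out.println(field[0] + " = " + field[1]);
        }
    }
}
```

Keeping the element path in the field name also gives you a rough idea of
which element matched, although it does not let you navigate the XML tree
from a hit.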

