lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject Re: Dmitry's Term Vector stuff, plus some
Date Wed, 25 Feb 2004 23:01:46 GMT
nice suggestion about capping the highlighter's number of tokens - I'll add that in.

I've had a quick look at your knowledgebase docs. Can't you split them at index time into
multiple smaller docs using the <a name="xxx"> tags as doc boundaries?
Each lucene document could then have a field with the URL [sourcedoc]#xxx, taking you to the
relevant section in the source document.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message