lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From markharw...@yahoo.co.uk
Subject Re: Dmitry's Term Vector stuff, plus some
Date Wed, 25 Feb 2004 23:01:46 GMT
Doug,
nice suggestion about capping the highlighter's number of tokens - I'll add that in.

Bruce,
I've had a quick look at your knowledgebase docs. Can't you split them at index time into
multiple smaller docs using the <a name="xxx"> tags as doc boundaries?
Each lucene document could then have a field with the URL [sourcedoc]#xxx, taking you to the
relevant section in the source document.

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org


Mime
View raw message