lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Beard <brian_s_be...@hotmail.com>
Subject Re: highlighter / fragmenter performance for large fields
Date Mon, 20 Oct 2008 21:42:48 GMT

Karsten,

Thanks, I will look into this.

>Hi Brian,
>
>I don't know the internals of highlighting („explanation“) in lucene.
>But I know that XTF (
>http://xtf.wiki.sourceforge.net/underHood_Documents#tocunderHood_Documents5
>) can handle very large documents (above 100 Mbyte) with highlighting very
>fast. The difference to your approach is, that xtf devide the document in
>small (overlapping) chunks and store the original text as xml separately
>with connection to lucene indexed fields via numbered xml-nodes.
>For large texts (above 200 KByte), it is the best tool I know.
>
>Best regards
>  Karsten


_________________________________________________________________
Store, manage and share up to 5GB with Windows Live SkyDrive.
http://skydrive.live.com/welcome.aspx?provision=1?ocid=TXT_TAGLM_WL_skydrive_102008
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message