lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: Can I retrieve token offsets from Hits?
Date Wed, 21 Jul 2004 11:12:57 GMT

On Jul 21, 2004, at 6:59 AM, Stepan Mik wrote:
> It is possible to retrieve tokens offsets (Token.startOffset(),
> Token.endOffset()) later when document is found and returned in hit
> collection?

No.... offsets are not stored in the index.  In fact, the only place 
they are currently used is with the Highlighter code.

>  I need these values for hihglighting. I've already looked to
> Highlighter in sandbox but it actually re-analyzes the original
> document's field. However, this is not preffered way when using
> complicated (performance demanding) analyzer. So my question is - it is
> possible to store (somehow) token offsets and get them later without
> reanalizing the document?

There has been lots of discussion on this topic in the past.  Perhaps 
you could dig up those threads to get a feel for what the latest 
thinking on this is.

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message