lucene-java-user mailing list archives

From "Jack Krupansky" <j...@basetechnology.com>
Subject Re: Getting position increments directly from the index
Date Thu, 23 May 2013 12:12:46 GMT
It might be nice to be able to ask for the largest position of a field in a
document. Is that information kept anywhere? Not that I know of, although I
suppose it can be calculated at runtime by running through all the terms of
the field. Then he could just divide that largest position by 1000 to
estimate the number of sentence boundaries in the document.
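
A minimal sketch of that arithmetic, in plain Java with no Lucene calls. It
assumes positions start at -1 before the first token, every ordinary token
adds an increment of 1, and the first token of each later sentence adds an
increment of 1000 instead. The class and method names are hypothetical.

```java
public class MaxPositionSketch {
    // Taken from the question in this thread: a sentence boundary is a
    // position increment of 1000.
    static final int SENTENCE_GAP = 1000;

    /**
     * Largest position in one field, given the token count of each
     * sentence. Sentences are assumed to be non-empty.
     */
    static int maxPosition(int... tokensPerSentence) {
        int position = -1;            // assumed start, before any token
        boolean firstSentence = true;
        for (int tokenCount : tokensPerSentence) {
            for (int i = 0; i < tokenCount; i++) {
                boolean opensLaterSentence = (i == 0) && !firstSentence;
                position += opensLaterSentence ? SENTENCE_GAP : 1;
            }
            firstSentence = false;
        }
        return position;
    }

    /** The "divide by 1000" step: integer division drops the remainder. */
    static int estimateBoundaries(int maxPosition) {
        return maxPosition / SENTENCE_GAP;
    }

    public static void main(String[] args) {
        // Three short sentences: two boundaries, and the estimate agrees.
        int shortDoc = maxPosition(4, 3, 5);
        System.out.println(shortDoc + " -> " + estimateBoundaries(shortDoc));

        // Two long sentences: one boundary, but the ordinary increments
        // add up to more than 1000, so the estimate is one too high.
        int longDoc = maxPosition(800, 800);
        System.out.println(longDoc + " -> " + estimateBoundaries(longDoc));
    }
}
```

Under these assumptions the division is exact only while the ordinary
increments in a document sum to less than 1000, so for long documents the
result is an upper bound rather than an exact count.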

-- Jack Krupansky

-----Original Message----- 
From: Michael McCandless
Sent: Thursday, May 23, 2013 6:28 AM
To: Lucene Users
Subject: Re: Getting position increments directly from the index

Do you actually index the sentence boundary as a token?  If so, you
could just get the totalTermFreq of that token?
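
To illustrate what that number would represent, here is a plain-Java sketch
with no Lucene calls. It assumes a dedicated marker token is emitted at the
end of every sentence; the token text "__sent_end__" and the class and method
names are hypothetical. The total occurrence count of that token across all
documents is the figure totalTermFreq would be expected to report for it.

```java
public class BoundaryTokenCountSketch {
    // Hypothetical marker token; the thread does not name one.
    static final String BOUNDARY = "__sent_end__";

    /**
     * Counts every occurrence of the marker token across all documents,
     * where each document is modelled as an array of its tokens.
     */
    static long totalBoundaryFreq(String[][] docs) {
        long total = 0;
        for (String[] tokens : docs) {
            for (String token : tokens) {
                if (BOUNDARY.equals(token)) {
                    total++;
                }
            }
        }
        return total;
    }

    public static void main(String[] args) {
        String[][] docs = {
            {"hello", "world", BOUNDARY, "bye", BOUNDARY},
            {"one", "sentence", BOUNDARY}
        };
        // Two sentences in the first document plus one in the second.
        System.out.println(totalBoundaryFreq(docs)); // prints 3
    }
}
```

Unlike dividing the largest position by 1000, this count does not depend on
document length, but it only works if the boundary really is indexed as a
token.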


Mike McCandless

http://blog.mikemccandless.com


On Wed, May 22, 2013 at 10:11 AM, Igor Shalyminov
<ishalyminov@yandex-team.ru> wrote:
> Hello!
>
> I'm storing sentence bounds in the index as position increments of 1000.
> I want to get the total number of sentences in the index, i.e. the number
> of "1000" increment values.
> Is there a way to do that other than loading each document and
> extracting position increments with a custom Analyzer?
>
> --
> Best Regards,
> Igor Shalyminov
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
