lucene-solr-user mailing list archives

From Robert Muir <rcm...@gmail.com>
Subject Re: how to retrieve total token count per collection/index
Date Thu, 09 Aug 2012 16:02:19 GMT
On Thu, Aug 9, 2012 at 10:20 AM, tech.vronk <tech@vronk.net> wrote:
> Hello,
>
> I wonder how to figure out the total token count in a collection (per
> index), i.e. the size of a corpus/collection measured in tokens.
>

You want to use this statistic, which gives the total number of tokens
in an indexed field:
http://lucene.apache.org/core/4_0_0-ALPHA/core/org/apache/lucene/index/Terms.html#getSumTotalTermFreq%28%29
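
To make clear what the number means: per its javadoc, getSumTotalTermFreq() is the sum of totalTermFreq over all terms in the field, i.e. every token occurrence counted once. It returns -1 if the codec does not store the measure or the field omits term frequencies, and it does not take deleted documents into account. Below is a minimal plain-Java sketch of the same arithmetic on a toy in-memory corpus. It is not Lucene code. The class name TokenCountSketch, the whitespace tokenizer, and the lowercasing are assumptions made for illustration only. Against a real index you would obtain the field's Terms first (in 4.x, I believe via MultiFields.getTerms(reader, field)) and call getSumTotalTermFreq() on it.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration class, not part of Lucene.
final class TokenCountSketch {

    // Per-term occurrence count across all documents of one "field"
    // (analogous to totalTermFreq for each term).
    static Map<String, Long> totalTermFreqs(List<String> docs) {
        Map<String, Long> freqs = new HashMap<>();
        for (String doc : docs) {
            // Assumption: naive whitespace tokenization plus lowercasing
            // stands in for whatever analyzer the real field uses.
            for (String token : doc.trim().split("\\s+")) {
                if (token.isEmpty()) {
                    continue; // an empty document yields one empty string
                }
                freqs.merge(token.toLowerCase(Locale.ROOT), 1L, Long::sum);
            }
        }
        return freqs;
    }

    // Sum of the per-term counts: the total number of tokens in the field
    // (analogous to sumTotalTermFreq).
    static long sumTotalTermFreq(List<String> docs) {
        long sum = 0L;
        for (long freq : totalTermFreqs(docs).values()) {
            sum += freq;
        }
        return sum;
    }
}
```

Note that the statistic is per field, so a corpus size over several fields would be the sum of the values for each field you care about.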

-- 
lucidimagination.com
