lucene-dev mailing list archives

From "Tom Burton-West (JIRA)" <>
Subject [jira] Updated: (LUCENE-2393) Utility to output total term frequency and df from a lucene index
Date Fri, 16 Apr 2010 19:14:27 GMT


Tom Burton-West updated LUCENE-2393:

    Attachment: LUCENE-2393.patch

Updated HighFreqTermsWithTF to use the flex API.

I don't understand the flex API well enough yet to tell whether I should have done
a bulk read instead of iterating with DocsEnum.nextDoc() and DocsEnum.freq().

> Utility to output total term frequency and df from a lucene index
> -----------------------------------------------------------------
>                 Key: LUCENE-2393
>                 URL:
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/*
>            Reporter: Tom Burton-West
>            Priority: Trivial
>         Attachments: LUCENE-2393.patch, LUCENE-2393.patch, LUCENE-2393.patch
> This is a command-line utility that takes a field name, a term, and an index directory and
> outputs the document frequency for the term and the total number of occurrences of the term
> in the index (i.e. the sum of the tf of the term over all documents). It is useful for estimating
> the size of the term's entry in the *prx files and the consequent disk I/O demands.
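
To make the two numbers concrete, here is a minimal sketch in plain Java. It does not use the Lucene API and is not the code in the attached patch: the class TermStats and the array freqsPerDoc are hypothetical names, with the array standing in for a term's postings (one entry per document that contains the term, holding that document's tf). Under that assumption, the document frequency is the number of entries and the total term frequency is their sum.

```java
// Hypothetical stand-in for a term's postings; not part of Lucene or the patch.
final class TermStats {

    /** Document frequency: the number of documents that contain the term. */
    static int docFreq(int[] freqsPerDoc) {
        return freqsPerDoc.length;
    }

    /** Total term frequency: the sum of the per-document tf values. */
    static long totalTermFreq(int[] freqsPerDoc) {
        long total = 0; // long, since the sum over a large index can exceed int range
        for (int freq : freqsPerDoc) {
            total += freq;
        }
        return total;
    }

    public static void main(String[] args) {
        // Assumed example data: the term occurs 3, 1 and 5 times in three documents.
        int[] freqsPerDoc = {3, 1, 5};
        System.out.println("df=" + docFreq(freqsPerDoc)
                + " totalTF=" + totalTermFreq(freqsPerDoc));
        // prints "df=3 totalTF=9"
    }
}
```

In a real index the per-document frequencies would come from iterating the term's postings (the nextDoc()/freq() loop mentioned above) rather than from an array.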

