lucene-java-user mailing list archives

From Grant Ingersoll <gsing...@apache.org>
Subject Re: Retrieving TermVectors from a Field over the full index?
Date Mon, 11 Jun 2007 13:16:02 GMT
I am curious what you are going to do with this information.
Is it strictly for display, or are you doing some sort of reranking
algorithm? Knowing this may help us think of other ways to
solve the problem.

On Jun 9, 2007, at 11:19 AM, Benjamin Pasero wrote:

> Hi,
>
> I wonder if this is possible:
>
> Return all terms of a field in the index, together with the number of
> occurrences across all documents.
>
> E.g., given 10 documents with the field "author" in the index, 5 of them
> having the value "foo" and 5 the value "bar", I would like to build a
> map with:
>
> [foo] -> 5
> [bar] -> 5
>
> I looked at what Luke is doing to show the top terms of a given field
> in the index, and it seems to iterate over all terms (using
> IndexReader#terms()). Isn't that quite inefficient? I would at least
> expect a method IndexReader#terms(String field) to limit the terms to
> the desired field.
>
> Thanks for helping,
> Ben
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
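
For reference, the map described in the question can be built without walking every term if the term dictionary is sorted by field and then by term text, which is how Lucene orders its terms. Lucene releases of this era also have an IndexReader#terms(Term) overload that starts the enumeration at or after a given term. The sketch below does not call Lucene at all: a TreeMap stands in for the term dictionary, and the names FieldTermCounts, TermKey and countsForField are made up for illustration. The stored numbers play the role of a per-term document frequency, which matches the occurrence count only when each document holds one value for the field.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.TreeMap;

public class FieldTermCounts {

    /** Hypothetical stand-in for a term: a (field, text) pair ordered by field, then text. */
    public static final class TermKey implements Comparable<TermKey> {
        public final String field;
        public final String text;

        public TermKey(String field, String text) {
            this.field = field;
            this.text = text;
        }

        @Override
        public int compareTo(TermKey other) {
            int byField = field.compareTo(other.field);
            return byField != 0 ? byField : text.compareTo(other.text);
        }
    }

    /**
     * Collects term text -> count for one field. The scan starts at the first
     * entry >= (field, "") and stops as soon as the field name changes, so
     * entries belonging to other fields are never visited beyond the first one.
     */
    public static Map<String, Integer> countsForField(TreeMap<TermKey, Integer> dict, String field) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Map.Entry<TermKey, Integer> e : dict.tailMap(new TermKey(field, ""), true).entrySet()) {
            if (!e.getKey().field.equals(field)) {
                break; // left the block of terms for this field
            }
            counts.put(e.getKey().text, e.getValue());
        }
        return counts;
    }

    public static void main(String[] args) {
        // Toy dictionary mirroring the example in the question, plus two unrelated fields.
        TreeMap<TermKey, Integer> dict = new TreeMap<>();
        dict.put(new TermKey("author", "foo"), 5);
        dict.put(new TermKey("author", "bar"), 5);
        dict.put(new TermKey("body", "index"), 7);
        dict.put(new TermKey("title", "lucene"), 3);

        System.out.println(countsForField(dict, "author")); // prints {bar=5, foo=5}
    }
}
```

With a real index, the same seek-then-stop loop would run over the reader's term enumeration rather than a TreeMap.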

--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp

Read the Lucene Java FAQ at http://wiki.apache.org/jakarta-lucene/LuceneFAQ



