lucene-solr-dev mailing list archives

From "Andrzej Bialecki (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-1632) Distributed IDF
Date Fri, 11 Dec 2009 23:26:18 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789607#action_12789607 ]

Andrzej Bialecki  commented on SOLR-1632:
-----------------------------------------

I believe the API I propose would support such an implementation as well. Note that
it's usually not feasible to compute and distribute the complete IDF table for all terms:
you would have to replicate the union of all term dictionaries across the cluster. In practice,
you limit the amount of information by various means, e.g. distributing only the data related
to the current request (as this implementation does), reducing the frequency of updates (e.g. LRU
caching), or approximating global DF with a constant for frequent terms (where the contribution
of their IDF to the score would be negligible anyway).
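
To illustrate the per-request approach, here is a rough sketch in Python (not the code in distrib.patch). It assumes each shard reports its document count and the document frequencies of the query terms only, and that the coordinator sums them before scoring. The names merge_shard_stats and idf are hypothetical, and the formula is assumed to be the classic Lucene one, 1 + ln(numDocs / (docFreq + 1)).

```python
import math
from collections import Counter


def merge_shard_stats(shard_stats):
    """Sum per-shard statistics for the terms of a single request.

    shard_stats is a list of (num_docs, {term: doc_freq}) pairs, one per
    shard, covering only the query terms (hypothetical wire format).
    """
    total_docs = 0
    global_df = Counter()
    for num_docs, term_df in shard_stats:
        total_docs += num_docs
        global_df.update(term_df)  # adds the counts term by term
    return total_docs, dict(global_df)


def idf(doc_freq, num_docs):
    # Assumed formula: classic Lucene idf = 1 + ln(numDocs / (docFreq + 1)).
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Two non-uniform shards: "solr" is rare on the first, common on the second.
shards = [
    (1000, {"solr": 10, "the": 900}),
    (100, {"solr": 50, "the": 95}),
]
total_docs, global_df = merge_shard_stats(shards)

local_idf_shard1 = idf(10, 1000)
local_idf_shard2 = idf(50, 100)
global_idf = idf(global_df["solr"], total_docs)
```

With local statistics the two shards would weight "solr" very differently; with the merged statistics every shard scores the term with the same IDF, which lies between the two local values.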

> Distributed IDF
> ---------------
>
>                 Key: SOLR-1632
>                 URL: https://issues.apache.org/jira/browse/SOLR-1632
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>    Affects Versions: 1.5
>            Reporter: Andrzej Bialecki 
>         Attachments: distrib.patch
>
>
> Distributed IDF is a valuable enhancement for distributed search across non-uniform shards.
> This issue tracks the proposed implementation of an API to support this functionality in Solr.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

