lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Robert Muir (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-4782) Let the NaiveBayes classifier have a fallback docCount method if codec doesn't support Terms#docCount()
Date Wed, 20 Feb 2013 15:21:13 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-4782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13582237#comment-13582237
] 

Robert Muir commented on LUCENE-4782:
-------------------------------------

I'm not sure we have to realistically worry about this too much.

It only applies to 3.x indexes: in general all current codecs support this statistic.

So another option is to simply add SuppressCodecs("Lucene3x") annotation to the classification
module and document that you should run IndexUpgrader on any old 3.x segments you have lying
around.

                
> Let the NaiveBayes classifier have a fallback docCount method if codec doesn't support
Terms#docCount()
> -------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-4782
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4782
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/classification
>            Reporter: Tommaso Teofili
>            Assignee: Tommaso Teofili
>             Fix For: 4.2, 5.0
>
>
> In _SimpleNaiveBayesClassifier_ _docsWithClassSize_ variable is initialized to _MultiFields.getTerms(this.atomicReader,
this.classFieldName).getDocCount()_ which may be -1 if the codec doesn't support doc counts,
therefore there should be an alternative way to initialize such a variable with the documents
count.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message