lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Simon Willnauer (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (LUCENE-4813) Allow DirectSpellchecker to use totalTermFrequency rather than docFrequency
Date Thu, 07 Mar 2013 15:20:13 GMT

     [ https://issues.apache.org/jira/browse/LUCENE-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Simon Willnauer updated LUCENE-4813:
------------------------------------

    Attachment: LUCENE-4813.patch

next iteration making the new statistic optional and experts can just pass them in if they
want. This patch is bw compatible runtime wise while it breaks some apis since I change floats
to double but I think that is a fair game here. I think its close / ready
                
> Allow DirectSpellchecker to use totalTermFrequency rather than docFrequency
> ---------------------------------------------------------------------------
>
>                 Key: LUCENE-4813
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4813
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/spellchecker
>    Affects Versions: 4.1
>            Reporter: Simon Willnauer
>             Fix For: 4.2, 5.0
>
>         Attachments: LUCENE-4813.patch, LUCENE-4813.patch
>
>
> we have a bunch of new statistics in on our term dictionaries that we should make use
of where it makes sense. For DirectSpellChecker totalTermFreq and sumTotalTermFreq might be
better suited for spell correction on top of a fulltext index than docFreq and maxDoc

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message