lucene-lucene-net-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ben West (JIRA)" <j...@apache.org>
Subject [jira] Updated: (LUCENENET-366) Spellchecker issues
Date Mon, 03 May 2010 18:53:57 GMT

     [ https://issues.apache.org/jira/browse/LUCENENET-366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ben West updated LUCENENET-366:
-------------------------------

    Attachment: LUCENENET-366.patch

This patch just converts the Fields' store/index type to be the same as in java lucene 2.9.
I might not have time to update the whole thing, so just submitting this for now.

> Spellchecker issues
> -------------------
>
>                 Key: LUCENENET-366
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-366
>             Project: Lucene.Net
>          Issue Type: Bug
>            Reporter: Ben West
>            Priority: Minor
>         Attachments: LUCENENET-366.patch, LuceneNet-SpellcheckFixes.patch
>
>
> There are several issues with the spellchecker:
> - It doesn't do duplicate checking across updates (so the same word is often indexed
many, many times)
> - The n-gram fields are stored as well as indexed, which increases the size of the index
by several orders of magnitude and provides no benefit
> - Some deprecated functions are used, which slows it down
> - Some methods aren't commented fully
> I will attach a patch that fixes these.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message