lucene-solr-dev mailing list archives

From "Adam Hiatt (JIRA)" <>
Subject [jira] Commented: (SOLR-81) Add Query Spellchecker functionality
Date Sat, 03 Mar 2007 00:38:51 GMT


Adam Hiatt commented on SOLR-81:

> Is spelling check normally going to be integrated into the "main" index, or will it normally
> be a separate index?
AH: It is a separate index.

> If the latter, does it make more sense for some of this (the field definitions &
> handler) to be in contrib instead of core?
AH: That would be fine by me. However, it should be noted that it can be turned on for any

> Any other way to avoid "cluttering" the current schema.xml?
> If spelling check is to be a core feature (that one can turn on for any field in any
> index), it seems like it needs to be easier to configure. Having the user
> define all the ngram fields, fieldTypes, and copyField statements doesn't seem ideal.

AH: I think there is some confusion between Otis's version and mine. I was never able to get
Otis's version (a single index using ngram types + copyFields) working fully, so I went with
the pure SpellChecker implementation, which doesn't require any of that (no schema.xml additions).
It only requires the user to use a custom request handler to query for spelling corrections
(Otis wrote the original) and a custom commit handler (based on CommitRequestHandler) to rebuild
the spell checker index.

For the record, the version I committed is:

> Add Query Spellchecker functionality
> ------------------------------------
>                 Key: SOLR-81
>                 URL:
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>            Reporter: Otis Gospodnetic
>            Priority: Minor
>         Attachments: SOLR-81-edgengram-ngram.patch, SOLR-81-ngram-schema.patch, SOLR-81-ngram.patch,
> SOLR-81-ngram.patch, SOLR-81-ngram.patch, SOLR-81-ngram.patch, SOLR-81-spellchecker.patch
> Use the simple approach of n-gramming outside of Solr and indexing n-gram documents.
> For example:
> <doc>
> <field name="word">lettuce</field>
> <field name="start3">let</field>
> <field name="gram3">let ett ttu tuc uce</field>
> <field name="end3">uce</field>
> <field name="start4">lett</field>
> <field name="gram4">lett ettu ttuc tuce</field>
> <field name="end4">tuce</field>
> </doc>
> See:
> Java clients: SOLR-20 (add delete commit optimize), SOLR-30 (search)
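
To make the "n-gramming outside of Solr" idea in the issue description concrete, here is a minimal sketch in Python that produces the same fields as the "lettuce" example. The function names (ngrams, ngram_doc, to_xml) and the choice of sizes 3 and 4 are assumptions for illustration only; this is not code from any of the attached patches.

```python
from xml.sax.saxutils import escape


def ngrams(word, n):
    """Return all contiguous character n-grams of `word`, left to right."""
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def ngram_doc(word, sizes=(3, 4)):
    """Build (field name, value) pairs for one word.

    Field names follow the issue's example: startN, gramN, endN.
    The default sizes are an assumption taken from that example.
    """
    fields = [("word", word)]
    for n in sizes:
        grams = ngrams(word, n)
        if not grams:
            # The word is shorter than n, so there is nothing to index.
            continue
        fields.append((f"start{n}", grams[0]))
        fields.append((f"gram{n}", " ".join(grams)))
        fields.append((f"end{n}", grams[-1]))
    return fields


def to_xml(fields):
    """Render the fields as a <doc> element like the one in the issue."""
    lines = ["<doc>"]
    lines += [f'<field name="{name}">{escape(value)}</field>'
              for name, value in fields]
    lines.append("</doc>")
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_xml(ngram_doc("lettuce")))
```

Run as a script, this prints a <doc> element for "lettuce" with the word, start3, gram3, end3, start4, gram4 and end4 fields. Words shorter than a given gram size simply get no fields for that size, which is one possible choice rather than anything the issue specifies.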

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
