lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shai Erera (JIRA)" <>
Subject [jira] [Commented] (SOLR-3700) Create a Classification component
Date Thu, 30 Aug 2012 09:51:07 GMT


Shai Erera commented on SOLR-3700:

Is there any reason not to develop it as a Lucene module? I haven't looked at the patch, but
if it's not Solr-specific, or depends on Solr API, perhaps we can make this issue a LUCENE-####

I see no reason such module will be available for Solr users only, unless you plan to depend
on Solr API, in which case I will not slow down your development by insisting it becomes a
Lucene module.
> Create a Classification component
> ---------------------------------
>                 Key: SOLR-3700
>                 URL:
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Tommaso Teofili
>            Assignee: Tommaso Teofili
>            Priority: Minor
>         Attachments: SOLR-3700_2.patch, SOLR-3700.patch
> Lucene/Solr can host huge sets of documents containing lots of information in fields
so that these can be used as training examples (w/ features) in order to very quickly create
classifiers algorithms to use on new documents and / or to provide an additional service.
> So the idea is to create a contrib module (called 'classification') to host a ClassificationComponent
that will use already seen data (the indexed documents / fields) to classify new documents
/ text fragments.
> The first version will contain a (simplistic) Lucene based Naive Bayes classifier but
more implementations should be added in the future.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message