lucene-solr-dev mailing list archives

From "Stanislaw Osinski (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-1692) CarrotClusteringEngine produce summary does nothing
Date Sat, 02 Jan 2010 20:29:54 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795914#action_12795914 ]

Stanislaw Osinski commented on SOLR-1692:
-----------------------------------------

I've had a quick look into this issue and have two questions to consider:

* Where should the configuration of the highlighter we use for clustering come from? Should
it be the same as for regular Solr highlighting, or should we allow a clustering-specific
configuration? My intuition is that we should go with the former. Otherwise, we may lose the
clear relationship between cluster labels and documents in the output, because the clusters
will be generated from text that differs from what the user is going to see.

* What should we do if the highlighter is not able to generate a summary? One option is to
use the full contents of the field. Alternatively, we could use the first N characters of the
field, with N configurable. The answer really depends on the characteristics of the data we may
get. If the total number of documents fed to Carrot2 doesn't exceed about 1,000, longer documents
shouldn't be too much of a problem, so I'd suggest the former option (use the full field text).
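
To make the second question concrete, here is a rough sketch of the fallback logic, covering both options. The class and method names (SummaryFallback, textForClustering) and the " ... " snippet separator are made up for illustration and are not part of Solr or Carrot2; the snippets array just stands in for whatever the highlighter returns.

```java
import java.util.Arrays;
import java.util.stream.Collectors;

public class SummaryFallback {

    /**
     * Hypothetical helper (not Solr API): picks the text to feed to clustering.
     *
     * @param snippets   highlighter output for the field; may be null or empty
     * @param fieldValue full stored value of the field; may be null
     * @param maxChars   fallback limit; a value <= 0 means "use the full field"
     */
    static String textForClustering(String[] snippets, String fieldValue, int maxChars) {
        // Prefer the highlighter's summary when it produced any non-empty snippet.
        if (snippets != null) {
            String joined = Arrays.stream(snippets)
                    .filter(s -> s != null && !s.isEmpty())
                    .collect(Collectors.joining(" ... "));
            if (!joined.isEmpty()) {
                return joined;
            }
        }
        if (fieldValue == null) {
            return "";
        }
        // Option 1: no limit configured, or the field is short enough: use it whole.
        if (maxChars <= 0 || fieldValue.length() <= maxChars) {
            return fieldValue;
        }
        // Option 2: use the first N characters. Note that this counts UTF-16
        // code units, so it could split a surrogate pair at the boundary.
        return fieldValue.substring(0, maxChars);
    }

    public static void main(String[] args) {
        String body = "Carrot2 clusters search results into labelled groups.";
        // No snippets and no limit: falls back to the full field text.
        System.out.println(textForClustering(null, body, 0));
        // No snippets, limit of 7 characters: falls back to a prefix.
        System.out.println(textForClustering(new String[0], body, 7));
    }
}
```

With maxChars left at 0 this behaves like the option I suggested (full field text); a positive value switches to the first-N-characters variant without changing the calling code.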

> CarrotClusteringEngine produce summary does nothing
> ---------------------------------------------------
>
>                 Key: SOLR-1692
>                 URL: https://issues.apache.org/jira/browse/SOLR-1692
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - Clustering
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>             Fix For: 1.5
>
>
> In the CarrotClusteringEngine, the produceSummary option does nothing, as the results
> of doing the highlighting are just ignored.

-- 
This message is automatically generated by JIRA.
