lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Areek Zillur (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (LUCENE-5294) Suggester Dictionary implementation that takes expressions as term weights
Date Sun, 20 Oct 2013 11:17:42 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-5294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13800125#comment-13800125
] 

Areek Zillur commented on LUCENE-5294:
--------------------------------------

Thanks Michael for the response!
  - I will open up a jira to add support to CompositeReader for the Dictionary implementation
(and hopefully change the DocumentInputIterator to go segment by segment)
  - Will also make the DocumentDictionary more lenient.
I hope to expose these Dictionary implementations to solr soon.

> Suggester Dictionary implementation that takes expressions as term weights
> --------------------------------------------------------------------------
>
>                 Key: LUCENE-5294
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5294
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: core/search
>            Reporter: Areek Zillur
>             Fix For: 4.6, 5.0
>
>         Attachments: LUCENE-5294.patch
>
>
> It would be nice to have a Suggester Dictionary implementation that could compute the
weights of the terms consumed by the suggester based on an user-defined expression (using
lucene's expression module).
> It could be an extension of the existing DocumentDictionary (which takes terms, weights
and (optionally) payloads from the stored documents in the index). The only exception being
that instead of taking the weights for the terms from the specified weight fields, it could
compute the weights using an user-defn expression, that uses one or more NumicDocValuesField
from the document.
> Example:
>   let the document have
>      - product_id
>      - product_name
>      - product_popularity
>      - product_profit
>   Then this implementation could be used with an expression of     "0.2*product_popularity
+ 0.8*product_profit" to determine the weights of the terms for the corresponding documents
(optionally along with a payload (product_id))



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message