lucene-solr-dev mailing list archives

From "Vaijanath N. Rao (JIRA)" <>
Subject [jira] Commented: (SOLR-651) A SearchComponent for fetching TF-IDF values
Date Wed, 22 Oct 2008 13:49:44 GMT


Vaijanath N. Rao commented on SOLR-651:

Hi Grant,

I think my understanding is slightly different; let me try to clarify both cases.

If the user has asked for tf=true, he is expecting the term frequency value,
so the output would be
<int name="display">2</int>

If the user has asked for tf=true&idf=true, the user is asking for both to be computed together,
so the output would be
<float name="display">1.0</float>
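
The two cases can be sketched roughly as below. This is only an illustration, not code from the SOLR-651 patch: the class `DisplayValueSketch`, the method `display`, and its parameters are hypothetical names, and the combined value is assumed to be tf divided by the document frequency (df), which matches the 2 and 1.0 above only if df is 2. The patch may compute it differently.

```java
// Illustrative sketch only; none of these names come from the SOLR-651 patch.
final class DisplayValueSketch {

    /**
     * Renders the single "display" element for one term.
     *
     * @param tf      term frequency of the term in the selected document
     * @param df      number of documents containing the term (must be >= 1)
     * @param wantTf  true if the request carried tf=true
     * @param wantIdf true if the request carried idf=true
     */
    static String display(int tf, int df, boolean wantTf, boolean wantIdf) {
        if (df < 1) {
            throw new IllegalArgumentException("df must be at least 1");
        }
        if (wantTf && wantIdf) {
            // Assumption: the combined value is tf / df, emitted as a float.
            double tfIdf = (double) tf / df;
            return "<float name=\"display\">" + tfIdf + "</float>";
        }
        if (wantTf) {
            // tf alone: the raw count, emitted as an int.
            return "<int name=\"display\">" + tf + "</int>";
        }
        // Any other combination is outside the two cases discussed here.
        return "";
    }

    public static void main(String[] args) {
        System.out.println(display(2, 2, true, false)); // tf=true
        System.out.println(display(2, 2, true, true));  // tf=true&idf=true
    }
}
```

With tf=2 and an assumed df=2, the first call yields the int element and the second yields the float element, so the element type alone tells the client which computation was requested.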

I had this output in mind, which is why I thought this representation would be ideal. But I think
I might be missing something that you have thought about.

--Thanks and Regards
Vaijanath N. Rao

> A SearchComponent for fetching TF-IDF values
> --------------------------------------------
>                 Key: SOLR-651
>                 URL:
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 1.3
>            Reporter: Noble Paul
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 1.4
>         Attachments: SOLR-651.patch, SOLR-651.patch, SOLR-651.patch, SOLR-651.patch,
> A SearchComponent that can return TF-IDF vector for any given document in the SOLR index
> Query : A Document Number / a query identifying a Document
> Response :  A Map of term vs. TF-IDF value of every term in the Selected
> Document
> Why ?
> Most of the Machine Learning Algorithms work on TFIDF representation of
> documents, hence adding a Request Handler providing the TFIDF representation
> will pave the way for incorporating Learning Paradigms to SOLR framework.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
