[ https://issues.apache.org/jira/browse/IGNITE-9145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16698800#comment-16698800
]
ASF GitHub Bot commented on IGNITE-9145:
----------------------------------------
Github user asfgit closed the pull request at:
https://github.com/apache/ignite/pull/5481
> [ML] Add different strategies to index labels in StringEncoderTrainer
> ---------------------------------------------------------------------
>
> Key: IGNITE-9145
> URL: https://issues.apache.org/jira/browse/IGNITE-9145
> Project: Ignite
> Issue Type: Improvement
> Components: ml
> Reporter: Aleksey Zinoviev
> Assignee: Aleksey Zinoviev
> Priority: Major
> Fix For: 2.8
>
>
> The main idea to add a few strategies of indexing: sorting and so on.
> Currently it supports only one strategy (most popular with zero and less popular with
the max index size).
> There are can be a few options
> * 'frequencyDesc': descending order by label frequency (most frequent label assigned
0)
> * 'frequencyAsc': ascending order by label frequency (least frequent label assigned
0)
>
> Please, update the method **transformFrequenciesToEncodingValues and add the strategy
as a parameter of trainer.
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
|