spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Burak KÖSE (JIRA) <j...@apache.org>
Subject [jira] [Commented] (SPARK-15064) Locale support in StopWordsRemover
Date Mon, 02 May 2016 16:48:12 GMT

    [ https://issues.apache.org/jira/browse/SPARK-15064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15266955#comment-15266955
] 

Burak KÖSE commented on SPARK-15064:
------------------------------------

Can you show me a starter point about case insensitive matching based on locale and region?

> Locale support in StopWordsRemover
> ----------------------------------
>
>                 Key: SPARK-15064
>                 URL: https://issues.apache.org/jira/browse/SPARK-15064
>             Project: Spark
>          Issue Type: New Feature
>          Components: ML
>    Affects Versions: 2.0.0
>            Reporter: Xiangrui Meng
>
> We support case insensitive filtering (default) in StopWordsRemover. However, case insensitive
matching depends on the locale and region, which cannot be explicitly set in StopWordsRemover.
We should consider adding this support in MLlib.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message