spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "yuhao yang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-15064) Locale support in StopWordsRemover
Date Wed, 06 Jun 2018 07:49:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-15064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16502929#comment-16502929
] 

yuhao yang commented on SPARK-15064:
------------------------------------

Yuhao will be OOF from May 29th to June 6th (annual leave and conference). Please expect delayed
email response. Conctact 669 243 8273 for anything urgent.

Regards,
Yuhao


> Locale support in StopWordsRemover
> ----------------------------------
>
>                 Key: SPARK-15064
>                 URL: https://issues.apache.org/jira/browse/SPARK-15064
>             Project: Spark
>          Issue Type: New Feature
>          Components: ML
>    Affects Versions: 2.0.0
>            Reporter: Xiangrui Meng
>            Priority: Major
>
> We support case insensitive filtering (default) in StopWordsRemover. However, case insensitive
matching depends on the locale and region, which cannot be explicitly set in StopWordsRemover.
We should consider adding this support in MLlib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message