spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Cyril de Vogelaere (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-20180) Add a special value for unlimited max pattern length in Prefix span, and set it as default.
Date Mon, 03 Apr 2017 11:09:41 GMT

    [ https://issues.apache.org/jira/browse/SPARK-20180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15953282#comment-15953282
] 

Cyril de Vogelaere commented on SPARK-20180:
--------------------------------------------

Fine, I thought a TODO left in the code would reflect the wish of the community, at least
a little.
I will close this thread and open a new one on changing the default value to maxInteger, since
I personnally think it would be more friendly to new users.

Link to new thread : https://issues.apache.org/jira/browse/SPARK-20203

Tomorrow, I will create a new thread with another improvement I want to add to spark. I need
to run a performance test on just that change first,
to prove it will be usefull. I hope you will follow it too.

> Add a special value for unlimited max pattern length in Prefix span, and set it as default.
> -------------------------------------------------------------------------------------------
>
>                 Key: SPARK-20180
>                 URL: https://issues.apache.org/jira/browse/SPARK-20180
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 2.1.0
>            Reporter: Cyril de Vogelaere
>            Priority: Minor
>   Original Estimate: 0h
>  Remaining Estimate: 0h
>
> Right now, we need to use .setMaxPatternLength() method to
> specify is the maximum pattern length of a sequence. Any pattern longer than that won't
be outputted.
> The current default maxPatternlength value being 10.
> This should be changed so that with input 0, all pattern of any length would be outputted.
Additionally, the default value should be changed to 0, so that a new user could find all
patterns in his dataset without looking at this parameter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message