mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <>
Subject [jira] [Commented] (MAHOUT-1866) Add matrix-to-tsv string function
Date Sun, 29 May 2016 18:06:12 GMT


Hudson commented on MAHOUT-1866:

SUCCESS: Integrated in Mahout-Quality #3360 (See [])
MAHOUT-1866: Add matrix-to-tsv string function, this closes (smarthi: rev 8f4ee88fb40710d983ea3fb6ad008317f6c00936)
* math-scala/src/main/scala/org/apache/mahout/math/drm/package.scala

> Add matrix-to-tsv string function
> ---------------------------------
>                 Key: MAHOUT-1866
>                 URL:
>             Project: Mahout
>          Issue Type: Sub-task
>          Components: visiualization
>    Affects Versions: 0.12.1
>            Reporter: Trevor Grant
>            Assignee: Suneel Marthi
>             Fix For: 0.13.0
> Need a function to convert a matrix to a tsv string which can then be plotted by
> - Zeppelin %table visualization packages
> - Passed to R / Python via Zeppelin Resource Manager
> It has been noted that a matrix can be registered as an RDD and passed across contexts
directly in Spark, however this breaks the 'backend agnoistic' philosophy.  Until H20 and
Flink also both support Python / R environments it is more reasonable to use tab-seperated-value
> Further, matrices might be extremely large and unfit for being directly converted to
tsvs.  It may be wise to introduce some sort of safety valve for preventing excessively large
matrices from being materialized into local memory (eg. supposing the user hasn't called their
own sampling method on a matrix).

This message was sent by Atlassian JIRA

View raw message