mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Dunning (JIRA)" <j...@apache.org>
Subject [jira] Commented: (MAHOUT-510) Standardize serialization mechanisms
Date Sun, 16 Jan 2011 01:38:45 GMT

    [ https://issues.apache.org/jira/browse/MAHOUT-510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12982197#action_12982197
] 

Ted Dunning commented on MAHOUT-510:
------------------------------------

I think that this patch looks pretty reasonable.  It affects a lot of code that Jeff has committed
so that he should comment.  The Naive Bayes code is hit less so, but it would be good to hear
from Robin on that side.

A key thing to remember is that asFormatString won't work on large objects anyway.  It has
occasionally been useful to assist with equality checks, but that is a small justification.
 As far as the Naive Bayes stuff is concerned, the serialized form hasn't been sufficient
to fully re-instantiate a usable model, so it is hard to see what the point is.



> Standardize serialization mechanisms
> ------------------------------------
>
>                 Key: MAHOUT-510
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-510
>             Project: Mahout
>          Issue Type: Task
>    Affects Versions: 0.4
>            Reporter: Sean Owen
>             Fix For: 0.5
>
>         Attachments: MAHOUT-510.patch
>
>
> At the moment this is tracking a broader concern: to standardize as much as possible
how we approach serialization. The long-term goal is notionally to use the following "encodings"
as the input/output of Mahout stuff, and by extension, probably internally too.
> - Text
> - Vector Writable
> - (maybe Avro)
> not
> - Serializable
> - GSON / JSON

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message