commons-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Dunning" <ted.dunn...@gmail.com>
Subject [math] Fwd: [jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices
Date Tue, 21 Oct 2008 05:34:42 GMT
Luc and other commons math folk:

Do you guys have opinions about serialization formats for matrices (both
dense and sparse, both with and without row, column and cell attributes)?

---------- Forwarded message ----------
From: Jeff Eastman <jdog@windwardsolutions.com>
Date: Mon, Oct 20, 2008 at 10:03 PM
Subject: Re: [jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and
Matrices
To: mahout-dev@lucene.apache.org


Ted Dunning wrote:

> I see what you mean.
>
> To repeat in other words, the problems that need to be solved are:
>
> a) there are many uses already so adding attributes should be transparent
> to
> those who don't use them
>
> b) the encoding should not be ad hoc because this would be our second ad
> hoc
> encoding and only one should ever be allowed before using a standard
>
>
+1

> So here is a (kind of) concrete proposal:
>
> a) use JSON or Thrift for concrete syntax
>
>
Any preferences here? This might also impact other Mahout packages in the
future, so everybody please weigh in. In general, it seems that having a
common, public encoding for matrix and vector data would help users mix and
match the Mahout services. What are the requirements of these other
services? From inspection, it looks like only the clustering packages use
them currently.

Jeff



-- 
ted

Mime
View raw message