mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Marc Sturlese <>
Subject Re: about DistributedRowMatrix implementation
Date Sat, 01 Oct 2011 10:18:34 GMT
Well after digging into the code and do some tests, I've seen that what I was
asking for is not possible. Mahout will only let you do a distributed matrix
multiplication of 2 sparse matrix, as the representation of a whole row or
column has to feed in memory. Actually have to feed in memory a row and a
column each time (as it uses the CompositeInputFormat). 
To do dense matrix multiplication with hadoop just found this: 
But the data generated by the maps will be extremely huge and the job will
take ages (of course depending of the number of nodes).
I've seed around that Hama and R are possible solutions too. Any advice,
comment or experience?

View this message in context:
Sent from the Mahout User List mailing list archive at

View raw message