mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Raphael Cendrillon" <cendrillon1...@gmail.com>
Subject Re: Review Request: Row mean job for PCA
Date Sat, 17 Dec 2011 20:50:42 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/3147/
-----------------------------------------------------------

(Updated 2011-12-17 20:50:42.776447)


Review request for mahout, Ted Dunning, lancenorskog, and Dmitriy Lyubimov.


Changes
-------

Correct version.


Summary
-------

Here's a patch with a simple job to calculate the row mean (column-wise mean). One outstanding
issue is the combiner, this requires a wrtiable class IntVectorTupleWritable, where the Int
stores the number of rows, and the Vector stores the column-wise sum.


This addresses bug MAHOUT-923.
    https://issues.apache.org/jira/browse/MAHOUT-923


Diffs (updated)
-----

  /trunk/core/src/main/java/org/apache/mahout/math/hadoop/DistributedRowMatrix.java 1215567

  /trunk/core/src/main/java/org/apache/mahout/math/hadoop/MatrixColumnMeansJob.java PRE-CREATION

  /trunk/core/src/test/java/org/apache/mahout/math/hadoop/TestDistributedRowMatrix.java 1215567


Diff: https://reviews.apache.org/r/3147/diff


Testing
-------

Junit test


Thanks,

Raphael


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message