incubator-hama-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Hama Wiki] Trivial Update of "WordCountMatrix" by udanax
Date Wed, 17 Sep 2008 07:47:59 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hama Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/hama/WordCountMatrix

------------------------------------------------------------------------------
  
  == Abstract ==
  
- The word count matrix (document-word) approach is often referred to as latent semantic indexing
and document clustering (Of course, A word frequently present in all documents will not be
useful for clustering -- The length of all documents is not uniform so a lengthy document
will have higher word counts). This example gives parallel implementation of the Matrix-creation
(In the future, the matrix sparse decomposition technique).
+ The word count matrix (document-word) approach is often referred to as latent semantic indexing
and document clustering (Of course, A word frequently present in all documents will not be
useful for clustering; The length of all documents is not uniform so a lengthy document will
have higher word counts). This example gives parallel implementation of the Matrix-creation
(In the future, the matrix sparse decomposition technique).
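
For readers unfamiliar with the document-word matrix described in the abstract, here is a minimal sequential sketch in Python. It is not the parallel Hama implementation the wiki page refers to; the function names `build_word_count_matrix` and `normalize_rows`, the whitespace tokenization, and the sample documents are all assumptions made for illustration. The row normalization is one simple way to address the point that lengthy documents have higher word counts.

```python
from collections import Counter


def build_word_count_matrix(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts word j in document i."""
    # Assumption: naive lowercase + whitespace tokenization, for illustration only.
    tokenized = [doc.lower().split() for doc in documents]
    # One column per distinct word, in sorted order.
    vocabulary = sorted({word for tokens in tokenized for word in tokens})
    column = {word: j for j, word in enumerate(vocabulary)}
    matrix = []
    for tokens in tokenized:
        row = [0] * len(vocabulary)
        for word, count in Counter(tokens).items():
            row[column[word]] = count
        matrix.append(row)
    return vocabulary, matrix


def normalize_rows(matrix):
    """Divide each row by its total so that long documents do not dominate."""
    normalized = []
    for row in matrix:
        total = sum(row)
        normalized.append([c / total if total else 0.0 for c in row])
    return normalized


# Hypothetical sample documents.
docs = ["the cat sat", "the dog sat on the mat"]
vocab, counts = build_word_count_matrix(docs)
relative = normalize_rows(counts)
```

Each row of `counts` corresponds to one document and each column to one word in `vocab`. A word that appears in every document (here "the" and "sat") has a nonzero entry in every row, which is why such words contribute little to clustering.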
  
