mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jake Mannix <jake.man...@gmail.com>
Subject Re: LDA on single node is much faster than 20 nodes
Date Tue, 06 Sep 2011 23:44:36 GMT
On Tue, Sep 6, 2011 at 4:44 PM, Chris Lu <clu@atypon.com> wrote:

> I see, thanks!
>
> Seems it should build into Mahout LDA algorithms, since the input file is
> usually not too large, but really needs parallel mapping processes.
>
>
If your input is not large, running a multithreaded in-memory algorithm on a
relatively beefy box (16+ cores, enough RAM to fit your data + model + some
spare) will be *much* faster than putting the same data on cluster,
actually.

  -jake

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message