mahout-user mailing list archives

From Ted Dunning <>
Subject Re: "--features " in trainlogistic what is this for?
Date Fri, 27 Apr 2012 23:10:35 GMT
A smaller value here will degrade prediction quality, because more and
more features will collide in the hashed feature space.  Increasing it
beyond a certain point, however, will not significantly improve
prediction quality, and it will increase memory usage.
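
To make the collision effect concrete, here is a minimal sketch of feature
hashing in plain Python. It is not Mahout's code: the MD5-based bucket()
function, the synthetic 1000-word vocabulary, and the vector sizes tried are
all assumptions chosen for illustration. It only counts how many distinct
features end up sharing a slot as the vector size changes.

```python
import hashlib


def bucket(feature: str, num_features: int) -> int:
    """Map a feature name to a slot in a vector of size num_features.

    MD5 is a stand-in hash for this sketch, not what Mahout uses.
    """
    digest = hashlib.md5(feature.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_features


def count_collisions(features, num_features: int) -> int:
    """Count features that land in a slot already taken by another feature."""
    seen = set()
    collisions = 0
    for name in features:
        slot = bucket(name, num_features)
        if slot in seen:
            collisions += 1
        else:
            seen.add(slot)
    return collisions


# Hypothetical vocabulary of 1000 distinct text features.
vocabulary = [f"word{i}" for i in range(1000)]

for size in (100, 1000, 10000, 100000):
    print(size, count_collisions(vocabulary, size))
```

With 1000 distinct features and only 100 slots, at least 900 features must
share a slot, so the model cannot tell them apart. Once the vector is much
larger than the number of distinct features, collisions become rare and
further increases mostly just cost memory.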

On Fri, Apr 27, 2012 at 11:01 PM, Yang <> wrote:

> When I run mahout trainlogistic, there is an optional param, --features.
> The book "Mahout in Action" says:
> --features
> The size of the internal feature vector to use in building the model. A
> larger value here can be helpful, especially with text-like input data.
> So is this something like a buffer size, so that it does not affect the
> result of the training? I thought the feature count to be considered in
> the model was already explicitly given by the --predictors param.
> Thanks,
> Yang
