mahout-user mailing list archives

From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: "--features " in trainlogistic what is this for?
Date Fri, 27 Apr 2012 23:10:35 GMT
Putting a smaller value here will degrade prediction quality because more
and more features will collide in the hashed feature space.  Increasing
this beyond a certain point, however, will not significantly increase
prediction quality and it will increase memory usage.
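
To make the collision effect concrete, here is a minimal sketch in Python. It is not Mahout's encoder: the MD5-based hash, the helper names hashed_index and count_collisions, and the synthetic "word0", "word1", ... vocabulary are all assumptions made for illustration. It counts how many distinct tokens end up sharing a slot for several vector sizes.

```python
import hashlib


def hashed_index(token, num_features):
    """Map a token to a slot in a vector of length num_features.

    MD5 is only a stand-in for a stable hash here (an assumption for
    this sketch); it is not the hash function Mahout uses.
    """
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_features


def count_collisions(tokens, num_features):
    """Return how many distinct tokens share a slot with another token."""
    distinct = set(tokens)
    occupied = {hashed_index(t, num_features) for t in distinct}
    return len(distinct) - len(occupied)


if __name__ == "__main__":
    # Hypothetical vocabulary of 1000 distinct tokens.
    vocabulary = [f"word{i}" for i in range(1000)]
    for size in (100, 1000, 10000, 100000):
        print(size, count_collisions(vocabulary, size))
```

With 1000 distinct tokens and only 100 slots, at least 900 tokens must share a slot. As the vector grows, collisions become rare and further growth mostly costs memory.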

On Fri, Apr 27, 2012 at 11:01 PM, Yang <teddyyyy123@gmail.com> wrote:

> When I run mahout trainlogistic, there is an optional parameter, --features.
>
> The book "Mahout in Action" says:
>
> --features
> The size of the internal feature vector to use in building the model. A
> larger value here can be helpful, especially with text-like input data
>
>
> So is this something like a buffer size that does not affect the result of
> training? I thought the number of features considered by the model was
> already given explicitly by the --predictors parameter.
>
>
> thanks
> yang
>
