mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chandra Mohan, Ananda Vel Murugan" <Ananda.Muru...@honeywell.com>
Subject significance of FEATURES in SGD
Date Wed, 03 Jul 2013 12:58:19 GMT
Hi,

I am experimenting Mahout for text classification. I have 2 million training data i.e text
of approximately 20 words. They fall into 121 categories. I tried AdaptiveLogisticRegression.
When I create sparse vector of cardinality 10000, it takes hours to converge, but when I tried
with 100 it converges fast. Is this measure very significant in determining the accuracy of
the model? Please advise.

Regards,
Anand.C

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message