Hi,
How can I solve this format bug? Please explain exactly what type of data I
need to provide to get good classification results. I think the problem is
caused by columns that contain the same value as the target variable. I read
about the target leak problem in Mahout, which says that if the predictor
variables include columns containing the same value as the target variable,
the results will be poor. Am I getting such results because of target leak?
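
For illustration, a minimal sketch of how such columns could be spotted before
training. It is plain Python, not Mahout code, and the function name
`find_leaky_columns`, the `threshold` parameter, and the sample rows and column
names are all hypothetical. It flags any predictor column whose value equals
the target value in at least a given fraction of rows:

```python
def find_leaky_columns(rows, target, threshold=1.0):
    """Return predictor columns whose value equals the target value
    in at least `threshold` fraction of the rows."""
    if not rows:
        return []
    leaky = []
    for col in rows[0]:
        if col == target:
            continue  # the target itself is not a predictor
        matches = sum(1 for r in rows if r[col] == r[target])
        if matches / len(rows) >= threshold:
            leaky.append(col)
    return leaky


# Hypothetical sample data: "label_copy" duplicates the target "label".
rows = [
    {"age": 25, "label_copy": "yes", "label": "yes"},
    {"age": 40, "label_copy": "no", "label": "no"},
    {"age": 31, "label_copy": "yes", "label": "yes"},
]
print(find_leaky_columns(rows, "label"))
```

Columns flagged this way would be candidates to drop from the predictors
before training. The check only catches exact copies of the target; a column
that encodes the target in another form would need a different test.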