mahout-dev mailing list archives

From "Suneel Marthi (JIRA)" <>
Subject [jira] [Commented] (MAHOUT-1391) Possibility to disable confusion matrix in naive bayes
Date Wed, 08 Jan 2014 20:00:50 GMT


Suneel Marthi commented on MAHOUT-1391:

Mansur, could you also include a test case that reproduces the original issue as part of the

> Possibility to disable confusion matrix in naive bayes
> ------------------------------------------------------
>                 Key: MAHOUT-1391
>                 URL:
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.8
>            Reporter: Mansur Iqbal
>             Fix For: 0.9
>         Attachments: MAHOUT-1391.patch
> Sometimes the confusion matrix is too big and not really necessary.
> There is also another case where this option would help:
> If you split a dataset with many labels into a test dataset and a training dataset by
> random selection percentage, some classes/labels may appear in the test data but not in
> the training data. The label index created when building a model from the training data
> then does not include those labels. If you test this model with the test data, Mahout
> tries to create a confusion matrix with labels from the test data that are not in the
> label index, and throws an exception.
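
The failure mode described above can be illustrated with a small standalone sketch. It does not use Mahout's classes; LabelIndexSketch and its methods buildLabelIndex, missingLabels and confusionMatrix are hypothetical names made up for this example, and the exception type is an assumption rather than what Mahout actually throws. The sketch builds a label index from training labels only, shows that filling a confusion matrix fails for a test label absent from that index, and shows how the missing labels could be detected beforehand.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not Mahout code.
class LabelIndexSketch {

    // Build a label -> index map from the labels seen in the training data only.
    static Map<String, Integer> buildLabelIndex(List<String> trainingLabels) {
        Map<String, Integer> index = new LinkedHashMap<>();
        for (String label : trainingLabels) {
            index.putIfAbsent(label, index.size());
        }
        return index;
    }

    // Labels that occur in the test data but are absent from the label index.
    static Set<String> missingLabels(Map<String, Integer> labelIndex, List<String> testLabels) {
        Set<String> missing = new TreeSet<>();
        for (String label : testLabels) {
            if (!labelIndex.containsKey(label)) {
                missing.add(label);
            }
        }
        return missing;
    }

    // Naive confusion matrix: rows are actual labels, columns are predicted labels.
    // Throws if any label is not in the index (assumed behaviour for this sketch).
    static int[][] confusionMatrix(Map<String, Integer> labelIndex,
                                   List<String> actual, List<String> predicted) {
        int n = labelIndex.size();
        int[][] matrix = new int[n][n];
        for (int i = 0; i < actual.size(); i++) {
            Integer row = labelIndex.get(actual.get(i));
            Integer col = labelIndex.get(predicted.get(i));
            if (row == null || col == null) {
                throw new IllegalArgumentException(
                    "Label not in label index: " + actual.get(i) + " / " + predicted.get(i));
            }
            matrix[row][col]++;
        }
        return matrix;
    }

    public static void main(String[] args) {
        Map<String, Integer> index = buildLabelIndex(List.of("politics", "tech", "politics"));
        List<String> testActual = List.of("tech", "sports");      // "sports" never seen in training
        List<String> testPredicted = List.of("tech", "politics");

        System.out.println("Missing from label index: " + missingLabels(index, testActual));
        try {
            confusionMatrix(index, testActual, testPredicted);
        } catch (IllegalArgumentException e) {
            System.out.println("Confusion matrix failed: " + e.getMessage());
        }
    }
}
```

A check like missingLabels could be used either to skip the confusion matrix or to report the unseen labels before testing; which of these the attached patch does is not stated in this message.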

This message was sent by Atlassian JIRA
