ignite-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anton Dmitriev (JIRA)" <j...@apache.org>
Subject [jira] [Created] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
Date Fri, 29 Mar 2019 09:43:00 GMT
Anton Dmitriev created IGNITE-11655:

             Summary: ML: OneHotEncoder returns more columns than expected
                 Key: IGNITE-11655
                 URL: https://issues.apache.org/jira/browse/IGNITE-11655
             Project: Ignite
          Issue Type: Bug
            Reporter: Anton Dmitriev

OneHotEncoder returns more columns than expected (two values that might be encoded using two
columns encoded using 3 columns). The following example demonstrates the problem:

Map<Integer, Object[]> training = new HashMap<>();
        training.put(0, new Object[]{42.0});
        training.put(1, new Object[]{43.0});
        training.put(2, new Object[]{42.0});

        EncoderTrainer<Integer, Object[]> trainer = new EncoderTrainer<Integer, Object[]>()

        IgniteBiFunction<Integer, Object[], Vector> processor = trainer.fit(training,
1, (k, v) -> v);

        Vector res = processor.apply(1, new Object[]{42.0});


>>> [0.0, 1.0, 0.0]

This message was sent by Atlassian JIRA

View raw message