From: git-site-role@apache.org
To: commits@predictionio.incubator.apache.org
Date: Wed, 14 Mar 2018 22:08:14 -0000
Subject: [15/51] [partial] predictionio-site git commit: Documentation based on apache/predictionio#439b87e07a59021839ea3fe2cd40f98fb8d4cc5f

http://git-wip-us.apache.org/repos/asf/predictionio-site/blob/9d2bd407/evaluation/paramtuning/index.html
----------------------------------------------------------------------
diff --git a/evaluation/paramtuning/index.html b/evaluation/paramtuning/index.html
index 5c5b0d1..d468550 100644
--- a/evaluation/paramtuning/index.html
+++ b/evaluation/paramtuning/index.html
@@ -1,4 +1,4 @@
< div class="hidden-md hidden-lg" id="mobile-page-heading-wrapper">

PredictionIO Docs

Hyperparameter Tuning

A PredictionIO engine is instantiated by a set of parameters. These parameters define which algorithm is to be used, as well supply the parameters for the algorithm itself. This naturally raises the question of how to choose the best set of parameters. The evaluation module streamlines the process of tuning the engine to the best parameter set and deploys it.

Quick Start

We demonstrate the evaluation with the classification template. The classification template uses a naive bayesian algorithm that has a smoothing parameter. We evaluate the prediction quality against different parameter values to find the best parameter values, and then deploy it.

Edit the AppId

Edit MyClassification/src/main/scala/Evaluation.scala to specify the appId you used to import the data.

1
Hyperparameter Tuning

A PredictionIO engine is instantiated by a set of parameters. These parameters define which algorithm is to be used, as well as supply the parameters for the algorithm itself. This naturally raises the question of how to choose the best set of parameters. The evaluation module streamlines the process of tuning the engine to the best parameter set and deploying it.

Quick Start

We demonstrate the evaluation with the classification template. The classification template uses a Naive Bayes algorithm that has a smoothing parameter. We evaluate the prediction quality against different values of this parameter to find the best one, and then deploy the engine with it.
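
For orientation, prediction quality in the classification template is measured by a simple accuracy metric defined in Evaluation.scala. The sketch below shows roughly what that metric and the Evaluation object look like; the package and import paths are assumptions based on the Apache PredictionIO classification template of this era, and Query, PredictedResult, ActualResult and ClassificationEngine are assumed to be defined in the template's other sources (e.g. Engine.scala).

  package org.template.classification

  import org.apache.predictionio.controller.AverageMetric
  import org.apache.predictionio.controller.EmptyEvaluationInfo
  import org.apache.predictionio.controller.Evaluation

  // Scores a single prediction: 1.0 if the predicted label matches the actual
  // label, 0.0 otherwise. AverageMetric averages this score over all evaluated queries.
  case class Accuracy()
    extends AverageMetric[EmptyEvaluationInfo, Query, PredictedResult, ActualResult] {
    def calculate(query: Query, predicted: PredictedResult, actual: ActualResult): Double =
      if (predicted.label == actual.label) 1.0 else 0.0
  }

  // Wires the engine factory and the metric together; this is the evaluation
  // object that the evaluation run executes.
  object AccuracyEvaluation extends Evaluation {
    engineMetric = (ClassificationEngine(), new Accuracy())
  }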

Edit the AppId

Edit MyClassification/src/main/scala/Evaluation.scala to specify the appId you used to import the data.

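The relevant portion of Evaluation.scala looks roughly like the sketch below: a base EngineParams carries the appId (plus an evalK value used for cross-validation), and the engine params list enumerates one variant per smoothing value to evaluate. The concrete values here (appId = 18, smoothing values 10.0, 100.0, 1000.0) are illustrative assumptions; substitute the appId you used to import the data. DataSourceParams, AlgorithmParams and the "naive" algorithm name are assumed to be defined in the stock template's sources.

  package org.template.classification

  import org.apache.predictionio.controller.EngineParams
  import org.apache.predictionio.controller.EngineParamsGenerator

  object EngineParamsList extends EngineParamsGenerator {
    // Base engine params: appId selects the data set you imported, and evalK
    // sets the number of folds used for cross-validation.
    private[this] val baseEP = EngineParams(
      dataSourceParams = DataSourceParams(appId = 18, evalK = Some(5)))

    // One engine params variant per smoothing value to evaluate; the best
    // variant found by the evaluation run is reported in best.json.
    engineParamsList = Seq(
      baseEP.copy(algorithmParamsList = Seq(("naive", AlgorithmParams(10.0)))),
      baseEP.copy(algorithmParamsList = Seq(("naive", AlgorithmParams(100.0)))),
      baseEP.copy(algorithmParamsList = Seq(("naive", AlgorithmParams(1000.0)))))
  }
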
@@ -385,7 +385,7 @@ Metrics:
   org.template.classification.Accuracy: 0.9281045751633987
 The best variant params can be found in best.json
 [INFO] [CoreWorkflow$] runEvaluation completed

Notes

  • We deliberately did not mention the test set in this hyperparameter tuning guide. In the machine learning literature, the test set is a separate piece of data used to evaluate the final engine params output by the evaluation process. This guarantees that no information from the training / validation set leaks into the engine params and biases the outcome. With PredictionIO, there are multiple ways of conducting robust tuning; we will cover this topic in the coming sections.