Archive | Python Machine Learning

Boxplot of top 10 Spot-Checking Algorithms on a Classification Problem

How to Develop a Framework to Spot-Check Machine Learning Algorithms in Python

Spot-checking algorithms is a technique in applied machine learning designed to quickly and objectively provide a first set of results on a new predictive modeling problem. Unlike grid searching and other types of algorithm tuning that seek the optimal algorithm or optimal configuration for an algorithm, spot-checking is intended to evaluate a diverse set of […]

Continue Reading 18
Example ROC Curve

A Gentle Introduction to Probability Scoring Methods in Python

How to Score Probability Predictions in Python and Develop an Intuition for Different Metrics. Predicting probabilities instead of class labels for a classification problem can provide additional nuance and uncertainty for the predictions. The added nuance allows more sophisticated metrics to be used to interpret and evaluate the predicted probabilities. In general, methods for the […]

Continue Reading 20
Calibrated and Uncalibrated SVM Reliability Diagram

How and When to Use a Calibrated Classification Model with scikit-learn

Instead of predicting class values directly for a classification problem, it can be convenient to predict the probability of an observation belonging to each possible class. Predicting probabilities allows some flexibility including deciding how to interpret the probabilities, presenting predictions with uncertainty, and providing more nuanced ways to evaluate the skill of the model. Predicted […]

Continue Reading 20
Line Plot of ROC Curve

How and When to Use ROC Curves and Precision-Recall Curves for Classification in Python

It can be more flexible to predict probabilities of an observation belonging to each class in a classification problem rather than predicting classes directly. This flexibility comes from the way that probabilities may be interpreted using different thresholds that allow the operator of the model to trade-off concerns in the errors made by the model, […]

Continue Reading 62
Scatter plot of Moons Test Classification Problem

How to Generate Test Datasets in Python with scikit-learn

Test datasets are small contrived datasets that let you test a machine learning algorithm or test harness. The data from test datasets have well-defined properties, such as linearly or non-linearity, that allow you to explore specific algorithm behavior. The scikit-learn Python library provides a suite of functions for generating samples from configurable test problems for […]

Continue Reading 26
How to Setup a Python Environment for Machine Learning and Deep Learning with Anaconda

How to Setup Your Python Environment for Machine Learning with Anaconda

It can be difficult to install a Python machine learning environment on some platforms. Python itself must be installed first and then there are many packages to install, and it can be confusing for beginners. In this tutorial, you will discover how to set up a Python machine learning development environment using Anaconda. After completing […]

Continue Reading 966