Machine learning models are chosen based on their mean performance, often calculated using k-fold cross-validation. The algorithm with the best […]

# Archive | Statistics

## A Gentle Introduction to Degrees of Freedom in Machine Learning

Degrees of freedom is an important concept from statistics and engineering. It is often employed to summarize the number of […]

## Arithmetic, Geometric, and Harmonic Means for Machine Learning

Calculating the average of a variable or a list of numbers is a common operation in machine learning. It is […]

## 17 Statistical Hypothesis Tests in Python (Cheat Sheet)

Quick-reference guide to the 17 statistical hypothesis tests that you need in applied machine learning, with sample code in Python. […]

## Statistics for Machine Learning (7-Day Mini-Course)

Statistics for Machine Learning Crash Course. Get on top of the statistics used in machine learning in 7 Days. Statistics […]

## How to Code the Student’s t-Test from Scratch in Python

Perhaps one of the most widely used statistical hypothesis tests is the Student’s t test. Because you may use this […]

## How to Calculate McNemar’s Test to Compare Two Machine Learning Classifiers

The choice of a statistical hypothesis test is a challenging open problem for interpreting machine learning results. In his widely […]

## The Role of Randomization to Address Confounding Variables in Machine Learning

A large part of applied machine learning is about running controlled experiments to discover what algorithm or algorithm configuration to […]

## All of Statistics for Machine Learning

A foundation in statistics is required to be effective as a machine learning practitioner. The book “All of Statistics” was […]

## A Gentle Introduction to Statistical Power and Power Analysis in Python

The statistical power of a hypothesis test is the probability of detecting an effect, if there is a true effect […]