We have an intuition that more observations are better. This is the same intuition behind the idea that if we […]

# Archive | Statistics

## A Gentle Introduction to Calculating Normal Summary Statistics

A sample of data is a snapshot from a broader population of all possible observations that could be taken of […]

## How to Calculate Correlation Between Variables in Python

There may be complex and unknown relationships between the variables in your dataset. It is important to discover and quantify […]

## Introduction to Random Number Generators for Machine Learning in Python

Randomness is a big part of machine learning. Randomness is used as a tool or a feature in preparing data […]

## How to Calculate Bootstrap Confidence Intervals For Machine Learning Results in Python

It is important to present both the expected skill of a machine learning model and confidence intervals for […]

## How to Report Classifier Performance with Confidence Intervals

Once you choose a machine learning algorithm for your classification problem, you need to report the performance of the model […]

## How to Use Statistical Significance Tests to Interpret Machine Learning Results

It is good practice to gather a population of results when comparing two different machine learning algorithms or when comparing […]

## Estimate the Number of Experiment Repeats for Stochastic Machine Learning Algorithms

A problem with many stochastic machine learning algorithms is that different runs of the same algorithm on the same data […]

## Machine Learning Terminology from Statistics and Computer Science

Data plays a big part in machine learning. It is important to understand and use the right terminology when talking about […]

## Crash Course in Statistics for Machine Learning

You do not need to know statistics before you can start learning and applying machine learning. You can start today. […]