Archive | Probability

A Gentle Introduction to Computational Learning Theory

By Jason Brownlee on September 7, 2020 in Probability

Computational learning theory, or statistical learning theory, refers to mathematical frameworks for quantifying learning tasks and algorithms. These are sub-fields of machine learning that a practitioner does not need to know in great depth to achieve good results on a wide range of problems. Nevertheless, they are sub-fields where having […]

Develop an Intuition for Bayes Theorem With Worked Examples

By Jason Brownlee on August 19, 2020 in Probability

Bayes Theorem provides a principled way for calculating a conditional probability. It is a deceptively simple calculation, providing a method that is easy to use for scenarios where our intuition often fails. The best way to develop an intuition for Bayes Theorem is to think about the meaning of the terms in the equation and […]
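
As a rough illustration of the calculation this excerpt refers to, the sketch below applies Bayes Theorem in the form P(A|B) = P(B|A) * P(A) / P(B) to a hypothetical diagnostic-test scenario. The function name `bayes_posterior` and all of the numbers are assumptions made up for illustration; they are not taken from the tutorial.

```python
def bayes_posterior(prior, likelihood, false_positive_rate):
    """Return P(A|B) given P(A), P(B|A), and P(B|not A)."""
    # Evidence term P(B), expanded with the law of total probability.
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    # Bayes Theorem: P(A|B) = P(B|A) * P(A) / P(B).
    return likelihood * prior / evidence


# Hypothetical inputs: 1% base rate, 90% sensitivity, 5% false positive rate.
posterior = bayes_posterior(prior=0.01, likelihood=0.90, false_positive_rate=0.05)
print(f"P(condition | positive test) = {posterior:.3f}")
```

With a low base rate, the posterior stays well below the test's sensitivity, which is the kind of result where intuition tends to fail.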

A Gentle Introduction to the Bayes Optimal Classifier

By Jason Brownlee on August 19, 2020 in Probability

The Bayes Optimal Classifier is a probabilistic model that makes the most probable prediction for a new example. It is described using Bayes Theorem, which provides a principled way of calculating a conditional probability. It is also closely related to Maximum a Posteriori (MAP), a probabilistic framework that finds the […]

How to Use an Empirical Distribution Function in Python

By Jason Brownlee on August 28, 2020 in Probability

An empirical distribution function provides a way to model and sample cumulative probabilities for a data sample that does not fit a standard probability distribution. As such, it is sometimes called the empirical cumulative distribution function, or ECDF for short. In this tutorial, you will discover the empirical probability distribution function. After completing this tutorial, […]
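
To make the idea concrete, here is a minimal sketch of an empirical cumulative distribution function built with only the Python standard library. The helper name `make_ecdf` and the two-peaked sample are assumptions for illustration, not the tutorial's own code or data.

```python
import bisect
import random


def make_ecdf(sample):
    """Build an ECDF: a function returning the fraction of observations <= x."""
    ordered = sorted(sample)
    n = len(ordered)

    def ecdf(x):
        # bisect_right counts how many sorted observations are <= x.
        return bisect.bisect_right(ordered, x) / n

    return ecdf


# Hypothetical sample with two peaks, which no single standard distribution fits well.
random.seed(1)
sample = [random.gauss(20, 5) for _ in range(300)]
sample += [random.gauss(40, 5) for _ in range(700)]

ecdf = make_ecdf(sample)
for value in (20, 40, 60):
    print(f"P(x <= {value}) = {ecdf(value):.3f}")
```

Because the function is read directly off the sorted observations, it makes no assumption about the shape of the underlying distribution.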

What Does Stochastic Mean in Machine Learning?

By Jason Brownlee on July 24, 2020 in Probability

The behavior and performance of many machine learning algorithms are described as stochastic. Stochastic refers to a variable process where the outcome involves some randomness and therefore some uncertainty. It is a mathematical term closely related to “randomness” and “probabilistic,” and it can be contrasted with the idea of “deterministic.” The stochastic nature […]

A Gentle Introduction to Maximum a Posteriori (MAP) for Machine Learning

By Jason Brownlee on September 25, 2019 in Probability

Density estimation is the problem of estimating the probability distribution for a sample of observations from a problem domain. Typically, estimating the entire distribution is intractable, and instead, we settle for a point summary of the distribution, such as the mean or mode. Maximum a Posteriori, or MAP for short, is a Bayesian-based […]

A Gentle Introduction to Markov Chain Monte Carlo for Probability

By Jason Brownlee on September 25, 2019 in Probability

Probabilistic inference involves estimating an expected value or density using a probabilistic model. Often, directly inferring values is not tractable with probabilistic models, and instead, approximation methods must be used. Markov Chain Monte Carlo sampling provides a class of algorithms for systematic random sampling from high-dimensional probability distributions. Unlike Monte Carlo sampling methods that are […]

A Gentle Introduction to Monte Carlo Sampling for Probability

By Jason Brownlee on September 25, 2019 in Probability

Monte Carlo methods are a class of techniques for randomly sampling a probability distribution. There are many problem domains where describing or estimating the probability distribution is relatively straightforward, but calculating a desired quantity is intractable. This may be due to many reasons, such as the stochastic nature of the domain or an exponential number […]
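
A minimal sketch of the basic idea, assuming a hypothetical Gaussian target with mean 50 and standard deviation 5: draw random samples and average them to approximate a quantity of interest. The function name `monte_carlo_mean` and the target distribution are illustrative assumptions, not the tutorial's own example.

```python
import random


def monte_carlo_mean(sampler, n_samples):
    """Approximate an expected value by averaging n_samples random draws."""
    total = 0.0
    for _ in range(n_samples):
        total += sampler()
    return total / n_samples


def draw():
    # Hypothetical target distribution: Gaussian with mean 50 and std dev 5.
    return random.gauss(50, 5)


random.seed(1)
# Larger samples should generally give estimates closer to the true mean of 50.
for n in (10, 100, 10000):
    print(f"n={n:>5}: estimated mean = {monte_carlo_mean(draw, n):.3f}")
```

The same pattern applies when the averaged quantity is some function of the draws rather than the draws themselves.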

A Gentle Introduction to Expectation-Maximization (EM Algorithm)

By Jason Brownlee on August 28, 2020 in Probability

Maximum likelihood estimation is an approach to density estimation for a dataset that searches across probability distributions and their parameters. It is a general and effective approach that underlies many machine learning algorithms, although it requires that the training dataset be complete, i.e. that all relevant interacting random variables are present. Maximum likelihood becomes intractable if […]

Probabilistic Model Selection with AIC, BIC, and MDL

By Jason Brownlee on August 28, 2020 in Probability

Model selection is the problem of choosing one from among a set of candidate models. It is common to choose a model that performs the best on a hold-out test dataset or to estimate model performance using a resampling technique, such as k-fold cross-validation. An alternative approach to model selection involves using probabilistic statistical measures […]
