Start Here with Machine Learning

Need Help Getting Started with Applied Machine Learning?

These are the Step-by-Step Guides that You’ve Been Looking For!

What do you want help with?

How Do I Get Started?

The most common question I’m asked is: “how do I get started?”

My best advice for getting started in machine learning is broken down into a 5-step process:

Step 1: Adjust Mindset. Believe you can practice and apply machine learning.
Step 2: Pick a Process. Use a systematic process to work through problems.
- Applied Machine Learning Process
Step 3: Pick a Tool. Select a tool for your level and map it onto your process.
- Beginners: Weka Workbench.
- Intermediate: Python Ecosystem.
- Advanced: R Platform.
- Best Programming Language for Machine Learning
Step 4: Practice on Datasets. Select datasets to work on and practice the process.
Step 5: Build a Portfolio. Gather results and demonstrate your skills.

For more on this top-down approach, see:

Many of my students have used this approach to go on and do well in Kaggle competitions and get jobs as Machine Learning Engineers and Data Scientists.

Applied Machine Learning Process

The benefits of machine learning are the predictions and the models that make them.

To have skill at applied machine learning means knowing how to consistently and reliably deliver high-quality predictions on problem after problem. You need to follow a systematic process.

Below is a 5-step process that you can follow to consistently achieve above-average results on predictive modeling problems:

Step 1: Define your problem.
- How to Define Your Machine Learning Problem
Step 2: Prepare your data.
Step 3: Spot-check algorithms.
Step 4: Improve results.
Step 5: Present results.
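
To make Step 3 (spot-check algorithms) concrete, here is a minimal sketch of a spot-check harness in plain Python. It is only an illustration under my own assumptions: the dataset is synthetic (two Gaussian blobs), and the two "algorithms" (a majority-class baseline and a 1-nearest-neighbor classifier) as well as the helper names are hypothetical stand-ins for whatever methods and tools you would actually try.

```python
import random
from collections import Counter

def make_dataset(n_per_class=30, seed=1):
    """Two Gaussian blobs in 2D, one per class (synthetic, for illustration only)."""
    rng = random.Random(seed)
    data = []
    for label, center in ((0, (0.0, 0.0)), (1, (3.0, 3.0))):
        for _ in range(n_per_class):
            point = (rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0))
            data.append((point, label))
    rng.shuffle(data)
    return data

def majority_class(train, x):
    """Baseline: always predict the most common label in the training fold."""
    return Counter(label for _, label in train).most_common(1)[0][0]

def nearest_neighbor(train, x):
    """1-nearest neighbor: copy the label of the closest training point."""
    def sq_dist(p):
        return (p[0] - x[0]) ** 2 + (p[1] - x[1]) ** 2
    return min(train, key=lambda row: sq_dist(row[0]))[1]

def cross_val_accuracy(predict, data, k=5):
    """Mean accuracy over k contiguous folds of the (already shuffled) data."""
    fold_size = len(data) // k
    scores = []
    for i in range(k):
        test = data[i * fold_size:(i + 1) * fold_size]
        train = data[:i * fold_size] + data[(i + 1) * fold_size:]
        correct = sum(predict(train, x) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / k

data = make_dataset()
candidates = [("majority class", majority_class),
              ("1-nearest neighbor", nearest_neighbor)]
# Evaluate every candidate with the same test harness so scores are comparable.
results = {name: cross_val_accuracy(fn, data) for name, fn in candidates}
for name, score in results.items():
    print(f"{name}: {score:.3f}")
```

The point of the sketch is the shape of the harness: every candidate is scored with the same resampling scheme, and a naive baseline is included so you can tell whether a method has any skill at all.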

For a good summary of this process, see the posts:

Probability for Machine Learning

Probability is the mathematics of quantifying and harnessing uncertainty. It is the bedrock of many fields of mathematics (like statistics) and is critical for applied machine learning.

Below is a 3-step process that you can use to get up to speed with probability for machine learning, fast.

Step 1: Discover what Probability is.
- Basics of Mathematical Notation for Machine Learning
- What Is Probability?
Step 2: Discover why Probability is so important for machine learning.
- 5 Reasons to Learn Probability for Machine Learning
- A Gentle Introduction to Uncertainty in Machine Learning
Step 3: Dive into Probability topics.
- Probability for Machine Learning Mini-Course
- Probability for Machine Learning (my book)

You can see all of the tutorials on probability here. Below is a selection of some of the most popular tutorials.

Probability Foundations

Bayes Theorem

Probability Distributions

Information Theory

Statistics for Machine Learning

Statistical methods are an important foundation area of mathematics required for achieving a deeper understanding of the behavior of machine learning algorithms.

Below is a 3-step process that you can use to get up to speed with statistical methods for machine learning, fast.

Step 1: Discover what Statistical Methods are.
- What is Statistics (and why is it important in machine learning)?
Step 2: Discover why Statistical Methods are important for machine learning.
- The Close Relationship Between Applied Statistics and Machine Learning
- 10 Examples of How to Use Statistical Methods in a Machine Learning Project
Step 3: Dive into the topics of Statistical Methods.
- Statistics for Machine Learning (7-Day Mini-Course)
- Statistical Methods for Machine Learning (my book)

You can see all of the statistical methods posts here. Below is a selection of some of the most popular tutorials.

Summary Statistics

Statistical Hypothesis Tests

Resampling Methods

Estimation Statistics

Linear Algebra for Machine Learning

Linear algebra is an important foundation area of mathematics required for achieving a deeper understanding of machine learning algorithms.

Below is a 3-step process that you can use to get up to speed with linear algebra for machine learning, fast.

Step 1: Discover what Linear Algebra is.
- Basics of Mathematical Notation for Machine Learning
- A Gentle Introduction to Linear Algebra
Step 2: Discover why Linear Algebra is important for machine learning.
Step 3: Dive into Linear Algebra topics.
- Linear Algebra for Machine Learning Mini-Course
- Linear Algebra for Machine Learning (my book)

You can see all linear algebra posts here. Below is a selection of some of the most popular tutorials.

Linear Algebra in Python

Matrices

Vectors

Matrix Factorization

Optimization for Machine Learning

Optimization is at the core of all machine learning algorithms. When we train a machine learning model, we are solving an optimization problem defined by the given dataset.
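
As a small illustration of training as optimization, the sketch below fits a straight line to a toy dataset by minimizing the mean squared error with plain gradient descent. The data, the learning rate, and the number of steps are assumptions I picked for the example, not recommendations.

```python
# Toy dataset generated from y = 2x + 1 (an assumption for illustration).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

def mse(w, b):
    """Mean squared error of the line y = w*x + b on the dataset."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def fit(learning_rate=0.05, steps=2000):
    """Minimize the MSE with plain (batch) gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Partial derivatives of the MSE with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        # Step against the gradient to reduce the error.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

w, b = fit()
print(f"w={w:.3f}, b={b:.3f}, mse={mse(w, b):.6f}")
```

Here "training" is nothing more than searching for the values of w and b that make the error on the dataset as small as possible.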

You can get familiar with optimization for machine learning in 3 steps, fast.

Step 1: Discover what Optimization is.
- A Gentle Introduction to Applied Machine Learning as a Search Problem
- A Gentle Introduction to Function Optimization
Step 2: Discover the Optimization Algorithms.
Step 3: Dive into Optimization Topics.
- How to Manually Optimize Machine Learning Model Hyperparameters
- Optimization for Machine Learning (my book)

You can see all optimization posts here. Below is a selection of some of the most popular tutorials.

Local Optimization

Global Optimization

Gradient Descent

Applications of Optimization

Calculus for Machine Learning

Calculus is the hidden driver for the success of many machine learning algorithms. When we talk about the gradient descent optimization part of a machine learning algorithm, the gradient is found using calculus.
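
To see what "finding the gradient with calculus" means in practice, here is a minimal sketch that compares a derivative obtained from the rules of differentiation with a numerical approximation. The function f and the step size h are arbitrary choices made for this example.

```python
def f(x):
    """Example function: f(x) = x**2 + 3*x."""
    return x ** 2 + 3 * x

def f_prime(x):
    """Derivative from the rules of differentiation: f'(x) = 2*x + 3."""
    return 2 * x + 3

def numerical_derivative(func, x, h=1e-5):
    """Central finite-difference approximation of the derivative at x."""
    return (func(x + h) - func(x - h)) / (2 * h)

x0 = 2.0
analytic = f_prime(x0)
approx = numerical_derivative(f, x0)
print(f"analytic={analytic}, numerical={approx:.6f}")
```

A check like this is a common way to gain confidence that a hand-derived gradient is correct before using it inside an optimization algorithm.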

You can get familiar with calculus for machine learning in 3 steps.

Step 1: Discover what Calculus is about.
- What is Calculus?
- Key Concepts in Calculus: Rate of Change
Step 2: Discover the rules of differentiation.
Step 3: Dive into Calculus Topics.

You can see all calculus posts here. Below is a selection of some of the most popular tutorials.

Basic Calculus

Multivariate Calculus

Calculus for Optimization

Applications of Calculus

Python for Machine Learning

Python is the lingua franca of machine learning projects. Not only are many machine learning libraries written in Python, but the language also helps us finish machine learning projects quickly and neatly. Good Python programming skills let you get more done in less time!

You can get familiar with Python for machine learning in 3 steps.

Step 1: Learn the language.
- How to Learn Python for Machine Learning?
- Some Language Features in Python
Step 2: Learn how to work with the language.
Step 3: Learn what you can do in the Python ecosystem.

You can see all Python posts here. But don’t miss Python for Machine Learning (my book). Below is a selection of some of the most popular tutorials.

Basic Language

Troubleshooting

Language Techniques

Libraries

Understand Machine Learning Algorithms

Machine learning is about machine learning algorithms.

You need to know what algorithms are available for a given problem, how they work, and how to get the most out of them.

Here’s how to get started with machine learning algorithms:

Step 1: Discover the different types of machine learning algorithms.
- A Tour of Machine Learning Algorithms
Step 2: Discover the foundations of machine learning algorithms.
Step 3: Discover how top machine learning algorithms work.
- Machine Learning Algorithms Mini-Course
- Master Machine Learning Algorithms (my book)

You can see all machine learning algorithm posts here. Below is a selection of some of the most popular tutorials.

Linear Algorithms

Nonlinear Algorithms

Ensemble Algorithms

How to Study/Learn ML Algorithms

Weka Machine Learning (no code)

Weka is a platform that you can use to get started in applied machine learning.

It has a graphical user interface, meaning that no programming is required, and it offers a suite of state-of-the-art algorithms.

Here’s how you can get started with Weka:

Step 1: Discover the features of the Weka platform.
- What is the Weka Machine Learning Workbench
Step 2: Discover how to get around the Weka platform.
- How to Download and Install the Weka Machine Learning Workbench
- A Tour of the Weka Machine Learning Workbench
Step 3: Discover how to deliver results with Weka.

You can see all Weka machine learning posts here. Below is a selection of some of the most popular tutorials.

Prepare Data in Weka

Weka Algorithm Tutorials

Python Machine Learning (scikit-learn)

Python is one of the fastest growing platforms for applied machine learning.

You can use the same tools, like pandas and scikit-learn, in both the development and the operational deployment of your model.

Below are the steps that you can use to get started with Python machine learning:

Step 1: Discover Python for machine learning.
- A Gentle Introduction to Scikit-Learn: A Python Machine Learning Library
Step 2: Discover the ecosystem for Python machine learning.
Step 3: Discover how to work through problems using machine learning in Python.

You can see all Python machine learning posts here. Below is a selection of some of the most popular tutorials.

Prepare Data in Python

Machine Learning in Python

R Machine Learning (caret)

R is a platform for statistical computing and is the most popular platform among professional data scientists.

It’s popular because of the large number of techniques available, and because of excellent interfaces to these methods such as the powerful caret package.

Here’s how to get started with R machine learning:

Step 1: Discover the R platform and why it is so popular.
Step 2: Discover machine learning algorithms in R.
- How To Get Started With Machine Learning Algorithms in R
Step 3: Discover how to work through problems using machine learning in R.

You can see all R machine learning posts here. Below is a selection of some of the most popular tutorials.

Data Preparation in R

Applied Machine Learning in R

Code Algorithm from Scratch (Python)

You can learn a lot about machine learning algorithms by coding them from scratch.

Learning via coding is the preferred learning style for many developers and engineers.

Here’s how to get started with machine learning by coding everything from scratch.

Step 1: Discover the benefits of coding algorithms from scratch.
- Benefits of Implementing Machine Learning Algorithms From Scratch
- Understand Machine Learning Algorithms By Implementing Them From Scratch
Step 2: Discover that coding algorithms from scratch is a learning tool only.
- Stop Coding Machine Learning Algorithms From Scratch
- Don’t Start with Open-Source Code When Implementing Machine Learning Algorithms
Step 3: Discover how to code machine learning algorithms from scratch in Python.
- Machine Learning Algorithms From Scratch (my book)

You can see all of the Code Algorithms from Scratch posts here. Below is a selection of some of the most popular tutorials.

Prepare Data

Linear Algorithms

Algorithm Evaluation

Nonlinear Algorithms

Introduction to Time Series Forecasting (Python)

Time series forecasting is an important topic in business applications.

Many datasets contain a time component, but the topic of time series is rarely covered in much depth from a machine learning perspective.

Here’s how to get started with Time Series Forecasting:

Step 1: Discover Time Series Forecasting.
- What Is Time Series Forecasting?
Step 2: Discover Time Series as Supervised Learning.
- Time Series Forecasting as Supervised Learning
Step 3: Discover how to get good at delivering results with Time Series Forecasting.
- Time Series Forecasting With Python Mini-Course
- Time Series Forecasting With Python (my book)
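
Step 2 above reframes a time series as a supervised learning problem. A minimal sketch of that idea is a sliding window, where previous time steps become the inputs and the next time step becomes the output. The helper name series_to_supervised and the toy series are assumptions made for this illustration.

```python
def series_to_supervised(series, n_lags=1):
    """Turn a univariate series into (inputs, output) rows using a sliding window."""
    rows = []
    for i in range(n_lags, len(series)):
        # The n_lags previous observations are the inputs, the current one is the target.
        rows.append((series[i - n_lags:i], series[i]))
    return rows

series = [10, 20, 30, 40, 50]
rows = series_to_supervised(series, n_lags=2)
for X, y in rows:
    print(X, "->", y)
```

Once the data is in this form, any standard regression algorithm can be tried on it, as long as the temporal order of the rows is respected when evaluating.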

You can see all Time Series Forecasting posts here. Below is a selection of some of the most popular tutorials.

Data Preparation Tutorials

Forecasting Tutorials

Data Preparation for Machine Learning (Python)

The performance of your predictive model is only as good as the data that you use to train it.

As such, data preparation may be one of the most important parts of your applied machine learning project.

Here’s how to get started with Data Preparation for machine learning:

Step 1: Discover the importance of data preparation.
- What Is Data Preparation in a Machine Learning Project
- Why Data Preparation Is So Important in Machine Learning
Step 2: Discover data preparation techniques.
- Tour of Data Preparation Techniques for Machine Learning
- Framework for Data Preparation Techniques in Machine Learning
Step 3: Discover how to get good at delivering results with data preparation.

You can see all Data Preparation tutorials here. Below is a selection of some of the most popular tutorials.

Data Cleaning

Feature Selection

Data Transforms

Dimensionality Reduction

XGBoost in Python (Stochastic Gradient Boosting)

XGBoost is a highly optimized implementation of gradient boosted decision trees.

It is popular because it is being used by some of the best data scientists in the world to win machine learning competitions.

Here’s how to get started with XGBoost:

Step 1: Discover the Gradient Boosting Algorithm.
- A Gentle Introduction to the Gradient Boosting Algorithm for Machine Learning
Step 2: Discover XGBoost.
- A Gentle Introduction to XGBoost for Applied Machine Learning
Step 3: Discover how to get good at delivering results with XGBoost.

You can see all XGBoost posts here. Below is a selection of some of the most popular tutorials.

XGBoost Basics

XGBoost Tuning

Imbalanced Classification

Imbalanced classification refers to classification tasks where there are many more examples for one class than another class.

These types of problems often require the use of specialized performance metrics and learning algorithms as the standard metrics and methods are unreliable or fail completely.
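
A quick sketch shows why a standard metric like accuracy can mislead on skewed data. The labels below are hypothetical (990 negatives and 10 positives), and the "model" simply predicts the majority class every time.

```python
# Hypothetical labels: 990 negatives (0) and 10 positives (1).
y_true = [0] * 990 + [1] * 10
# A "model" that always predicts the majority class.
y_pred = [0] * 1000

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

def recall(y_true, y_pred, positive=1):
    """Fraction of actual positives that were predicted as positive."""
    true_pos = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    actual_pos = sum(t == positive for t in y_true)
    return true_pos / actual_pos if actual_pos else 0.0

acc = accuracy(y_true, y_pred)
rec = recall(y_true, y_pred)
print(f"accuracy={acc:.2f}, recall={rec:.2f}")  # accuracy=0.99, recall=0.00
```

The model looks excellent by accuracy yet never finds a single example of the minority class, which is usually the class you care about.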

Here’s how you can get started with Imbalanced Classification:

Step 1: Discover the challenge of imbalanced classification.
- A Gentle Introduction to Imbalanced Classification
Step 2: Discover the intuition for skewed class distributions.
- Develop an Intuition for Severely Skewed Class Distributions
Step 3: Discover how to solve imbalanced classification problems.

You can see all Imbalanced Classification posts here. Below is a selection of some of the most popular tutorials.

Performance Measures

Cost-Sensitive Algorithms

Data Sampling

Advanced Methods

Deep Learning (Keras)

Deep learning is a fascinating and powerful field.

State-of-the-art results are coming from the field of deep learning and it is a sub-field of machine learning that cannot be ignored.

Here’s how to get started with deep learning:

Step 1: Discover what deep learning is all about.
- What is Deep Learning?
- 8 Inspirational Applications of Deep Learning
Step 2: Discover the best tools and libraries.
Step 3: Discover how to work through problems and deliver results.

You can see all deep learning posts here. Below is a selection of some of the most popular tutorials.

Background

Multilayer Perceptrons

Convolutional Neural Networks

Recurrent Neural Networks

Deep Learning (PyTorch)

Besides Keras, PyTorch is another library for deep learning with a huge market share. It is important to know about PyTorch and become familiar with its syntax.

Here’s how to get started with deep learning in PyTorch:

Step 1: Discover what deep learning is all about.
- What is Deep Learning?
- 8 Inspirational Applications of Deep Learning
Step 2: Discover PyTorch.
- Overview of Some Deep Learning Libraries
- PyTorch Tutorial: How to Develop Deep Learning Models with Python
Step 3: Discover how to work through problems and deliver results.

You can see all PyTorch deep learning posts here. Below is a selection of some of the most popular tutorials.

Background

Multilayer Perceptrons

Model Building Techniques

Advanced Networks

Machine Learning in OpenCV

OpenCV is the most popular library for image processing but its machine learning module is less well-known.

If you are already using OpenCV, adding machine learning to your project should come at no additional cost. You can use the experience you gained with scikit-learn or Keras to bring your image processing project to the next level.

Below are the steps that you can use to get started with machine learning in OpenCV:

Step 1: Refresher on what OpenCV offers.
- A Gentle Introduction to OpenCV
Step 2: Discover how to present images for consumption by machine learning models.
Step 3: Discover how to use machine learning in OpenCV.

You can see all OpenCV machine learning posts here. Below is a selection of some of the most popular tutorials.

Foundations on OpenCV and Image Processing

Machine Learning in Python

Better Deep Learning Performance

Although it is easy to define and fit a deep learning neural network model, it can be challenging to get good performance on a specific predictive modeling problem.

There are standard techniques that you can use to improve the learning, reduce overfitting, and make better predictions with your deep learning model.

Here’s how to get started with getting better deep learning performance:

Step 1: Discover the challenge of deep learning.
- Why Training a Neural Network Is Hard
- The Challenge of Training Deep Learning Neural Network Models
Step 2: Discover frameworks for diagnosing and improving model performance.
Step 3: Discover techniques that you can use to improve performance.
- How to Get Better Deep Learning Results (7-Day Mini-Course)
- Better Deep Learning (my book)

You can see all better deep learning posts here. Below is a selection of some of the most popular tutorials.

Better Learning (fix training)

Better Generalization (fix overfitting)

Better Predictions (ensembles)

Tips, Tricks, and Resources

Ensemble Learning

Predictive performance is the most important concern on many classification and regression problems. Ensemble learning algorithms combine the predictions from multiple models and are designed to perform better than any contributing ensemble member.
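
As a minimal sketch of combining predictions, the example below uses hard (majority) voting over three hypothetical classifiers. The predictions are made up so that each model errs on a different sample. Real ensemble members often make correlated errors, so an improvement like this is not guaranteed.

```python
from collections import Counter

def majority_vote(predictions):
    """Combine class labels from several models by hard voting (one list per model)."""
    combined = []
    for votes in zip(*predictions):
        # Pick the label predicted by the most models for this sample.
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical labels and model outputs, invented for illustration.
y_true = [1, 0, 1, 1, 0, 1]
model_a = [1, 0, 0, 1, 0, 1]  # wrong on the sample at index 2
model_b = [1, 1, 1, 1, 0, 1]  # wrong on the sample at index 1
model_c = [1, 0, 1, 0, 0, 1]  # wrong on the sample at index 3

ensemble = majority_vote([model_a, model_b, model_c])
for name, preds in [("a", model_a), ("b", model_b), ("c", model_c), ("ensemble", ensemble)]:
    print(f"{name}: {accuracy(y_true, preds):.3f}")
```

Because the three models disagree about where they are wrong, the vote corrects each individual mistake. This is the intuition behind preferring diverse ensemble members.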

Here’s how to get started with getting better ensemble learning performance:

Step 1: Discover ensemble learning.
- A Gentle Introduction to Ensemble Learning
- Why Use Ensemble Learning
Step 2: Discover ensemble learning algorithms.
- A Gentle Introduction to Ensemble Learning Algorithms
Step 3: Discover techniques that you can use to improve performance.
- Ensemble Machine Learning With Python (7-Day Mini-Course)
- Ensemble Learning Algorithms With Python (my book)

You can see all ensemble learning posts here. Below is a selection of some of the most popular tutorials.

Ensemble Basics

Stacking Ensembles

Bagging Ensembles

Boosting Ensembles

Long Short-Term Memory Networks (LSTMs)

Long Short-Term Memory (LSTM) Recurrent Neural Networks are designed for sequence prediction problems and are a state-of-the-art deep learning technique for challenging prediction problems.

Here’s how to get started with LSTMs in Python:

Step 1: Discover the promise of LSTMs.
- The Promise of Recurrent Neural Networks for Time Series Forecasting
Step 2: Discover where LSTMs are useful.
Step 3: Discover how to use LSTMs on your project.

You can see all LSTM posts here. Below is a selection of some of the most popular tutorials using LSTMs in Python with the Keras deep learning library.

Data Preparation for LSTMs

LSTM Behaviour

Modeling with LSTMs

LSTM for Time Series

Deep Learning for Natural Language Processing (NLP)

Working with text data is hard because of the messy nature of natural language.

Text is not “solved”, but to get state-of-the-art results on challenging NLP problems, you need to adopt deep learning methods.

Here’s how to get started with deep learning for natural language processing:

Step 1: Discover what deep learning for NLP is all about.
Step 2: Discover standard datasets for NLP.
- 7 Applications of Deep Learning for Natural Language Processing
- Datasets for Natural Language Processing
Step 3: Discover how to work through problems and deliver results.
- Crash-Course in Deep Learning for Natural Language Processing
- Deep Learning for Natural Language Processing (my book)

You can see all deep learning for NLP posts here. Below is a selection of some of the most popular tutorials.

Bag-of-Words Model

Language Modeling

Text Summarization

Text Classification

Word Embeddings

Photo Captioning

Text Translation

Deep Learning for Computer Vision

Working with image data is hard because of the gulf between raw pixels and the meaning in the images.

Computer vision is not solved, but to get state-of-the-art results on challenging computer vision tasks like object detection and face recognition, you need deep learning methods.

Here’s how to get started with deep learning for computer vision:

Step 1: Discover what deep learning for Computer Vision is all about.
- What is Computer Vision?
- What is the Promise of Deep Learning for Computer Vision?
Step 2: Discover standard tasks and datasets for Computer Vision.
Step 3: Discover how to work through problems and deliver results.
- How to Get Started With Deep Learning for Computer Vision (7-Day Mini-Course)
- Deep Learning for Computer Vision (my book)

You can see all deep learning for Computer Vision posts here. Below is a selection of some of the most popular tutorials.

Image Data Handling

Image Data Augmentation

Image Classification

Image Data Preparation

Basics of Convolutional Neural Networks

Object Recognition

Deep Learning for Time Series Forecasting

Deep learning neural networks are able to automatically learn arbitrarily complex mappings from inputs to outputs and support multiple inputs and outputs.

Methods such as MLPs, CNNs, and LSTMs offer a lot of promise for time series forecasting.

Here’s how to get started with deep learning for time series forecasting:

Step 1: Discover the promise (and limitations) of deep learning for time series.
Step 2: Discover how to develop robust baseline and defensible forecasting models.
- Taxonomy of Time Series Forecasting Problems
- How to Develop a Skillful Machine Learning Time Series Forecasting Model
Step 3: Discover how to build deep learning models for time series forecasting.
- How to Get Started with Deep Learning for Time Series Forecasting (7-Day Mini-Course)
- Deep Learning for Time Series Forecasting (my book)

You can see all deep learning for time series forecasting posts here. Below is a selection of some of the most popular tutorials.

Forecast Trends and Seasonality (univariate)

Human Activity Recognition (multivariate classification)

Forecast Electricity Usage (multivariate, multi-step)

Model Types

Time Series Case Studies

Forecast Air Pollution (multivariate, multi-step)

Generative Adversarial Networks (GANs)

Generative Adversarial Networks, or GANs for short, are an approach to generative modeling using deep learning methods, such as convolutional neural networks.

GANs are an exciting and rapidly changing field, delivering on the promise of generative models in their ability to generate realistic examples across a range of problem domains, most notably in image-to-image translation tasks.

Here’s how to get started with deep learning for Generative Adversarial Networks:

Step 1: Discover the promise of GANs for generative modeling.
- 18 Impressive Applications of Generative Adversarial Networks
Step 2: Discover the GAN architecture and different GAN models.
- A Gentle Introduction to Generative Adversarial Networks
- A Tour of Generative Adversarial Network Models
Step 3: Discover how to develop GAN models in Python with Keras.
- How to Get Started With Generative Adversarial Networks (7-Day Mini-Course)
- Generative Adversarial Networks with Python (my book)

You can see all Generative Adversarial Network tutorials listed here. Below is a selection of some of the most popular tutorials.

GAN Fundamentals

GAN Loss Functions

Develop Simple GAN Models

GANs for Image Translation

Attention and Transformers

Attention mechanisms are techniques invented to mitigate the problem that recurrent neural networks do not work well with long input sequences. It turned out that the attention mechanism itself can be used as a building block of neural networks, which gave us the transformer architecture.

Attention mechanisms and transformer models have been shown to deliver amazing results, especially in natural language processing. Transformer models, in one form or another, let computers process human language and perform tasks such as translating or summarizing a paragraph with human-like quality.
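
To give a feel for the building block, here is a minimal pure-Python sketch of scaled dot-product attention, the formulation commonly used in transformer models: each query is scored against all keys, the scores are turned into weights with a softmax, and the output is the weighted sum of the values. The tiny vectors are assumptions chosen for illustration, and a real implementation would use an array library and learned projections.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention for lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # Output is the attention-weighted sum of the value vectors.
        output = [sum(w * v[j] for w, v in zip(weights, values))
                  for j in range(len(values[0]))]
        outputs.append(output)
        all_weights.append(weights)
    return outputs, all_weights

# Toy inputs invented for this example.
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
queries = [[1.0, 0.0]]

outputs, weights = attention(queries, keys, values)
print("weights:", [round(w, 3) for w in weights[0]])
print("output:", [round(o, 3) for o in outputs[0]])
```

The query points in the same direction as the first and third keys, so those positions receive more weight than the second one. Nothing here depends on how far apart the positions are in the sequence, which is what helps with long inputs.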

Here’s how to get started with understanding attention mechanisms and transformers:

Step 1: Learn about what attention is and what it can do.
- A Bird’s Eye View of Research on Attention
Step 2: Discover how to use attention in a neural network model.
- Adding a Custom Attention Layer to a Recurrent Neural Network in Keras
Step 3: Learn how the transformer model is built from the attention mechanism.
- The Transformer Model
- Building Transformer Models with Attention (my book)

You can see all Attention and Transformer tutorials listed here. Below is a selection of some of the most popular tutorials.

Attention Fundamentals

Transformer Fundamentals

Building a Transformer Model from Scratch

Need More Help?

I’m here to help you become awesome at applied machine learning.

If you still have questions and need help, you have some options:

Ebooks: I sell a catalog of Ebooks that show you how to get results with machine learning, fast.
- Machine Learning Mastery EBook Catalog
Blog: I write a lot about applied machine learning on the blog, try the search feature.
- Machine Learning Mastery Blog
Frequently Asked Questions: The most common questions I get and their answers
- Machine Learning Mastery FAQ
Contact: You can contact me with your question, but one question at a time please.
- Machine Learning Mastery Contact
