Archive | Deep Learning Performance

How to Control the Stability of Training Neural Networks With the Batch Size

By Jason Brownlee on August 28, 2020 in Deep Learning Performance 39

Neural networks are trained using gradient descent, where the estimate of the error used to update the weights is calculated from a subset of the training dataset. The number of examples from the training dataset used in the estimate of the error gradient is called the batch size and is an important hyperparameter that […]

How to Accelerate Learning of Deep Neural Networks With Batch Normalization

By Jason Brownlee on August 25, 2020 in Deep Learning Performance 42

Batch normalization is a technique designed to automatically standardize the inputs to a layer in a deep learning neural network. Once implemented, batch normalization has the effect of dramatically accelerating the training process of a neural network, and in some cases improves the performance of the model via a modest regularization effect. In this tutorial, […]
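To make "standardize the inputs to a layer" concrete, here is a minimal sketch of the standardization step applied to one feature across a mini-batch. It uses only the standard library; the function name `standardize_batch` and the `eps` value are illustrative assumptions, and the learned scale and shift parameters that batch normalization layers normally apply afterward are left out.

```python
from statistics import fmean, pvariance


def standardize_batch(values, eps=1e-5):
    """Rescale one feature across a mini-batch to roughly zero mean, unit variance.

    `eps` is an assumed small constant that avoids division by zero.
    The learned scale and shift of a full batch normalization layer are omitted.
    """
    mean = fmean(values)
    var = pvariance(values, mu=mean)  # population variance over the batch
    return [(v - mean) / (var + eps) ** 0.5 for v in values]


# Hypothetical activations for one feature over a mini-batch of four examples.
batch = [2.0, 4.0, 6.0, 8.0]
normalized = standardize_batch(batch)
print(normalized)
```

In a framework such as Keras this would typically be handled by a built-in layer rather than written by hand; the sketch only illustrates the arithmetic.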

A Gentle Introduction to Batch Normalization for Deep Neural Networks

By Jason Brownlee on December 4, 2019 in Deep Learning Performance 53

Training deep neural networks with tens of layers is challenging, as they can be sensitive to the initial random weights and the configuration of the learning algorithm. One possible reason for this difficulty is that the distribution of the inputs to layers deep in the network may change after each mini-batch when the weights are updated. This […]

3 Must-Own Books for Deep Learning Practitioners

By Jason Brownlee on August 6, 2019 in Deep Learning Performance 18

Developing neural networks is often referred to as a dark art. The reason for this is that being skilled at developing neural network models comes from experience. There are no reliable methods to analytically calculate how to design a “good” or “best” model for your specific dataset. You must draw on experience and experiment in […]

How to Fix the Vanishing Gradients Problem Using the ReLU

By Jason Brownlee on August 25, 2020 in Deep Learning Performance 26

The vanishing gradients problem is one example of unstable behavior that you may encounter when training a deep neural network. It describes the situation where a deep multilayer feed-forward network or a recurrent neural network is unable to propagate useful gradient information from the output end of the model back to the layers near the […]

A Gentle Introduction to the Rectified Linear Unit (ReLU)

By Jason Brownlee on August 20, 2020 in Deep Learning Performance 79

In a neural network, the activation function is responsible for transforming the summed weighted input to a node into the activation of the node, or its output for that input. The rectified linear activation function, or ReLU for short, is a piecewise linear function that will output the input directly if it is positive, otherwise, it […]
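The excerpt is cut off, but under the standard definition of the rectified linear function, max(0, x), positive inputs pass through unchanged and all other inputs map to zero. A minimal sketch follows; the function name `relu` and the sample inputs are illustrative choices, not taken from the linked post.

```python
def relu(x: float) -> float:
    """Rectified linear activation: return x if it is positive, else 0.0."""
    return max(0.0, x)


# Hypothetical summed weighted inputs to a node.
inputs = [-2.0, -0.5, 0.0, 1.5, 3.0]
outputs = [relu(v) for v in inputs]
print(outputs)  # [0.0, 0.0, 0.0, 1.5, 3.0]
```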

Ensemble Neural Network Model Weights in Keras (Polyak Averaging)

By Jason Brownlee on August 28, 2020 in Deep Learning Performance 18

The training process of neural networks is a challenging optimization process that can often fail to converge. This can mean that the model at the end of training may not be a stable or best-performing set of weights to use as a final model. One approach to address this problem is to use an average […]

Snapshot Ensemble Deep Learning Neural Network in Python

By Jason Brownlee on August 28, 2020 in Deep Learning Performance 34

Model ensembles can achieve lower generalization error than single models but are challenging to develop with deep learning neural networks given the computational cost of training each single model. An alternative is to train multiple model snapshots during a single training run and combine their predictions to make an ensemble prediction. A limitation of this […]

Impact of Dataset Size on Deep Learning Model Skill And Performance Estimates

By Jason Brownlee on August 25, 2020 in Deep Learning Performance 22

Supervised learning is challenging, although the depths of this challenge are often learned then forgotten or willfully ignored. This must be the case, because dwelling too long on this challenge may result in a pessimistic outlook. In spite of the challenge, we continue to wield supervised learning algorithms and they perform well in practice. Fundamental […]

Stacking Ensemble for Deep Learning Neural Networks in Python

By Jason Brownlee on August 28, 2020 in Deep Learning Performance 224

Model averaging is an ensemble technique where multiple sub-models contribute equally to a combined prediction. Model averaging can be improved by weighting the contribution of each sub-model to the combined prediction by the expected performance of that sub-model. This can be extended further by training an entirely new model to learn how to best combine […]
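The difference between equal and weighted model averaging can be sketched in a few lines. The helper `weighted_average`, the example probabilities, and the per-model weights below are all hypothetical, chosen only to illustrate the arithmetic; the stacking step, where a new model learns the combination, is not shown.

```python
def weighted_average(predictions, weights):
    """Combine per-model class probabilities using one weight per sub-model.

    `predictions` is a list with one probability vector per sub-model.
    Weights are normalized by their sum, so equal weights give a plain average.
    """
    total = sum(weights)
    n_classes = len(predictions[0])
    return [
        sum(w * p[i] for w, p in zip(weights, predictions)) / total
        for i in range(n_classes)
    ]


# Hypothetical class probabilities from three sub-models for one example.
preds = [[0.7, 0.3], [0.6, 0.4], [0.2, 0.8]]

# Model averaging: every sub-model contributes equally.
equal = weighted_average(preds, [1.0, 1.0, 1.0])

# Weighted averaging: weights stand in for each sub-model's expected
# performance (for example, assumed holdout accuracy).
weighted = weighted_average(preds, [0.9, 0.8, 0.5])

print(equal, weighted)
```

With these assumed weights, the weaker third model pulls the combined prediction less than it does under equal averaging.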
