Search results for "encoder decoder attention"

Training the Transformer Model

By Stefania Cristina on January 6, 2023 in Attention 44

We have put together the complete Transformer model, and now we are ready to train it for neural machine translation. We shall use a training dataset for this purpose, which contains short English and German sentence pairs. We will also revisit the role of masking in computing the accuracy and loss metrics during the training […]

The Transformer Model

By Stefania Cristina on January 6, 2023 in Attention 26

We have already familiarized ourselves with the concept of self-attention as implemented by the Transformer attention mechanism for neural machine translation. We will now be shifting our focus to the details of the Transformer architecture itself to discover how self-attention can be implemented without relying on the use of recurrence and convolutions. In this tutorial, […]

Introduction to Dimensionality Reduction for Machine Learning

By Jason Brownlee on June 30, 2020 in Data Preparation 11

The number of input variables or features for a dataset is referred to as its dimensionality. Dimensionality reduction refers to techniques that reduce the number of input variables in a dataset. More input features often make a predictive modeling task more challenging to model, more generally referred to as the curse of dimensionality. High-dimensionality statistics […]

How to Develop a Deep Learning Photo Caption Generator from Scratch

By Jason Brownlee on December 23, 2020 in Deep Learning for Natural Language Processing 1,196

Develop a Deep Learning Model to Automatically Describe Photographs in Python with Keras, Step-by-Step. Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph. It requires both methods from computer vision to understand the content of the image and a language model from the field of […]

Example of Image Classification With Localization of Multiple Chairs From VOC 2012

9 Applications of Deep Learning for Computer Vision

By Jason Brownlee on July 5, 2019 in Deep Learning for Computer Vision 39

The field of computer vision is shifting from statistical methods to deep learning neural network methods. There are still many challenging problems to solve in computer vision. Nevertheless, deep learning methods are achieving state-of-the-art results on some specific problems. It is not just the performance of deep learning models on benchmark problems that is most […]

Activation Regularization for Reducing Generalization Error in Deep Learning Neural Networks

A Gentle Introduction to Activation Regularization in Deep Learning

By Jason Brownlee on August 6, 2019 in Deep Learning Performance 8

Deep learning models are capable of automatically learning a rich internal representation from raw input data. This is called feature or representation learning. Better learned representations, in turn, can lead to better insights into the domain, e.g. via visualization of learned features, and to better predictive models that make use of the learned features. A […]

How to Develop LSTM Models for Time Series Forecasting

By Jason Brownlee on August 28, 2020 in Deep Learning for Time Series 981

Long Short-Term Memory networks, or LSTMs for short, can be applied to time series forecasting. There are many types of LSTM models that can be used for each specific type of time series forecasting problem. In this tutorial, you will discover how to develop a suite of LSTM models for a range of standard time […]

Box and whisker plots of target variables for one chunk

How to Load, Visualize, and Explore a Multivariate Multistep Time Series Dataset

By Jason Brownlee on August 5, 2019 in Deep Learning for Time Series 10

Real-world time series forecasting is challenging for a whole host of reasons not limited to problem features such as having multiple input variables, the requirement to predict multiple time steps, and the need to perform the same type of prediction for multiple physical sites. The EMC Data Science Global Hackathon dataset, or the ‘Air Quality […]

A Gentle Introduction to Neural Machine Translation

By Jason Brownlee on August 7, 2019 in Deep Learning for Natural Language Processing 31

One of the earliest goals for computers was the automatic translation of text from one language to another. Automatic or machine translation is perhaps one of the most challenging artificial intelligence tasks given the fluidity of human language. Classically, rule-based systems were used for this task, which were replaced in the 1990s with statistical methods. […]

How to Use Small Experiments to Develop a Caption Generation Model in Keras

By Jason Brownlee on September 3, 2020 in Deep Learning for Natural Language Processing 85

Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a photograph. It requires both methods from computer vision to understand the content of the image and a language model from the field of natural language processing to turn the understanding of the image into words in the right […]

← Previous 1 … 3 4 5 Next →