Tag Archives | transformer

The Transformer Model

By Stefania Cristina on January 6, 2023 in Attention 26

We have already familiarized ourselves with the concept of self-attention as implemented by the Transformer attention mechanism for neural machine translation. We will now be shifting our focus to the details of the Transformer architecture itself to discover how self-attention can be implemented without relying on the use of recurrence and convolutions. In this tutorial, […]

The Transformer Attention Mechanism

By Stefania Cristina on January 6, 2023 in Attention 18

Before the introduction of the Transformer model, the use of attention for neural machine translation was implemented by RNN-based encoder-decoder architectures. The Transformer model revolutionized the implementation of attention by dispensing with recurrence and convolutions and, alternatively, relying solely on a self-attention mechanism. We will first focus on the Transformer attention mechanism in this tutorial […]

A Tour of Attention-Based Architectures

By Stefania Cristina on January 6, 2023 in Attention 4

As the popularity of attention in machine learning grows, so does the list of neural architectures that incorporate an attention mechanism. In this tutorial, you will discover the salient neural architectures that have been used in conjunction with attention. After completing this tutorial, you will better understand how the attention mechanism is incorporated into different […]

A Bird’s Eye View of Research on Attention

By Stefania Cristina on January 6, 2023 in Attention 7

Attention is a concept that is scientifically studied across multiple disciplines, including psychology, neuroscience, and, more recently, machine learning. While all disciplines may have produced their own definitions for attention, one core quality they can all agree on is that attention is a mechanism for making both biological and artificial neural systems more flexible. In […]

← Previous 1 2

Navigation

Tag Archives | transformer