Tag Archives | machine translation

The Transformer Model

By Stefania Cristina on January 6, 2023 in Attention 26

We have already familiarized ourselves with the concept of self-attention as implemented by the Transformer attention mechanism for neural machine translation. We will now be shifting our focus to the details of the Transformer architecture itself to discover how self-attention can be implemented without relying on the use of recurrence and convolutions. In this tutorial, […]

The Transformer Attention Mechanism

By Stefania Cristina on January 6, 2023 in Attention 18

Before the introduction of the Transformer model, the use of attention for neural machine translation was implemented by RNN-based encoder-decoder architectures. The Transformer model revolutionized the implementation of attention by dispensing with recurrence and convolutions and, alternatively, relying solely on a self-attention mechanism. We will first focus on the Transformer attention mechanism in this tutorial […]

The Attention Mechanism from Scratch

By Stefania Cristina on January 6, 2023 in Attention 27

The attention mechanism was introduced to improve the performance of the encoder-decoder model for machine translation. The idea behind the attention mechanism was to permit the decoder to utilize the most relevant parts of the input sequence in a flexible manner, by a weighted combination of all the encoded input vectors, with the most relevant […]

Navigation

Tag Archives | machine translation