Tag Archives | decoder

Joining the Transformer Encoder and Decoder Plus Masking

By Stefania Cristina on January 6, 2023 in Attention 32

We have arrived at a point where we have implemented and tested the Transformer encoder and decoder separately, and we may now join the two together into a complete model. We will also see how to create padding and look-ahead masks by which we will suppress the input values that will not be considered in […]

Implementing the Transformer Decoder from Scratch in TensorFlow and Keras

By Stefania Cristina on January 6, 2023 in Attention 15

There are many similarities between the Transformer encoder and decoder, such as their implementation of multi-head attention, layer normalization, and a fully connected feed-forward network as their final sub-layer. Having implemented the Transformer encoder, we will now go ahead and apply our knowledge in implementing the Transformer decoder as a further step toward implementing the […]

A Bird’s Eye View of Research on Attention

By Stefania Cristina on January 6, 2023 in Attention 7

Attention is a concept that is scientifically studied across multiple disciplines, including psychology, neuroscience, and, more recently, machine learning. While all disciplines may have produced their own definitions for attention, one core quality they can all agree on is that attention is a mechanism for making both biological and artificial neural systems more flexible. In […]

Navigation

Tag Archives | decoder