Search results for "attention"

The Vision Transformer Model

By Stefania Cristina on January 6, 2023 in Attention 5

With the Transformer architecture revolutionizing the implementation of attention, and achieving very promising results in the natural language processing domain, it was only a matter of time before we could see its application in the computer vision domain too. This was eventually achieved with the implementation of the Vision Transformer (ViT). In this tutorial, you […]

The Transformer Positional Encoding Layer in Keras, Part 2

By Mehreen Saeed on January 6, 2023 in Attention 15

In part 1, a gentle introduction to positional encoding in transformer models, we discussed the positional encoding layer of the transformer model. We also showed how you could implement this layer and its functions yourself in Python. In this tutorial, you’ll implement the positional encoding layer in Keras and TensorFlow. You can then use this […]

A Gentle Introduction to Positional Encoding in Transformer Models, Part 1

By Mehreen Saeed on January 6, 2023 in Attention 45

In languages, the order of the words and their position in a sentence really matter. The meaning of the entire sentence can change if the words are re-ordered. When implementing NLP solutions, recurrent neural networks have an inbuilt mechanism that deals with the order of sequences. The transformer model, however, does not use recurrence or […]

The Transformer Model

By Stefania Cristina on January 6, 2023 in Attention 26

We have already familiarized ourselves with the concept of self-attention as implemented by the Transformer attention mechanism for neural machine translation. We will now be shifting our focus to the details of the Transformer architecture itself to discover how self-attention can be implemented without relying on the use of recurrence and convolutions. In this tutorial, […]

Overview of Some Deep Learning Libraries

By Adrian Tam on July 27, 2022 in Deep Learning 5

Machine learning is a broad topic. Deep learning, in particular, is a way of using neural networks for machine learning. A neural network is probably a concept older than machine learning, dating back to the 1950s. Unsurprisingly, there were many libraries created for it. The following aims to give an overview of some of the […]

Introduction to the Python Deep Learning Library TensorFlow

By Jason Brownlee on July 27, 2022 in Deep Learning 34

TensorFlow is a Python library for fast numerical computing created and released by Google. It is a foundation library that can be used to create Deep Learning models directly or by using wrapper libraries that simplify the process built on top of TensorFlow. In this post, you will discover the TensorFlow library for Deep Learning. […]

Setting Breakpoints and Exception Hooks in Python

By Stefania Cristina on June 21, 2022 in Python for Machine Learning 0

There are different ways of debugging code in Python, one of which is to introduce breakpoints into the code at points where one would like to invoke a Python debugger. The statements used to enter a debugging session at different call sites depend on the version of the Python interpreter that one is working with, […]

A Guide to Getting Datasets for Machine Learning in Python

By Adrian Tam on June 21, 2022 in Python for Machine Learning 3

Compared to other programming exercises, a machine learning project is a blend of code and data. You need both to achieve the result and do something useful. Over the years, many well-known datasets have been created, and many have become standards or benchmarks. In this tutorial, we are going to see how we can obtain […]

Comments, Docstrings, and Type Hints in Python Code

By Adrian Tam on June 21, 2022 in Python for Machine Learning 1

The source code of a program should be readable to humans. Making it run correctly is only half of its purpose. Without properly commented code, it would be difficult for anyone, including the future you, to understand the rationale and intent behind the code. It would also make the code impossible to maintain. In […]

A Gentle Introduction to Multivariate Calculus

By Stefania Cristina on July 19, 2021 in Calculus 8

It is often desirable to study functions that depend on many variables. Multivariate calculus provides us with the tools to do so by extending the concepts that we find in calculus, such as the computation of the rate of change, to multiple variables. It plays an essential role in the process of training a neural […]
