Search results for "Natural Language Processing"

How to Implement Scaled Dot-Product Attention from Scratch in TensorFlow and Keras

By Stefania Cristina on January 6, 2023 in Attention 5

Having familiarized ourselves with the theory behind the Transformer model and its attention mechanism, we’ll start our journey of implementing a complete Transformer model by first seeing how to implement the scaled-dot product attention. The scaled dot-product attention is an integral part of the multi-head attention, which, in turn, is an important component of both […]

The Transformer Positional Encoding Layer in Keras, Part 2

By Mehreen Saeed on January 6, 2023 in Attention 15

In part 1, a gentle introduction to positional encoding in transformer models, we discussed the positional encoding layer of the transformer model. We also showed how you could implement this layer and its functions yourself in Python. In this tutorial, you’ll implement the positional encoding layer in Keras and Tensorflow. You can then use this […]

muhammad-murtaza-ghani-CIVbJZR8aAk-unsplash

A Gentle Introduction to Positional Encoding in Transformer Models, Part 1

By Mehreen Saeed on January 6, 2023 in Attention 45

In languages, the order of the words and their position in a sentence really matters. The meaning of the entire sentence can change if the words are re-ordered. When implementing NLP solutions, recurrent neural networks have an inbuilt mechanism that deals with the order of sequences. The transformer model, however, does not use recurrence or […]

A Bird’s Eye View of Research on Attention

By Stefania Cristina on January 6, 2023 in Attention 7

Attention is a concept that is scientifically studied across multiple disciplines, including psychology, neuroscience, and, more recently, machine learning. While all disciplines may have produced their own definitions for attention, one core quality they can all agree on is that attention is a mechanism for making both biological and artificial neural systems more flexible. In […]

When to Use MLP, CNN, and RNN Neural Networks

By Jason Brownlee on August 15, 2022 in Deep Learning 134

What neural network is appropriate for your predictive modeling problem? It can be difficult for a beginner to the field of deep learning to know what type of network to use. There are so many types of networks to choose from and new methods being published and discussed every day. To make things worse, most […]

Feature Engineering for Machine Learning

8 Top Books on Data Cleaning and Feature Engineering

By Jason Brownlee on June 30, 2020 in Data Preparation 21

Data preparation is the transformation of raw data into a form that is more appropriate for modeling. It is a challenging topic to discuss as the data differs in form, type, and structure from project to project. Nevertheless, there are common data preparation tasks across projects. It is a huge field of study and goes […]

Learning Curves of Cross-Entropy Loss for a Deep Learning Model

TensorFlow 2 Tutorial: Get Started in Deep Learning with tf.keras

By Jason Brownlee on August 2, 2022 in Deep Learning 135

Predictive modeling with deep learning is a skill that modern developers need to know. TensorFlow is the premier open-source deep learning framework developed and maintained by Google. Although using TensorFlow directly can be challenging, the modern tf.keras API brings Keras’s simplicity and ease of use to the TensorFlow project. Using tf.keras allows you to design, […]

14 Different Types of Learning in Machine Learning

By Jason Brownlee on November 11, 2019 in Start Machine Learning 101

Machine learning is a large field of study that overlaps with and inherits ideas from many related fields such as artificial intelligence. The focus of the field is learning, that is, acquiring skills or knowledge from experience. Most commonly, this means synthesizing useful concepts from historical data. As such, there are many different types of […]

Discrete Probability Distributions for Machine Learning

By Jason Brownlee on October 6, 2020 in Probability 21

The probability for a discrete random variable can be summarized with a discrete probability distribution. Discrete probability distributions are used in machine learning, most notably in the modeling of binary and multi-class classification problems, but also in evaluating the performance for binary classification models, such as the calculation of confidence intervals, and in the modeling […]

9 Books on Generative Adversarial Networks (GANs)

By Jason Brownlee on August 21, 2019 in Generative Adversarial Networks 14

Generative Adversarial Networks, or GANs for short, were first described in the 2014 paper by Ian Goodfellow, et al. titled “Generative Adversarial Networks.” Since then, GANs have seen a lot of attention given that they are perhaps one of the most effective techniques for generating large, high-quality synthetic images. As such, a number of books […]

← Previous 1 … 3 4 5 … 13 Next →