Search results for "word embedding"

How to Implement Multi-Head Attention from Scratch in TensorFlow and Keras

By Stefania Cristina on January 6, 2023 in Attention 28

We have already familiarized ourselves with the theory behind the Transformer model and its attention mechanism. We have already started our journey of implementing a complete model by seeing how to implement the scaled-dot product attention. We shall now progress one step further into our journey by encapsulating the scaled-dot product attention into a multi-head […]

How to Implement Scaled Dot-Product Attention from Scratch in TensorFlow and Keras

By Stefania Cristina on January 6, 2023 in Attention 5

Having familiarized ourselves with the theory behind the Transformer model and its attention mechanism, we’ll start our journey of implementing a complete Transformer model by first seeing how to implement the scaled-dot product attention. The scaled dot-product attention is an integral part of the multi-head attention, which, in turn, is an important component of both […]

The Transformer Positional Encoding Layer in Keras, Part 2

By Mehreen Saeed on January 6, 2023 in Attention 15

In part 1, a gentle introduction to positional encoding in transformer models, we discussed the positional encoding layer of the transformer model. We also showed how you could implement this layer and its functions yourself in Python. In this tutorial, you’ll implement the positional encoding layer in Keras and Tensorflow. You can then use this […]

The Attention Mechanism from Scratch

By Stefania Cristina on January 6, 2023 in Attention 27

The attention mechanism was introduced to improve the performance of the encoder-decoder model for machine translation. The idea behind the attention mechanism was to permit the decoder to utilize the most relevant parts of the input sequence in a flexible manner, by a weighted combination of all the encoded input vectors, with the most relevant […]

Setting Breakpoints and Exception Hooks in Python

By Stefania Cristina on June 21, 2022 in Python for Machine Learning 0

There are different ways of debugging code in Python, one of which is to introduce breakpoints into the code at points where one would like to invoke a Python debugger. The statements used to enter a debugging session at different call sites depend on the version of the Python interpreter that one is working with, […]

How can I transform text into numbers for machine learning?

By Jason Brownlee on July 10, 2020 in 0

A How can I transform text into numbers for machine learning? Text must be converted to numbers before you can use it as input to a machine learning model. The first step is to determine your vocabulary of words, then assign a unique integer to each word. You control the complexity of your modeling task […]

How to Encode Categorical Data for Deep Learning in Keras

3 Ways to Encode Categorical Variables for Deep Learning

By Jason Brownlee on August 27, 2020 in Deep Learning 175

Machine learning and deep learning models, like those in Keras, require all input and output variables to be numeric. This means that if your data contains categorical data, you must encode it to numbers before you can fit and evaluate a model. The two most popular techniques are an integer encoding and a one hot […]

14 Different Types of Learning in Machine Learning

By Jason Brownlee on November 11, 2019 in Start Machine Learning 101

Machine learning is a large field of study that overlaps with and inherits ideas from many related fields such as artificial intelligence. The focus of the field is learning, that is, acquiring skills or knowledge from experience. Most commonly, this means synthesizing useful concepts from historical data. As such, there are many different types of […]

How to Develop a Deep Learning Photo Caption Generator from Scratch

By Jason Brownlee on December 23, 2020 in Deep Learning for Natural Language Processing 1,196

Develop a Deep Learning Model to Automatically Describe Photographs in Python with Keras, Step-by-Step. Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph. It requires both methods from computer vision to understand the content of the image and a language model from the field of […]

Plot of the Multichannel Convolutional Neural Network For Text

How to Develop a Multichannel CNN Model for Text Classification

By Jason Brownlee on September 3, 2020 in Deep Learning for Natural Language Processing 214

A standard deep learning model for text classification and sentiment analysis uses a word embedding layer and one-dimensional convolutional neural network. The model can be expanded by using multiple parallel convolutional neural networks that read the source document using different kernel sizes. This, in effect, creates a multichannel convolutional neural network for text that reads […]

← Previous 1 2 3 … 7 Next →