Search results for "language model"

Difference Between Algorithm and Model in Machine Learning

By Jason Brownlee on August 19, 2020 in Machine Learning Algorithms 66

Machine learning involves the use of machine learning algorithms and models. For beginners, this is very confusing as often “machine learning algorithm” is used interchangeably with “machine learning model.” Are they the same thing or something different? As a developer, your intuition with “algorithms” like sort algorithms and search algorithms will help to clear up […]

Histogram Plots of the Variables for the Phoneme Dataset

Predictive Model for the Phoneme Imbalanced Classification Dataset

By Jason Brownlee on January 5, 2021 in Imbalanced Classification 22

Many binary classification tasks do not have an equal number of examples from each class, e.g. the class distribution is skewed or imbalanced. Nevertheless, accuracy is equally important in both classes. An example is the classification of vowel sounds from European languages as either nasal or oral on speech recognition where there are many more […]

How to Develop LSTM Models for Time Series Forecasting

By Jason Brownlee on August 28, 2020 in Deep Learning for Time Series 981

Long Short-Term Memory networks, or LSTMs for short, can be applied to time series forecasting. There are many types of LSTM models that can be used for each specific type of time series forecasting problem. In this tutorial, you will discover how to develop a suite of LSTM models for a range of standard time […]

Depiction of CNN Model for Accelerompter Data

Deep Learning Models for Human Activity Recognition

By Jason Brownlee on August 5, 2019 in Deep Learning for Time Series 78

Human activity recognition, or HAR, is a challenging time series classification task. It involves predicting the movement of a person based on sensor data and traditionally involves deep domain expertise and methods from signal processing to correctly engineer features from the raw data in order to fit a machine learning model. Recently, deep learning methods […]

What programming language should I use for machine learning?

By Jason Brownlee on April 13, 2018 in 0

A What programming language should I use for machine learning? The specific programming language or platform that you use does not matter. I strongly believe that the best thing to focus on is how to work through machine learning problems end-to-end (learn more). That being said, I think if you’re not a strong programmer, that […]

Plot of the Multichannel Convolutional Neural Network For Text

How to Develop a Multichannel CNN Model for Text Classification

By Jason Brownlee on September 3, 2020 in Deep Learning for Natural Language Processing 214

A standard deep learning model for text classification and sentiment analysis uses a word embedding layer and one-dimensional convolutional neural network. The model can be expanded by using multiple parallel convolutional neural networks that read the source document using different kernel sizes. This, in effect, creates a multichannel convolutional neural network for text that reads […]

How to Implement Beam Search Decoder for Natural Language Processing

How to Implement a Beam Search Decoder for Natural Language Processing

By Jason Brownlee on June 3, 2020 in Deep Learning for Natural Language Processing 51

Natural language processing tasks, such as caption generation and machine translation, involve generating sequences of words. Models developed for these problems often operate by generating probability distributions across the vocabulary of output words and it is up to decoding algorithms to sample the probability distributions to generate the most likely sequences of words. In this […]

How to Configure an Encoder-Decoder Model for Neural Machine Translation

By Jason Brownlee on August 7, 2019 in Deep Learning for Natural Language Processing 22

The encoder-decoder architecture for recurrent neural networks is achieving state-of-the-art results on standard machine translation benchmarks and is being used in the heart of industrial translation services. The model is simple, but given the large amount of data required to train it, tuning the myriad of design decisions in the model in order get top […]

Encoder-Decoder Recurrent Neural Network Models for Neural Machine Translation

By Jason Brownlee on August 7, 2019 in Deep Learning for Natural Language Processing 22

The encoder-decoder architecture for recurrent neural networks is the standard neural machine translation method that rivals and in some cases outperforms classical statistical machine translation methods. This architecture is very new, having only been pioneered in 2014, although, has been adopted as the core technology inside Google’s translate service. In this post, you will discover […]

Caption Generation with the Inject and Merge Architectures for the Encoder-Decoder Model

Caption Generation with the Inject and Merge Encoder-Decoder Models

By Jason Brownlee on August 7, 2019 in Deep Learning for Natural Language Processing 6

Caption generation is a challenging artificial intelligence problem that draws on both computer vision and natural language processing. The encoder-decoder recurrent neural network architecture has been shown to be effective at this problem. The implementation of this architecture can be distilled into inject and merge based models, and both make different assumptions about the role […]

← Previous 1 2 3 4 … 30 Next →