Given the rise of smart electricity meters and the wide adoption of electricity generation technologies such as solar panels, a wealth of electricity usage data is now available. This data represents a multivariate time series of power-related variables that could in turn be used to model and even forecast future electricity consumption. Unlike other machine learning […]
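As a rough illustration of what "modeling a multivariate time series" can look like in practice, here is a minimal Keras sketch. The shapes, variable counts, and random placeholder data are assumptions for illustration, not the post's actual dataset or model.

```python
# A minimal sketch of framing multivariate usage data for forecasting.
# All shapes and data here are illustrative placeholders.
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense

n_timesteps, n_features = 24, 8                   # e.g. 24 hours of 8 power variables
X = np.random.rand(100, n_timesteps, n_features)  # placeholder usage windows
y = np.random.rand(100, 1)                        # next-step total consumption

model = Sequential([
    LSTM(64, input_shape=(n_timesteps, n_features)),  # read the window of variables
    Dense(1),                                         # forecast one future value
])
model.compile(loss='mse', optimizer='adam')
model.fit(X, y, epochs=2, verbose=0)
```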
Search results for "Long Short Term Memory Network"
1D Convolutional Neural Network Models for Human Activity Recognition
Human activity recognition is the problem of classifying sequences of accelerometer data, recorded by specialized harnesses or smartphones, into known, well-defined movements. Classical approaches to the problem involve hand-crafting features from the time series data based on fixed-size windows and training machine learning models, such as ensembles of decision trees. The difficulty is […]
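For a sense of the alternative the title promises, here is a hedged sketch of a 1D CNN that learns features directly from raw windows instead of hand-crafted ones. Window length, axis count, and class count are assumptions, not the post's dataset.

```python
# A sketch of the 1D CNN pattern: accelerometer windows in, activity classes out.
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv1D, MaxPooling1D, Flatten, Dense

n_timesteps, n_features, n_classes = 128, 3, 6    # assumed window/axis/class counts
X = np.random.rand(64, n_timesteps, n_features)   # placeholder windows
y = np.random.randint(n_classes, size=64)

model = Sequential([
    Conv1D(64, kernel_size=3, activation='relu',
           input_shape=(n_timesteps, n_features)),  # learn features from raw signal
    MaxPooling1D(pool_size=2),
    Flatten(),
    Dense(n_classes, activation='softmax'),
])
model.compile(loss='sparse_categorical_crossentropy', optimizer='adam')
model.fit(X, y, epochs=2, verbose=0)
```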
Encoder-Decoder Recurrent Neural Network Models for Neural Machine Translation
The encoder-decoder architecture for recurrent neural networks is the standard neural machine translation method that rivals and in some cases outperforms classical statistical machine translation methods. The architecture is very new, having only been pioneered in 2014, yet it has already been adopted as the core technology inside Google’s translate service. In this post, you will discover […]
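The core idea is simple: one recurrent network encodes the source sequence into a state, and a second network decodes the target sequence from that state. The following is a minimal sketch with assumed toy dimensions; real translation models add embeddings, attention, and a separate inference loop.

```python
# A minimal encoder-decoder sketch in Keras; dimensions are illustrative.
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, LSTM, Dense

n_units, src_dim, tgt_dim = 128, 50, 60  # assumed hidden size and vocab sizes

enc_in = Input(shape=(None, src_dim))
_, state_h, state_c = LSTM(n_units, return_state=True)(enc_in)  # encode the source

dec_in = Input(shape=(None, tgt_dim))
dec_out = LSTM(n_units, return_sequences=True)(
    dec_in, initial_state=[state_h, state_c])       # condition decoder on the source
out = Dense(tgt_dim, activation='softmax')(dec_out)

model = Model([enc_in, dec_in], out)
model.compile(loss='categorical_crossentropy', optimizer='adam')
```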
A Gentle Introduction to Exploding Gradients in Neural Networks
Exploding gradients are a problem where large error gradients accumulate and result in very large updates to neural network model weights during training. This can make your model unstable and unable to learn from your training data. In this post, you will discover the problem of exploding gradients with deep artificial neural […]
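A common remedy for exploding gradients is gradient clipping; the excerpt does not show it, so the sketch below is illustrative. The `clipnorm` and `clipvalue` arguments are standard Keras optimizer options.

```python
# Gradient clipping via Keras optimizer arguments (a common remedy).
from tensorflow.keras.optimizers import SGD

opt = SGD(learning_rate=0.01, clipnorm=1.0)     # rescale gradients whose L2 norm > 1
# opt = SGD(learning_rate=0.01, clipvalue=0.5)  # or clip each gradient element-wise
# model.compile(loss='mse', optimizer=opt)      # then compile your model with it
```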
Gentle Introduction to Global Attention for Encoder-Decoder Recurrent Neural Networks
The encoder-decoder model provides a pattern for using recurrent neural networks to address challenging sequence-to-sequence prediction problems such as machine translation. Attention is an extension to the encoder-decoder model that improves the performance of the approach on longer sequences. Global attention is a simplification of attention that may be easier to implement in declarative deep […]
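For reference, Luong-style global attention can be written in two lines, sketched here from the standard definitions rather than from this post: every encoder state is scored against the current decoder state, and a softmax over those scores weights the context vector.

```latex
% h_t: decoder state at step t; \bar{h}_s: encoder state at source position s.
a_t(s) = \frac{\exp\big(\mathrm{score}(h_t, \bar{h}_s)\big)}
              {\sum_{s'} \exp\big(\mathrm{score}(h_t, \bar{h}_{s'})\big)},
\qquad
c_t = \sum_{s} a_t(s)\, \bar{h}_s
```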
Deep Convolutional Neural Network for Sentiment Analysis (Text Classification)
Develop a Deep Learning Model to Automatically Classify Movie Reviews as Positive or Negative in Python with Keras, Step-by-Step. Word embeddings are a technique for representing text where different words with similar meaning have a similar real-valued vector representation. They are a key breakthrough that has enabled the strong performance of neural network models on […]
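The embedding-plus-CNN pattern the excerpt describes can be expressed in a few Keras layers. This is a hedged sketch, not the tutorial's model: vocabulary size, embedding dimension, and filter settings are placeholders.

```python
# A sketch of an embedding + 1D CNN text classifier (binary sentiment).
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, Conv1D, GlobalMaxPooling1D, Dense

vocab_size = 5000  # assumed vocabulary size

model = Sequential([
    Embedding(vocab_size, 100),                    # learn word vectors from data
    Conv1D(32, kernel_size=8, activation='relu'),  # detect local n-gram features
    GlobalMaxPooling1D(),                          # keep the strongest feature signal
    Dense(1, activation='sigmoid'),                # positive vs. negative
])
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
```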
How Does Attention Work in Encoder-Decoder Recurrent Neural Networks
Attention is a mechanism that was developed to improve the performance of the Encoder-Decoder RNN on machine translation. In this tutorial, you will discover the attention mechanism for the Encoder-Decoder model. After completing this tutorial, you will know: About the Encoder-Decoder model and attention mechanism for machine translation. How to implement the attention mechanism step-by-step. […]
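As a rough illustration of those steps, here is attention in plain NumPy with random placeholder states and the simple dot scoring function (an assumption for illustration, not the tutorial's code): score each encoder state against the decoder state, normalize with a softmax, and take the weighted sum as the context vector.

```python
# Attention step-by-step in NumPy, with placeholder states.
import numpy as np

encoder_states = np.random.rand(5, 16)   # 5 source steps, 16-dim states
decoder_state = np.random.rand(16)       # current decoder hidden state

scores = encoder_states @ decoder_state              # 1) alignment scores (dot score)
weights = np.exp(scores) / np.exp(scores).sum()      # 2) softmax -> attention weights
context = (weights[:, None] * encoder_states).sum(axis=0)  # 3) context vector
print(weights, context.shape)
```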
Primer on Neural Network Models for Natural Language Processing
Deep learning is having a large impact on the field of natural language processing. But, as a beginner, where do you start? Both deep learning and natural language processing are huge fields. What are the salient aspects of each field to focus on, and in which areas of NLP is deep learning having the most impact? […]
A Tour of Recurrent Neural Network Algorithms for Deep Learning
Recurrent neural networks, or RNNs, are a type of artificial neural network that adds additional weights to create cycles in the network graph, in an effort to maintain an internal state. The promise of adding state to neural networks is that they will be able to explicitly learn and exploit context in […]
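That internal state can be written as a bare recurrence. The sketch below uses random weights purely for illustration: the hidden state h depends on the current input and on its own previous value, which is exactly the cycle the paragraph describes.

```python
# A bare RNN recurrence in NumPy: h_t = tanh(W x_t + U h_{t-1} + b).
import numpy as np

n_in, n_hidden = 4, 8
W = np.random.rand(n_hidden, n_in)
U = np.random.rand(n_hidden, n_hidden)   # the "additional weights" forming cycles
b = np.zeros(n_hidden)

h = np.zeros(n_hidden)                   # internal state, carried across steps
for x_t in np.random.rand(10, n_in):     # 10 time steps of input
    h = np.tanh(W @ x_t + U @ h + b)     # state depends on input and prior state
print(h.shape)
```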
Techniques to Handle Very Long Sequences with LSTMs
Long Short-Term Memory or LSTM recurrent neural networks are capable of learning and remembering over long sequences of inputs. LSTMs work very well if your problem has one output for every input, like time series forecasting or text translation. But LSTMs can be challenging to use when you have very long input sequences and only […]
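One simple technique in this situation is to truncate very long sequences to a fixed length before training. The sketch below uses Keras' pad_sequences utility; the lengths are made up for illustration.

```python
# Truncating long sequences to a fixed length with Keras' pad_sequences.
from tensorflow.keras.preprocessing.sequence import pad_sequences

seqs = [list(range(5000)), list(range(300))]           # wildly varying lengths
X = pad_sequences(seqs, maxlen=250, truncating='pre')  # keep only the last 250 steps
print(X.shape)  # (2, 250)
```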