Search results for "text summarization"

What are Large Language Models

By Adrian Tam on July 20, 2023 in ChatGPT 10

Large language models (LLMs) are recent advances in deep learning models to work on human languages. Some great use cases of LLMs have been demonstrated. A large language model is a trained deep-learning model that understands and generates text in a human-like fashion. Behind the scenes, it is a large transformer model that does all […]

Adding a Custom Attention Layer to a Recurrent Neural Network in Keras

By Mehreen Saeed on January 6, 2023 in Attention 57

Deep learning networks have gained immense popularity in the past few years. The “attention mechanism” is integrated with deep learning networks to improve their performance. Adding an attention component to the network has shown significant improvement in tasks such as machine translation, image recognition, text summarization, and similar applications. This tutorial shows how to add […]

3 Levels of Deep Learning Competence

By Jason Brownlee on August 19, 2019 in Deep Learning 16

Deep learning is not a magic bullet, but the techniques have been shown to be highly effective in a large number of very challenging problem domains. This means that there is a ton of demand from businesses for effective deep learning practitioners. The problem is, how can the average business differentiate between good and bad practitioners? […]

How to Run Deep Learning Experiments on a Linux Server

By Jason Brownlee on August 19, 2019 in Deep Learning 6

After you write your code, you must run your deep learning experiments on large computers with lots of RAM, CPU, and GPU resources, often a Linux server in the cloud. Recently, I was asked the question: “How do you run your deep learning experiments?” This is a good nuts-and-bolts question that I love answering. In […]

How to Implement a Beam Search Decoder for Natural Language Processing

By Jason Brownlee on June 3, 2020 in Deep Learning for Natural Language Processing 51

Natural language processing tasks, such as caption generation and machine translation, involve generating sequences of words. Models developed for these problems often operate by generating probability distributions across the vocabulary of output words and it is up to decoding algorithms to sample the probability distributions to generate the most likely sequences of words. In this […]
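
The decoding step described in this excerpt can be sketched in a few lines of Python. This is a minimal illustration, not the code from the linked post: the function name `beam_search_decoder`, the input `probs`, and the beam width `k` are all assumptions made for the example, and each time step's distribution is treated as fixed rather than conditioned on the words already chosen, which a real model would not do.

```python
import math

def beam_search_decoder(probs, k):
    """Keep the k most likely sequences across a list of per-step distributions.

    probs: list of rows, one per time step; each row holds the probability
           of every vocabulary index at that step (hypothetical input).
    k:     beam width, the number of candidate sequences kept per step.
    """
    # Each beam is (sequence of vocabulary indices, negative log-likelihood).
    beams = [([], 0.0)]
    for row in probs:
        candidates = []
        for seq, score in beams:
            for idx, p in enumerate(row):
                if p <= 0.0:
                    continue  # skip impossible words; log(0) is undefined
                candidates.append((seq + [idx], score - math.log(p)))
        # A lower negative log-likelihood means a more likely sequence.
        beams = sorted(candidates, key=lambda c: c[1])[:k]
    return beams

# Two time steps over a three-word vocabulary, beam width of 2.
example = [[0.1, 0.6, 0.3],
           [0.5, 0.2, 0.3]]
for seq, score in beam_search_decoder(example, 2):
    print(seq, round(math.exp(-score), 2))
```

Setting `k` to 1 reduces the sketch to a greedy search that keeps only the single most likely word at each step.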

Caption Generation with the Inject and Merge Encoder-Decoder Models

By Jason Brownlee on August 7, 2019 in Deep Learning for Natural Language Processing 6

Caption generation is a challenging artificial intelligence problem that draws on both computer vision and natural language processing. The encoder-decoder recurrent neural network architecture has been shown to be effective at this problem. The implementation of this architecture can be distilled into inject-based and merge-based models, and both make different assumptions about the role […]

What is Teacher Forcing for Recurrent Neural Networks?

By Jason Brownlee on April 8, 2021 in Long Short-Term Memory Networks 51

Teacher forcing is a method for quickly and efficiently training recurrent neural network models that use the ground truth from a prior time step as input. It is a network training method critical to the development of deep learning language models used in machine translation, text summarization, and image captioning, among many other applications. In […]
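
The idea in this excerpt, feeding the ground truth from the prior time step as input, can be shown with a small data-preparation sketch. It is only an illustration under assumptions: the helper `teacher_forcing_pairs` and the start token `"<s>"` are hypothetical names chosen for the example, not taken from the linked post.

```python
def teacher_forcing_pairs(target, start_token="<s>"):
    """Pair each target token with the input the model sees when predicting it.

    Under teacher forcing the input at step t is the ground-truth token from
    step t-1, not the model's own (possibly wrong) previous prediction.
    """
    # Shift the target right by one step and prepend a start-of-sequence token.
    decoder_inputs = [start_token] + target[:-1]
    return list(zip(decoder_inputs, target))

# Each pair reads (input fed to the model, token it should predict).
for fed, expected in teacher_forcing_pairs(["Mary", "had", "a", "lamb"]):
    print(fed, "->", expected)
```

At inference time no ground truth is available, so the model's own previous output takes the place of the shifted target.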

How to Develop an Encoder-Decoder Model for Sequence-to-Sequence Prediction in Keras

By Jason Brownlee on August 27, 2020 in Long Short-Term Memory Networks 390

The encoder-decoder model provides a pattern for using recurrent neural networks to address challenging sequence-to-sequence prediction problems such as machine translation. Encoder-decoder models can be developed in the Keras Python deep learning library, and an example of a neural machine translation system developed with this model has been described on the Keras blog, with sample […]

Gentle Introduction to Statistical Language Modeling and Neural Language Models

By Jason Brownlee on August 7, 2019 in Deep Learning for Natural Language Processing 10

Language modeling is central to many important natural language processing tasks. Recently, neural-network-based language models have demonstrated better performance than classical methods both standalone and as part of more challenging natural language processing tasks. In this post, you will discover language modeling for natural language processing. After reading this post, you will know: Why language […]

Gentle Introduction to Global Attention for Encoder-Decoder Recurrent Neural Networks

By Jason Brownlee on August 14, 2019 in Long Short-Term Memory Networks 12

The encoder-decoder model provides a pattern for using recurrent neural networks to address challenging sequence-to-sequence prediction problems such as machine translation. Attention is an extension to the encoder-decoder model that improves the performance of the approach on longer sequences. Global attention is a simplification of attention that may be easier to implement in declarative deep […]
