Matrices that contain mostly zero values are called sparse, distinct from matrices where most of the values are non-zero, called dense. Large sparse matrices are common in general and especially in applied machine learning, such as in data that contains counts, data encodings that map categories to counts, and even in whole subfields of machine […]
Search results for "normalize"
How to Calculate Principal Component Analysis (PCA) from Scratch in Python
An important machine learning method for dimensionality reduction is called Principal Component Analysis. It is a method that uses simple matrix operations from linear algebra and statistics to calculate a projection of the original data into the same number or fewer dimensions. In this tutorial, you will discover the Principal Component Analysis machine learning method […]
A Gentle Introduction to Expected Value, Variance, and Covariance with NumPy
Fundamental statistics are useful tools in applied machine learning for a better understanding your data. They are also the tools that provide the foundation for more advanced linear algebra operations and machine learning methods, such as the covariance matrix and principal component analysis respectively. As such, it is important to have a strong grip on […]
Gentle Introduction to Eigenvalues and Eigenvectors for Machine Learning
Matrix decompositions are a useful tool for reducing a matrix to their constituent parts in order to simplify a range of more complex operations. Perhaps the most used type of matrix decomposition is the eigendecomposition that decomposes a matrix into eigenvectors and eigenvalues. This decomposition also plays a role in methods used in machine learning, […]
A Standard Multivariate, Multi-Step, and Multi-Site Time Series Forecasting Problem
Real-world time series forecasting is challenging for a whole host of reasons not limited to problem features such as having multiple input variables, the requirement to predict multiple time steps, and the need to perform the same type of prediction for multiple physical sites. In this post, you will discover a standardized yet complex time […]
How to Develop a Neural Machine Translation System from Scratch
Develop a Deep Learning Model to Automatically Translate from German to English in Python with Keras, Step-by-Step. Machine translation is a challenging task that traditionally involves large statistical models developed using highly sophisticated linguistic knowledge. Neural machine translation is the use of deep neural networks for the problem of machine translation. In this tutorial, you […]
How to Prepare a French-to-English Dataset for Machine Translation
Machine translation is the challenging task of converting text from a source language into coherent and matching text in a target language. Neural machine translation systems such as encoder-decoder recurrent neural networks are achieving state-of-the-art results for machine translation with a single end-to-end system trained directly on source and target language. Standard datasets are required […]
Why Applied Machine Learning Is Hard
How to Handle the Intractability of Applied Machine Learning. Applied machine learning is challenging. You must make many decisions where there is no known “right answer” for your specific problem, such as: What framing of the problem to use? What input and output data to use? What learning algorithm to use? What algorithm configuration to […]
How to Prepare News Articles for Text Summarization
Text summarization is the task of creating a short, accurate, and fluent summary of an article. A popular and free dataset for use in text summarization experiments with deep learning methods is the CNN News story dataset. In this tutorial, you will discover how to prepare the CNN News Dataset for text summarization. After completing […]
A Gentle Introduction to Calculating the BLEU Score for Text in Python
BLEU, or the Bilingual Evaluation Understudy, is a score for comparing a candidate translation of text to one or more reference translations. Although developed for translation, it can be used to evaluate text generated for a suite of natural language processing tasks. In this tutorial, you will discover the BLEU score for evaluating and scoring […]