Blog - Page 47 of 136

Box Plot of PCA Number of Components vs. Classification Accuracy

Principal Component Analysis for Dimensionality Reduction in Python

By Jason Brownlee on August 18, 2020 in Data Preparation 83

Reducing the number of input variables for a predictive model is referred to as dimensionality reduction. Fewer input variables can result in a simpler predictive model that may have better performance when making predictions on new data. Perhaps the most popular technique for dimensionality reduction in machine learning is Principal Component Analysis, or PCA for […]

Introduction to Dimensionality Reduction for Machine Learning

By Jason Brownlee on June 30, 2020 in Data Preparation 11

The number of input variables or features for a dataset is referred to as its dimensionality. Dimensionality reduction refers to techniques that reduce the number of input variables in a dataset. More input features often make a predictive modeling task more challenging to model, more generally referred to as the curse of dimensionality. High-dimensionality statistics […]

Box Plot of Gradient Boosting Ensemble Tree Depth vs. Classification Accuracy

How to Develop a Gradient Boosting Machine Ensemble in Python

By Jason Brownlee on April 27, 2021 in Ensemble Learning 20

The Gradient Boosting Machine is a powerful ensemble machine learning algorithm that uses decision trees. Boosting is a general ensemble technique that involves sequentially adding models to the ensemble where subsequent models correct the performance of prior models. AdaBoost was the first algorithm to deliver on the promise of boosting. Gradient boosting is a generalization […]

Box Plot of AdaBoost Ensemble Weak Learner Depth vs. Classification Accuracy

How to Develop an AdaBoost Ensemble in Python

By Jason Brownlee on April 27, 2021 in Ensemble Learning 13

Boosting is a class of ensemble machine learning algorithms that involve combining the predictions from many weak learners. A weak learner is a model that is very simple, although has some skill on the dataset. Boosting was a theoretical concept long before a practical algorithm could be developed, and the AdaBoost (adaptive boosting) algorithm was […]

Difference Between Algorithm and Model in Machine Learning

By Jason Brownlee on August 19, 2020 in Machine Learning Algorithms 66

Machine learning involves the use of machine learning algorithms and models. For beginners, this is very confusing as often “machine learning algorithm” is used interchangeably with “machine learning model.” Are they the same thing or something different? As a developer, your intuition with “algorithms” like sort algorithms and search algorithms will help to clear up […]

Box Plot of Random Subspace Ensemble Number of Features vs. Classification Accuracy

How to Develop a Bagging Ensemble with Python

By Jason Brownlee on April 27, 2021 in Ensemble Learning 16

Bagging is an ensemble machine learning algorithm that combines the predictions from many decision trees. It is also easy to implement given that it has few key hyperparameters and sensible heuristics for configuring these hyperparameters. Bagging performs well in general and provides the basis for a whole field of ensemble of decision tree algorithms such […]

A Gentle Introduction to Degrees of Freedom in Machine Learning

By Jason Brownlee on August 19, 2020 in Statistics 14

Degrees of freedom is an important concept from statistics and engineering. It is often employed to summarize the number of values used in the calculation of a statistic, such as a sample statistic or in a statistical hypothesis test. In machine learning, the degrees of freedom may refer to the number of parameters in the […]

Box Plot of Extra Trees Minimum Samples per Split vs. Classification Accuracy

How to Develop an Extra Trees Ensemble with Python

By Jason Brownlee on April 27, 2021 in Ensemble Learning 14

Extra Trees is an ensemble machine learning algorithm that combines the predictions from many decision trees. It is related to the widely used random forest algorithm. It can often achieve as-good or better performance than the random forest algorithm, although it uses a simpler algorithm to construct the decision trees used as members of the […]

Box Plot of Random Forest Bootstrap Sample Size vs. Classification Accuracy

How to Develop a Random Forest Ensemble in Python

By Jason Brownlee on April 27, 2021 in Ensemble Learning 47

Random forest is an ensemble machine learning algorithm. It is perhaps the most popular and widely used machine learning algorithm given its good or excellent performance across a wide range of classification and regression predictive modeling problems. It is also easy to use given that it has few key hyperparameters and sensible heuristics for configuring […]

Box Plot of Soft Voting Ensemble Compared to Standalone Models for Binary Classification

How to Develop Voting Ensembles With Python

By Jason Brownlee on April 27, 2021 in Ensemble Learning 82

Voting is an ensemble machine learning algorithm. For regression, a voting ensemble involves making a prediction that is the average of multiple other regression models. In classification, a hard voting ensemble involves summing the votes for crisp class labels from other models and predicting the class with the most votes. A soft voting ensemble involves […]

← Previous 1 … 46 47 48 … 136 Next →

Navigation

MachineLearningMastery.com