Archive | Machine Learning Process

How to Get Started with Kaggle

How to Get Started with Kaggle

4-Step Process for Getting Started and Getting Good at Competitive Machine Learning. Kaggle is a community and site for hosting machine learning competitions. Competitive machine learning can be a great way to develop and practice your skills, as well as demonstrate your capabilities. In this post, you will discover a simple 4-step process to get […]

Continue Reading 2
Swedish Auto Insurance Dataset

10 Standard Datasets for Practicing Applied Machine Learning

The key to getting good at applied machine learning is practicing on lots of different datasets. This is because each problem is different, requiring subtly different data preparation and modeling methods. In this post, you will discover 10 top standard machine learning datasets that you can use for practice. Let’s dive in. Overview A structured¬†Approach […]

Continue Reading 2
How To Deploy Your Predictive Model To Production

Deploy Your Predictive Model To Production

5 Best Practices For Operationalizing¬†Machine Learning. Not all predictive models are at Google-scale. Sometimes you develop a small predictive model that you want to put in your software. I recently received this reader question: Actually, there is a part that is missing in my knowledge about machine learning. All tutorials give you the steps up […]

Continue Reading 8
Data Leakage in Machine Learning

Data Leakage in Machine Learning

Data leakage is a big problem in machine learning when developing predictive models. Data leakage is when information from outside the training dataset is used to create the model. In this post you will discover the problem of data leakage in predictive modeling. After reading this post you will know: What is data leakage is […]

Continue Reading 4
Machine Learning Checklist

How to Use a Machine Learning Checklist to Get Accurate Predictions, Reliably (even if you are a beginner)

How do you get accurate results using machine learning on problem after problem? The difficulty is that each problem is unique, requiring different data sources, features, algorithms, algorithm configurations and on and on. The solution is to use a checklist that guarantees a good result every time. In this post you will discover a checklist […]

Continue Reading 23