Not all data attributes are created equal. More is not always better when it comes to attributes or columns in your dataset. In this post you will discover how to select attributes in your data before creating a machine learning model using the scikit-learn library. Let’s get started. Update: For a more recent tutorial on feature selection in […]
Archive | Python Machine Learning
How to Load Data in Python with Scikit-Learn
Before you can build machine learning models, you need to load your data into memory. In this post you will discover how to load data for machine learning in Python using scikit-learn. Let’s get started. Update March/2018: Added alternate link to download the dataset as the original appears to have been taken down. Packaged Datasets […]
Quick and Dirty Data Analysis with Pandas
Before you can select and prepare your data for modeling, you need to understand what you’ve got to start with. If you’re a using the Python stack for machine learning, a library that you can use to better understand your data is Pandas. In this post you will discover some quick and dirty recipes for […]
Prepare Data for Machine Learning in Python with Pandas
If you are using the Python stack for studying and applying machine learning, then the library that you will want to use for data analysis and data manipulation is Pandas. This post gives you a quick introduction to the Pandas library and point you in the right direction for getting started. Let’s get started. Data […]
Machine Learning Algorithm Recipes in scikit-learn
You have to get your hands dirty. You can read all of the blog posts and watch all the videos in the world, but you’re not actually going to start really get machine learning until you start practicing. The scikit-learn Python library is very easy to get up and running. Nevertheless I see a lot […]
IPython from the shell to a book with a single tool with Fernando Perez
If you get serious with data analysis and machine learning in python then you will make good use of IPython notebooks. In this post we will review some takeaway points made by Fernando Perez, the creator of IPython in a keynote presentation at SciPy 2013. The title of the talk was IPython: from the shell to […]
How to Get Started with Machine Learning in Python
The Python conference PyCon2014 has held recently and the videos for the conference are online. I have been working my way through the interesting machine learning ones and will share a few on this over the coming weeks. A great talk if you are starting out in data science or machine learning in python was […]
A Gentle Introduction to Scikit-Learn: A Python Machine Learning Library
If you are a Python programmer or you are looking for a robust library you can use to bring machine learning into a production system then a library that you will want to seriously consider is scikit-learn. In this post you will get an overview of the scikit-learn library and useful references of where you […]
Python Machine Learning Books
Python is a very popular language for machine learning. The machine learning libraries and frameworks in Python (especially around the SciPy stack) are maturing quickly. They may not be as feature rich as R, but they are robust enough for small to medium scale production implementation. If you are a Python programmer looking to get […]
Project Spotlight: Event Recommendation in Python with Artem Yankov
This is a project spotlight with Artem Yankov. Could you please introduce yourself? My name is Artem Yankov, I have worked as a software engineer for Badgeville for the last 3 years. I’m using there Ruby and Scala although my prior background includes use of various languages such as: Assembly, C/C++, Python, Clojure and JS. I […]