Getting started is much easier than you think.
In this post I show you the top-down approach for getting started in applied machine learning. You will discover the four steps to this approach. They should feel familiar because it’s probably the same top-down approach that you used to learn how to program. Namely, get the basics, practice a lot and dive into the details later after you’re hooked.
At the end of the post, I link to my mini-course that can shortcut the path and give you step-by-step instructions to follow to start and practice applied machine learning.
Beginners are Different
Beginners have an interest in machine learning but are not sure how to take that first step. They are confused because the material on blogs and in courses is almost always pitched at an intermediate level.
Typical books and university-level courses are bottom-up. They teach or require the mathematics before grinding through a few key algorithms and theories before finishing up. This can be a good approach if you have the time, patience and appropriate background. Not everyone has so much free time or the desire to move through so much low-level material before getting to the meat and potatoes of applied machine learning.
I get a lot of emails from beginners asking for advice on how to get started in machine learning. It’s a tough problem, because there are so many possibilities and so many things I could recommend. I tell them not to dive into the math and not to go straight back to school.
The students and professionals I advise are almost always programmers or have an engineering background, and I tell them that there is a much more efficient path into machine learning for them.
Solution is Top-Down
My advice for beginners in machine learning is to take a top-down approach.
I advise beginners to take a faster route to discover what applied machine learning is all about before dedicating huge time resources into studying the theory. It makes sense and it is familiar because it’s the way you get excited about programming first, before diving in and making it a focus of study and career.
The top down approach is to quickly learn the high-level step-by-step process of working through a machine learning problem end-to-end using a software tool. With modern platforms, it is possible to work through small problems in minutes to hours using complex state-of-the-art algorithms and rigorous validation and statistical hypothesis testing, all performed automatically within the tools.
It is after you are familiar and confident with the process that I advise you start looking deeper into the algorithms and theory side of machine learning. How first, why later.
We can summarize this top-down approach as follows:
- Learn the high-level process of applied machine learning.
- Learn how to use a tool enough to be able to work through problems.
- Practice on datasets, a lot.
- Transition into the details and theory of machine learning algorithms.
Applied Machine Learning Process
I have written a lot about the process of applied machine learning. I advocate a 6-step process for classification and regression type problems, the common problem types at the heart of most machine learning problems. The process is as follows:
- Problem Definition: Understand and clearly describe the problem that is being solved.
- Analyze Data: Understand the information available that will be used to develop a model.
- Prepare Data: Discover and expose the structure in the dataset.
- Evaluate Algorithms: Develop a robust test harness and baseline accuracy from which to improve and spot check algorithms.
- Improve Results: Leverage results to develop more accurate models.
- Present Results: Describe the problem and solution so that it can be understood by third parties.
By following this structured process on each problem you work through, you enforce a minimum level of rigour and dramatically increase the likelihood of getting good (or more likely excellent) results.
Use the Weka Machine Learning Workbench
The software platform for beginners to learn when getting started is the Weka Machine Learning Workbench.
I think the decision to use Weka when getting started is a complete no-brainer because:
- It provides a simple graphical user interface that encapsulates the process of applied machine learning outlined above.
- It facilitates algorithm and dataset exploration as well as rigours experiment design and analysis.
- It is free and open source, licensed under the GNU GPL.
- It is cross-platform and runs on Windows, Mac OS X and Linux (requires a Java virtual machine).
- It contains state-of-the-art algorithms with an impressive abundance of Decision Trees, Rule Based Algorithms and Ensemble methods, as well as others.
You can see for yourself how easy the platform is to use, I have written a number of 5-minute Weka tutorials, such as:
- How to run your first classifier
- How to design and run your first experiment
- How to tune an algorithm rigorously in a designed experiment
- How to use ensemble methods in Weka
Additionally, if you get right into Weka, you can run algorithms from the command line and integrate algorithms into your application via the application programming interface. It is an extensible platform and you can quickly and easily implement your own algorithms to the interface and use them in the GUI.
Practice, Practice, Practice, on Datasets
Once you are up and running with Weka, you need to practice the 6-step process of applied machine learning.
The Weka installation includes a data directory with many standard machine learning datasets, most taken from actual scientific problem domains. There is also a wealth of excellent datasets to trial and learn from on the UCI Machine Learning Repository. These datasets are an excellent place for you to get started learning and practicing.
- The datasets are small and easily fit into memory.
- The small size of the datasets also means that algorithms and experiments are quick to run.
- The problems and data are real, including noise, biases in sampling and data collection that you need to consider.
- The data is well understood so that you can leverage what is known and openly discuss the data with peers.
- There are known “good results” for you to compare to and recreate.
You can choose your own level of detail on each step of the structured process. I recommend spending no more than one-hour on each step when getting started. You can do and learn a lot about a problem in one hour with Weka, especially when designing and running experiments. This will keep your motivation and project velocity high.
The structured process encourages you to make observations and record results and findings as you work through a given problem. It is wise to keep these observations and findings together, perhaps in a project directory or Github project.
I recommend blogging about each of your projects, even each step of a project as you complete it. You can do this on your own blog (if you have one) or as Facebook or Google+ updates (that now support images and text formatting). I like the honesty that publicly blogging projects encourages. It also provides an indicator to your peers and colleagues that you are interested, serious about and developing some chops in applied machine learning.
Because the projects are small and the process is structured, you can quickly learn a lot about a problem and move through a number of projects. You can also collect data on problems of your own and use the same process to deliver useful and meaningful results on projects at work or for your own benefit.
The next step is to dive deeper into the algorithms and learn why they work and how to get more out of them. I recommend transitioning deeper into the subject by picking up the book Data Mining: Practical Machine Learning Tools and Techniques. It is written by the original authors of the Weka platform and provides a treatment of how and why the algorithms used in Weka work and other deeper concerns of machine learning.
The deeper knowledge will allow you to get more from the platform on your own custom problems. It will also allow you to better appreciate the methods in Weka and you will start to build an intuition as to the mapping between problem and algorithm types
In this post you discovered the top-down approach to getting started in machine learning that advocates learning the specific structured process, a powerful tool that supports this process and to practice applied machine learning in a series of focused projects.
You learned that this is the exact opposite of the traditional bottom-up approach that expects you to perform the heavy lifting in the field first, (before you even know if the field is right for you) and leaves you to figure out how to apply algorithms in practice all by yourself.
I truly believe in this approach. It has worked for me and created the funnel through which I fell back into grad school and ended up completing a number of higher degrees.
If you like this approach and are interested in giving it a go, I have created a mini-course that guides you through each step.
The mini-course contains five modules of material. The first module gives you a grounding in the 6-step process of applied machine learning. The second module walks you through getting started in Weka step-by-step with tutorials. Finally, modules 3-to-5 are detailed case studies with tutorials and video examples of working through three real-world problems. Specifically, predicting the onset of diabetes, free electron structure in the ionosphere and the recurrence of breast cancer.
The course is called Beginning Weka: Discover Applied Machine Learning and it comes in a professional version with the example videos and another version without. You can learn more about it here.
Bonus: If you leave a comment explaining why you are interested in the course, I will email you a coupon to get 20% off the professional version of the course.