Tour of Real-World Machine Learning Problems

Last Updated on September 5, 2016

Real-world examples make the abstract description of machine learning become concrete.

In this post you will go on a tour of real world machine learning problems. You will see how machine learning can actually be used in fields like education, science, technology and medicine.

Each machine learning problem listed also includes a link to the publicly available dataset. This means that if a particular concrete machine learning problem interest you, you can download the dataset and start practicing immediately.

Real World Machine Learning

Real World Machine Learning
Photo by SMI Eye Tracking some rights reserved.

Most Popular Kaggle Datasets

These first 10 examples of machine learning problems were taken from the competitive machine learning website Popularity was based on the number of participating teams.

Most Popular Research Datasets

The next 10 machine learning problems are the most popular on the University California at Irvine Machine Learning Repository website that traditionally hosts machine learning datasets used by the machine learning research community.

Final World

We took a whirlwind tour of 20 real-world machine learning problems.

These are actual problems posed or investigated by science and business organizations around the world.

What’s even more exciting is that these diverse problems have publicly available datasets and are also widely studied and understood.

This means you can download the data right now and explore the problem by implementing your own model, or reproduce someone else’s from a paper or blog post.

24 Responses to Tour of Real-World Machine Learning Problems

  1. shivaprasad October 27, 2017 at 3:02 am #

    I am very much impressed by this article sir,really it helped like anything.thank you sir

  2. Paul January 18, 2018 at 9:10 am #

    Dear Mr. Jason,
    Hundreds of thousands of students decide to take up machine learning but more than half of this number get phased out due to the sheer fear of complexity of the subject but you on the other hand did a fantastic job explaining the subject with such ease. I just wanted to extend a warm gesture of gratitude. Thanks a lot for helping me and thousands of other like me. Thank you.

  3. Aimee November 28, 2018 at 11:57 am #

    Hi Jason! 🙂

    I’m planning on playing around with the poker data set above and was going to try it with LDA, CART and finally Gradient Boosted Decision Trees (GBDT) with XGBoost, but I’m concerned about the classification process since some hands could fit into more than one class. Ideally, you want to predict the best possible hand out of multiple possibilities so I wasn’t quite sure how this may be done. Logically, I guess, you’d somehow determine all possible classes a hand could fit in and then use the class with the greatest value as the final answer since the classes increase as the hand improves. Any suggestions on this approach? What other models would you suggest trying for multi-class classification?

    Thanks! Love your books so far!!! 😀

    • Jason Brownlee November 28, 2018 at 2:52 pm #

      Sounds like an intersting problem, sorry, I’m not familiar with it. I’m hesitant to make suggestions.

  4. Fredrick Ughimi February 13, 2019 at 6:54 pm #

    Awesome! Thank you, Jason.

  5. Santosh June 10, 2019 at 3:48 am #

    Hi Jason,

    Your knowledge is very vast and details over here are excellent. Thanks a lot.

    I was looking on Prediction models on Application behavior to predict like when Application may crash or when it can start behaving different.

    Any help on the same would be excellent.

    • Jason Brownlee June 10, 2019 at 7:38 am #

      Perhaps try searching on

      • Santosh June 11, 2019 at 4:10 am #

        Thanks a lot. Let me search over there.

  6. Gunasekaran September 6, 2019 at 2:05 pm #

    Thanks Jason for the wonderful tip. I am from a non Computer Science background, I hear cool things about Data science so i wanted to learn machine learning. But basically i just wanted to ask you few questions.I could see lot of POC’s, research projects and sample datasets to practice machine learning but :
    if i get a job as a Data scientist what level of work would i be doing?
    Is it using existing libraries and come up with model or invent new algorithms ?
    If the big companies have readymade drag and drop model readily available on the Cloud platforms what is the need for a data scientist there ?

  7. Santosh September 8, 2019 at 5:34 am #

    Thanks Jason for all the inputs on ML. I was browsing through different study material but could not get the info like how a ML model stores the Info of a Trained Model. Is it Binary which is created post Pickle or it has its own Database where it memorize the pattern to predict on next data set?

    Any study material would be helpful. Thanks once again in advance.

    • Jason Brownlee September 9, 2019 at 5:07 am #

      Different models have a different internal representation.

      For example CART is a decision tree, a neural network is a set of weights, etc.

      The model specific representation is saved to file.

      Does that help?

      • Santosh September 17, 2019 at 1:30 am #

        This helped a lot.. Thanks. Where do we get this mapping as once models are Trained and saved using Pickle it stores as a Binary file.

        • Jason Brownlee September 17, 2019 at 6:32 am #

          If you use pickle, then the internal representation does not matter as pickle will handle the saving and loading.

          • Santosh September 17, 2019 at 3:41 pm #

            Thanks once again for you input.

          • Jason Brownlee September 18, 2019 at 5:55 am #

            You’re welcome.

  8. J Chouinard March 3, 2021 at 8:59 am #

    Thank you Jason for this post. It gives motivation to look at different applications of Machine Learning before diving into it.

  9. Suganya February 26, 2022 at 2:37 am #

    Hello Mr.Jason, Thanks a lot for sharing your intelligence with us. God will bless you for your good work.

    • James Carmichael February 26, 2022 at 12:30 pm #

      Thank you for the feedback Suganya!

Leave a Reply