Books for Machine Learning with R

R is a powerful platform for data analysis and machine learning.

It is my main workhorse for things like competitions and consulting work. The reason is the large amounts of powerful algorithms available, all on the one platform.

In this post I want to point out some resources you can use to get started in R for machine learning.

Kick-start your project with my new book Machine Learning Mastery With R, including step-by-step tutorials and the R source code files for all examples.

Let’s get started.

Introduction to R

You might want to familiarize yourself with the platform and language before you start addressing your problems with machine learning.

I think the best way to familiarize yourself is to start addressing problems. The trial of real work will force you to learn what you must learn to solve your problem.A good reference can help you answer your “how do I…” questions.

R in a Nutshell

R in a Nutshell

R in a Nutshell

The book I read cover-to-cover when first starting out with R was R in a Nutshell. It walks you through the platform from installation, basic operations, data analysis and even some machine learning algorithms. I highly recommend it.

I chose it because it was a broad reference. I wanted to know a little bit of everything in the platform, so I would know where to look when I had a specific question.

Need more Help with R for Machine Learning?

Take my free 14-day email course and discover how to use R on your project (with sample code).

Click to sign-up and also get a free PDF Ebook version of the course.

Machine Learning

There is a wealth of machine learning algorithms implemented in R, many by the academics and their teams that actually developed them in the first place. This alone is a compelling reason to get started in R. Additionally, the data handling/manipulation and graphing tools are very powerful (although Python’s SciPy stack is catching up).

CRAN: Machine Learning and Statistical Learning

Not a book, but a great place you can start out is the Machine Learning and Statistical Learning view on CRAN maintained by Torsten Hothorn. It lists most of the R packages you can use for machine learning, grouped by algorithm and algorithm types.

It is a great place to start, but one thing that I think it could do better is point out canonical packages and to elaborate more on some of the wrapper packages available like caret.

Applied Predictive Modeling

Applied Predictive Modeling

Applied Predictive Modeling

Max Kuhn, an author to this book is the creator of the famous caret package. Applied Predictive Modeling is very practical and opens in the first part with a description of predictive analytics process and case studies. Parts 2 and 3 look at regression and classification algorithms and the final Part covers more advanced topics like feature selection.

It’s a thick book and an excellent reference, and I’m a a fan of this book. Also check out the companion website for relevant resources.

An Introduction to Statistical Learning: with Applications in R

An Introduction to Statistical Learning- with Applications in R

An Introduction to Statistical Learning- with Applications in R

This is the more accessible version of the classic “The Elements of Statistical Learning: Data Mining, Inference, and Prediction” and includes two of the same authors.

An Introduction to Statistical Learning opens with an introduction to Statistical Learning and concerns such as model accuracy and the bias-variance tradeoff. Chapters 3 and 4 looks at linear regression and some simpler classification algorithms. Following chapters look at cross validation, model selection before moving into non-linear regression, decision trees, SVM and finishing up with unsupervised methods.

The book is also available online for free from the authors webpage.

Practical Data Science with R

Practical Data Science with R

Practical Data Science with R

Practical Data Science with R has more of a data science spin than machine learning. Part 1 is introductory looking at loading data into R. Part 2 starts off with model evaluation and works through models in increasing complexity through k-NN, Naive Bayes, Linear Regression, clustering, association rules and SVM. Part 3 works through advanced issues like self-documenting scripts and presenting results.

Provides a good introduction with solid practical advice.

Machine Learning with R

Machine Learning with R

Machine Learning with R

Machine Learning with R provides an overview of machine learning in R without going into detail or theory. It also heavily uses case studies to demonstrate each algorithm. It opens with a brief introduction to machine learning and R and in data management in R. It goes on in subsequent chapters to cover k-NN, Naive Bayes, Decision Trees, Regression, Neural Networks, Apriori, and Clustering.

It finishes up with chapters on model evaluation, algorithm tuning and other advanced topics. A good feature of this text are the step-by-step sequences provided in each chapter, providing an actionable framework around the case studies.

Data Mining with R: Learning with Case Studies

Data Mining with R- Learning with Case Studies

Data Mining with R- Learning with Case Studies

After a quick introduction to R in the first chapter, Data Mining with R presents case study after case study. These include: predicting algae blooms, stock market returns, fraudulent transactions and classifying microarray samples. Each study explores various different data preparation, model building and model evaluation methods.

It’s a dense by valuable book if you’re looking for getting a feel for working through real problems.

Data Mining and Business Analytics with R

Data Mining and Business Analytics with R

Data Mining and Business Analytics with R

Data Mining and Business Analytics with R provides worked examples using R, but the examples are more business focused than scientifically focused, as in some other books. The chapters work through the key machine learning methods using R with smaller case studies throughout. The book finishes with some larger case studies on sentiment analysis in text and modeling network data.

Data Mining with Rattle and R: The Art of Excavating Data for Knowledge Discovery (Use R!)

Data Mining with Rattle and R- The Art of Excavating Data for Knowledge Discovery

Data Mining with Rattle and R- The Art of Excavating Data for Knowledge Discovery

Data Mining with Rattle and R provides an introduction to machine learning algorithms, although the twist is that uses the Rattle graphical environment. After the introductory material on loading and handling data in part 1, the standard machine learning algorithms are covered in part 2.

What I do like about the presentation of the algorithms is the standardized description that includes a tutorial, parameter tuning and command summary. I’m big fan of the consistent structured presentation of algorithms.

Summary

We have coved 7 popular machine learning books that focus on using the R platform.

The best advice I can give is to pick one and read it. Read it cover to cover, take notes and do the exercises. Like programming, using R is a practical skill that you can only build by practicing. Practice machine learning R.

Have I missed a Machine Learning book on R? Leave a comment and let me know.

Discover Faster Machine Learning in R!

Master Machine Learning With R

Develop Your Own Models in Minutes

...with just a few lines of R code

Discover how in my new Ebook:
Machine Learning Mastery With R

Covers self-study tutorials and end-to-end projects like:
Loading data, visualization, build models, tuning, and much more...

Finally Bring Machine Learning To Your Own Projects

Skip the Academics. Just Results.

See What's Inside

18 Responses to Books for Machine Learning with R

  1. Avatar
    yin June 30, 2014 at 9:19 pm #

    The book Machine Learning for Hackers is good. Also based on R.

    • Avatar
      jasonb July 2, 2014 at 10:39 am #

      Of course, I can’t believe I left this book off my list. Thanks yin!

  2. Avatar
    Santiago Cepas July 2, 2014 at 2:11 am #

    Another great post Jason, keep it up!

    Another good book I’ve been reading recently is “Doing Data Science: Straight Talk from the Frontline”.

    Accessible and practical and with lots of examples in R.

    • Avatar
      jasonb July 2, 2014 at 10:40 am #

      Thanks for the tip Santiago. I’m ordering a copy now.

  3. Avatar
    Max Barrass July 14, 2014 at 9:17 am #

    Nice summary Jason, great work as usual. Thanks.

  4. Avatar
    sumendar November 7, 2014 at 4:55 am #

    Nice post, These Books are really helpful for R programmers

    • Avatar
      jasonb November 7, 2014 at 5:07 am #

      Thanks sumendar

    • Avatar
      shan March 31, 2015 at 6:30 am #

      Hi…

      I am searching for ecopy of the following books .. I need to study for completing college assignment..If you have any soft copy pls share.

      Doing Data Science: Straight Talk from the Frontline

      Practical Data Science with R

      Data Mining and Business Analytics with R

      Data Mining with Rattle and R: The Art of Excavating Data for Knowledge Discovery (Use R!)

      ggplot2

      “Web Application Development with R using Shiny”
       by Chris

      Text Mining : classification, clustering and applications by Ashok N. Srivastava

      Multiple Regression and beyond by Timothy Z

      thanks…

  5. Avatar
    Rafi November 24, 2014 at 4:24 am #

    Nice post.

    I was wondering if you know of any books that helps guide you through building machine learning algorithms in R in the same way Andrew NG does with Matlab in his famous Machine Learning course?

    I know there are many algorithms built in to packages, but given that R handles linear algebra pretty well, it would be nice if there was book that helps develop intuition behind building algorithms with its syntax.

    I’m going through the ISLR book at the moment, and have to say that is an amazing book if you want to get started in Statistical/Machine Learning. It goes very well with these archived lecture videos from the eDX website;
    https://class.stanford.edu/courses/HumanitiesScience/StatLearning/Winter2014/progress

    There are even R sessions in there where the lecturers guide you through examples. I believe the course is due to begin again in mid-January 2015. I highly recommend it!

  6. Avatar
    yazhou hao May 21, 2015 at 9:42 am #

    Nice post.Maybe you will post another book list which is about python in machine learning.

  7. Avatar
    roni June 21, 2015 at 1:14 pm #

    What about Elements of Machine Learning. More advanced version of Introduction to Machine Learning.

    • Avatar
      Leo July 19, 2015 at 6:44 am #

      Roni,

      I have read ESL. It’s good for a graduate level course in Statistical learning. Applied Predictive Modeling combined with Intro to SL is the way to go if your aim is the application of machine learning methods.

  8. Avatar
    Santhan October 9, 2018 at 8:47 pm #

    Can you please suggest me any books or any source which explain probabilistic neural network, learning vector quantization in R.

  9. Avatar
    Stat1 May 7, 2020 at 11:38 am #

    The following book on Machine Learning also useful

    Machine Learning: An overview with the help of R software

    https://www.amazon.com/B07KQSN447

  10. Avatar
    Rajib Ahsan July 28, 2020 at 11:51 pm #

    hi,

    I would also recommend the following books.

    1. R in Action
    2. Machine learning with R, the tidyverse and mlr.

Leave a Reply