Stanford Convolutional Neural Networks for Visual Recognition Course (Review)

By Jason Brownlee on July 5, 2019 in Deep Learning for Computer Vision 23

The Stanford course on deep learning for computer vision is perhaps the most widely known course on the topic.

This is not surprising given that the course has been running for four years, is presented by top academics and researchers in the field, and the course lectures and notes are made freely available.

This is an incredible resource for students and deep learning practitioners alike.

In this post, you will discover a gentle introduction to this course that you can use to get a jump-start on computer vision with deep learning methods.

After reading this post, you will know:

The breakdown of the course including who teaches it, how long it has been taught, and what it covers.
The breakdown of the lectures in the course including the three lectures to focus on if you are already familiar with deep learning.
A review of the course, including how it compares to similar courses on the same subject matter.

Kick-start your project with my new book Deep Learning for Computer Vision, including step-by-step tutorials and the Python source code files for all examples.

Let’s get started.

Overview

This tutorial is divided into three parts; they are:

Course Breakdown
Lecture Breakdown
Discussion and Review

Course Breakdown

The course CS231n is a computer science course on computer vision with neural networks titled “Convolutional Neural Networks for Visual Recognition” and taught at Stanford University in the School of Engineering

This course is famous for being both early (started in 2015 just three years after the AlexNet breakthrough), and for being free, with videos and slides available.

The course was also popularized by interesting experiments created by Andrej Karpathy, such as demonstrations of neural networks on computer vision problems in Javascript (ConvNetJS).

Example From the Introductory Lecture to the CS231n Course

At the time of writing, this course has run for four years and most of the content from each of those years is still available:

The course is taught by Fei-Fei Li, a famous computer vision researcher at the Stanford Vision Lab and more recently as a Chief Scientist at Google. Through 2015-16, the course was co-taught by Andrej Karpathy, now at Tesla. Justin Johnson has also been involved since the beginning and has co-taught with Serena Yeung through 2017 to 2018.

The focus of the course is the use of convolutional neural networks (CNNs) for computer vision problems, with a focus on how CNNs work, image classification and recognition tasks, and introduction to advanced applications such as generative models and deep reinforcement learning.

This course is a deep dive into details of the deep learning architectures with a focus on learning end-to-end models for these tasks, particularly image classification.

— CS231n: Convolutional Neural Networks for Visual Recognition

Want Results with Deep Learning for Computer Vision?

Take my free 7-day email crash course now (with sample code).

Click to sign-up and also get a free PDF Ebook version of the course.

Lecture Breakdown

At the time of writing, the 2018 videos are not available publicly, but the 2017 videos are.

As such, we will focus on the 2017 syllabus and video content.

The course is divided into 16 lectures, with 14 covering topics in the course, two guest lectures on advanced topics, and a final video on student talks that is not public.

The full list of videos with links to each is provided below:

Do not overlook the Course Syllabus webpage. It includes valuable material such as:

Links to PDF slides from the lectures that you can have open in a separate browser tab.
Links to the papers discussed in the lecture, which are often required reading.
Links to HTML notes pages that include elaborated descriptions of methods and sample code.

Example of HTML Notes Available via the Course Syllabus

Must-Watch Lectures for Experienced Practitioners

Perhaps you are already familiar with the basics of neural networks and deep learning.

In that case, you do not need to watch all the lectures if you want a crash course in techniques for computer vision.

A cut-down, must-watch list of lectures is as follows:

Lecture 5: Convolutional Neural Networks. This lecture will get you up to speed with CNN layers and how they work.
Lecture 9: CNN Architectures. This lecture will get you up to speed with popular network architectures for image classification.
- Video
- PDF Slides
Lecture 11: Detection and Segmentation. This lecture will get you up to speed with image classification and object recognition tasks.
- Video
- PDF Slides

That is the minimum set.

You can add three more lectures to get a little more; they are:

Lecture 12: Visualizing and Understanding.
- This lecture describes ways of understanding what a fit model sees or has learned.
Lecture 13: Generative Models.
- This lecture provides an introduction to VAEs and GANs and modern image synthesis methods.
Lecture 14: Deep Reinforcement Learning.
- This lecture provides a crash course into deep reinforcement learning methods.

Discussion and Review

I have watched all the videos of this course, I think for each year it is made available.

Most recently, I watched all the lectures for the 2017 version of the course over two days (two mornings on 2x speed) and took extensive notes. I recommend this approach, even if you are an experienced deep learning practitioner.

I recommend this approach for a few reasons:

The field of deep learning is changing rapidly.
Stanford is a center of innovation and excellence in the field (e.g. the vision lab).
Repetition of the basics leads to new ideas and insights.

Nevertheless, if you want to get up to speed fast with deep learning for computer vision, the three lectures suggested in the previous section are the way to go (e.g. lectures 5, 9, and 11).

The course has a blistering pace.

It expects you to keep up, and if you don’t get something, it’s up to you to pause and go off and figure it out.

This is fair enough, the course is at Stanford after all, but it is less friendly than other courses, most notably Andrew Ng’s DeepLearning.ai convolutional neural networks course.

As such, I do not recommend this course if you need some hand-holding; take the other course as it was designed for developers, not Stanford students.

That being said, you are hearing about how CNNs and modern methods work from top academics and grad students in the world, and that is invaluable.

The fact that the videos are made freely available is a unique opportunity for practitioners.

Summary

In this post, you discovered a gentle introduction to this course that you can use to get a jump-start on computer vision with deep learning methods.

Specifically, you learned:

The breakdown of the course including who teaches it, how long it has been taught, and what it covers.
The breakdown of the lectures in the course, including the three lectures to focus on if you are already familiar with deep learning.
A review of the course including how it compares to similar courses on the same subject matter.

Do you have any questions?
Ask your questions in the comments below and I will do my best to answer.

23 Responses to Stanford Convolutional Neural Networks for Visual Recognition Course (Review)

Zerio March 20, 2019 at 7:34 am #

Very helpful, thank you

Reply
- Jason Brownlee March 20, 2019 at 8:39 am #
  
  Thanks, I’m happy to hear that.
  
  Reply
Colin March 20, 2019 at 2:07 pm #

Hi Jason,

I am a Taiwanese AI beginner, your tutorials always help me a lot.

Not only my thesis but also any AI tricks.

Now I am an AI engineer in Taiwan and hope one day I will be a AI master like you.

Thanks for your tutorials!

Best Regards

Reply
- Jason Brownlee March 20, 2019 at 2:36 pm #
  
  Thanks.
  
  Reply
MD ZAHIDUL ISLAM March 20, 2019 at 2:11 pm #

wow, I like it

Reply
- Jason Brownlee March 20, 2019 at 2:36 pm #
  
  THanks.
  
  Reply
Suri Parthasarathy March 20, 2019 at 2:26 pm #

Thanks Jason for your effort in compiling and sharing.. It is very useful.

Reply
- Jason Brownlee March 20, 2019 at 2:36 pm #
  
  I’m glad it helped.
  
  Reply
Abhi March 20, 2019 at 6:36 pm #

I like your books . Do you have any book for computer vision.

Reply
- Jason Brownlee March 21, 2019 at 8:00 am #
  
  Thanks!
  
  Yes, I hope to release one soon.
  
  Reply
Sushant Gautam March 22, 2019 at 2:44 am #

I like your concept.

Reply
- Jason Brownlee March 22, 2019 at 8:30 am #
  
  Thanks.
  
  Reply
Nitish March 22, 2019 at 6:31 am #

Hi Jason, do you plan to make a video tutorial on CNN projects which could be a good highlight to our CVs when we apply for internships ?

Reply
- Jason Brownlee March 22, 2019 at 8:43 am #
  
  I plan on writing a book on the topic, but no videos, here’s why:
  https://machinelearningmastery.com/faq/single-faq/do-you-have-videos
  
  Great suggestion about projects, thanks.
  
  Reply
Abdul Saboor Khan March 22, 2019 at 10:51 pm #

How can I get this course?

Reply
- Jason Brownlee March 23, 2019 at 9:29 am #
  
  The links are in the above post, for example:
  http://cs231n.stanford.edu/
  
  Reply
Sovit Ranjan Rath December 20, 2019 at 4:09 am #

Hello, Jason. Great article. I am also learning from CS231n now. In the article, you said that you take notes. May I know whether you take digital or pen & paper notes. If digital, then what resources do you use.

Thanks.

Reply
- Jason Brownlee December 20, 2019 at 6:51 am #
  
  Pen and paper.
  
  Reply
  - Sovit Ranjan Rath December 20, 2019 at 12:53 pm #
    
    Thanks for letting me know.
    
    Reply
Prabhav Kaula July 23, 2020 at 3:17 pm #

I sincerely thank you for helping me embark my journey in deeplearning and ai. I was curious about how to do the assignments of the course just by going through the slides and videos unofficially?
cheers!

Reply
- Jason Brownlee July 24, 2020 at 6:20 am #
  
  You’re welcome.
  
  You might have to do some independent reading and research.
  
  Reply
Mohammad Javad March 2, 2021 at 7:40 pm #

Hi Jason,

Thanks for this great review. I found another course on deep learning from MIT, “Deep Learning 6S191,” taught by Alexander Amini and Ava Soleimani, which dives deep into more deep learning models and provides some real-world examples of deep learning applications in Python notebooks, available at introtodeeplearning.com

Reply
- Jason Brownlee March 3, 2021 at 5:29 am #
  
  Thanks for sharing!
  
  Reply

Navigation

Stanford Convolutional Neural Networks for Visual Recognition Course (Review)

Overview

Course Breakdown

Want Results with Deep Learning for Computer Vision?

Lecture Breakdown

Must-Watch Lectures for Experienced Practitioners

Discussion and Review

Further Reading

Summary

Develop Deep Learning Models for Vision Today!

Develop Your Own Vision Models in Minutes

Finally Bring Deep Learning to your Vision Projects

More On This Topic

23 Responses to Stanford Convolutional Neural Networks for Visual Recognition Course (Review)

Leave a Reply Click here to cancel reply.