Welcome back to the Five-Minute Friday episode of the SuperDataScience Podcast!
This week, discover Jon’s extensive library of machine learning content and learn why Jon’s Machine Learning House forms the building blocks to ML and DL mastery.
Show All
If you tuned in last week, you’ll be familiar with all the possible avenues to begin learning Jon Krohn’s deep learning curriculum. It was while developing this content that he realized many were missing the foundational subjects that underlie machine learning in general, and deep learning in particular. So after publishing all his deep learning content, he set to work on creating content that covers these critical subjects for understanding machine learning at an expert level — namely, linear algebra, calculus, probability, statistics, and computer science.
In this week’s episode, he outlines the foundational skills and topics that he refers to as the Machine Learning House. Together, these blocks form the knowledge structure of an outstanding data scientist or ML engineer.
When the foundations of the Machine Learning House are firm, it also makes it much easier to make the jump from general ML principles to specialized ML domains such as deep learning, natural language processing, machine vision, and reinforcement learning. Unlike the deep learning content that Jon provided an overview of last week, his ML Foundations content is still under development but will eventually be available for free – stay tuned.
ITEMS MENTIONED IN THIS PODCAST:
- SDS 474: The Machine Learning House
- Machine Learning House
- Where and When section of Jon’s ML Foundations GitHub repo
- Jon’s O’Reilly’s Linear Algebra for ML
- Jon’s O’Reilly’s Calculus for ML
- Jon’s O’Reilly’s Probability and Statistics for ML
- Jon’s O’Reilly’s Data Structures, Algorithms, and Machine Learning Optimization
- Linear Algebra for Machine Learning
- Calculus for Machine Learning
- Mathematical Foundations of Machine Learning
- Jon’s Newsletter
DID YOU ENJOY THE PODCAST?
- What foundational subjects from the Machine Learning House could you improve on today?
- Download The Transcript
Podcast Transcript
(00:05):
This is Five-Minute Friday on My Machine Learning Content.
(00:19):
For last week’s Five-Minute Friday episode, I provided a summary of the various methods of undertaking my deep learning curriculum, be it via YouTube, my book, or the associated repository of GitHub code. I mentioned at the end of the episode that while teaching this deep learning content to students online and in-person, I discovered that many folks could use a primer on the foundational subjects that underlie machine learning in general and deep learning in particular. So after publishing all my deep learning content, I set to work on creating content that covers these subject areas that are critical to understanding machine learning expertly — namely, those subjects are linear algebra, calculus, probability, statistics, and computer science.
For last week’s Five-Minute Friday episode, I provided a summary of the various methods of undertaking my deep learning curriculum, be it via YouTube, my book, or the associated repository of GitHub code. I mentioned at the end of the episode that while teaching this deep learning content to students online and in-person, I discovered that many folks could use a primer on the foundational subjects that underlie machine learning in general and deep learning in particular. So after publishing all my deep learning content, I set to work on creating content that covers these subject areas that are critical to understanding machine learning expertly — namely, those subjects are linear algebra, calculus, probability, statistics, and computer science.
(01:04):
Way back in Episode #474 of this podcast, I detailed why these particular subject areas form the sturdy foundations of what I call the Machine Learning House. As a quick recap, the idea is that to be an outstanding data scientist or ML engineer, it doesn’t suffice to only know how to use machine learning algorithms via the abstract interfaces that the most popular libraries like scikit-learn and Keras provide. To train innovative models or deploy them to run performantly in production, an in-depth appreciation of machine learning theory may be helpful — or even essential. And, to cultivate such an in-depth appreciation of ML, one must possess a working understanding of the foundational subjects, which again are linear algebra, calculus, probability, statistics, and computer science.
Way back in Episode #474 of this podcast, I detailed why these particular subject areas form the sturdy foundations of what I call the Machine Learning House. As a quick recap, the idea is that to be an outstanding data scientist or ML engineer, it doesn’t suffice to only know how to use machine learning algorithms via the abstract interfaces that the most popular libraries like scikit-learn and Keras provide. To train innovative models or deploy them to run performantly in production, an in-depth appreciation of machine learning theory may be helpful — or even essential. And, to cultivate such an in-depth appreciation of ML, one must possess a working understanding of the foundational subjects, which again are linear algebra, calculus, probability, statistics, and computer science.
(02:00):
When the foundations of the Machine Learning House are firm, it also makes it easier to make the jump from general ML principles, which I convey as the ground floor of the Machine Learning House, to make that jump to specialized ML domains, which I consider to be the upper floor of the house, and this includes topics like deep learning, natural language processing, machine vision, and reinforcement learning. And the reason why you need strong, firm foundations in order to understand these specialized domains is because, the more specialized the application, the more likely its details for implementation are available only in academic papers or graduate-level textbooks, either of which typically assume an understanding of the foundational subjects.
When the foundations of the Machine Learning House are firm, it also makes it easier to make the jump from general ML principles, which I convey as the ground floor of the Machine Learning House, to make that jump to specialized ML domains, which I consider to be the upper floor of the house, and this includes topics like deep learning, natural language processing, machine vision, and reinforcement learning. And the reason why you need strong, firm foundations in order to understand these specialized domains is because, the more specialized the application, the more likely its details for implementation are available only in academic papers or graduate-level textbooks, either of which typically assume an understanding of the foundational subjects.
(02:50):
So, there’s an introduction to what my ML Foundations content covers and why. Unlike the deep learning content that I provided an overview of last week, my ML Foundations content is still under development. But, also unlike some other components of my deep learning curriculum, my entire ML Foundations curriculum will eventually be available for free.
So, there’s an introduction to what my ML Foundations content covers and why. Unlike the deep learning content that I provided an overview of last week, my ML Foundations content is still under development. But, also unlike some other components of my deep learning curriculum, my entire ML Foundations curriculum will eventually be available for free.
(03:12):
Conveniently, you can check out the current state of affairs for my ML Foundations content in one place by visiting the Where and When section of my ML Foundations GitHub repo. On that note, all of the code for this curriculum is already complete and is available as open-source Jupyter notebooks within that GitHub repo.
Conveniently, you can check out the current state of affairs for my ML Foundations content in one place by visiting the Where and When section of my ML Foundations GitHub repo. On that note, all of the code for this curriculum is already complete and is available as open-source Jupyter notebooks within that GitHub repo.
(03:34):
The notebooks of code, however, are not intended to stand alone. They are intended to be accompanied by my lectures, which I first offered online via the O’Reilly learning platform in 2020. At the time, the world was under strict lockdown and with lots of data scientists and engineers stuck at home with seemingly nothing better to do than hang out with me online, these lectures ended up being some of the most popular lectures in the history of O’Reilly, with over a thousand students registering for each of them. That was an exhilarating experience for me and a welcome distraction from the pandemic for me too.
The notebooks of code, however, are not intended to stand alone. They are intended to be accompanied by my lectures, which I first offered online via the O’Reilly learning platform in 2020. At the time, the world was under strict lockdown and with lots of data scientists and engineers stuck at home with seemingly nothing better to do than hang out with me online, these lectures ended up being some of the most popular lectures in the history of O’Reilly, with over a thousand students registering for each of them. That was an exhilarating experience for me and a welcome distraction from the pandemic for me too.
(04:09):
In June of last year, the O’Reilly platform also became the first place where you could undertake my entire ML Foundations curriculum outside of live lectures. Specifically, I broke the curriculum up into the four subject areas: Six hours on Linear Algebra for Machine Learning; Six hours on Calculus for Machine Learning; Nine hours on Probability and Statistics for Machine Learning; and Six hours on Data Structures, Algorithms, and Machine Learning Optimization.
In June of last year, the O’Reilly platform also became the first place where you could undertake my entire ML Foundations curriculum outside of live lectures. Specifically, I broke the curriculum up into the four subject areas: Six hours on Linear Algebra for Machine Learning; Six hours on Calculus for Machine Learning; Nine hours on Probability and Statistics for Machine Learning; and Six hours on Data Structures, Algorithms, and Machine Learning Optimization.
(04:40):
To access this content, if you don’t already have an O’Reilly subscription personally or through your employer, you can get a free seven-day trial to check it out. And, I’m currently working with O’Reilly to obtain free 30-day trials for SuperDataScience listeners so stay tuned for that.
To access this content, if you don’t already have an O’Reilly subscription personally or through your employer, you can get a free seven-day trial to check it out. And, I’m currently working with O’Reilly to obtain free 30-day trials for SuperDataScience listeners so stay tuned for that.
(04:54):
However, remember that I did say that my ML Foundations curriculum would be available for free. For that, I’m recording my own personal version of all of the videos at home. The content available via O’Reilly was recorded in a professional studio with full-time professional staff. So yes, I’m recording these on my own personal version at home and I’m releasing that onto YouTube. All of the Linear Algebra content is already available via a playlist on YouTube and the Calculus content will be finished when we publish the final video to the playlist of over 50 videos next week. After that, we’ll start publishing the Probability videos and we won’t stop until my entire ML Foundations curriculum is freely available on YouTube.
However, remember that I did say that my ML Foundations curriculum would be available for free. For that, I’m recording my own personal version of all of the videos at home. The content available via O’Reilly was recorded in a professional studio with full-time professional staff. So yes, I’m recording these on my own personal version at home and I’m releasing that onto YouTube. All of the Linear Algebra content is already available via a playlist on YouTube and the Calculus content will be finished when we publish the final video to the playlist of over 50 videos next week. After that, we’ll start publishing the Probability videos and we won’t stop until my entire ML Foundations curriculum is freely available on YouTube.
(05:38):
That said, if you do feel like supporting my YouTube effort, you can buy my ML Foundations course on Udemy, which is often available at a deep deep discount; in US-dollar terms, it should be easy to spot a sale and grab it for under $20.
That said, if you do feel like supporting my YouTube effort, you can buy my ML Foundations course on Udemy, which is often available at a deep deep discount; in US-dollar terms, it should be easy to spot a sale and grab it for under $20.
(05:53):
Finally, I recently began work on a book version of this content. There’s so much ML Foundations material to cover that I need to break it up into several books. The first book will be called the Mathematical Foundations of Machine Learning and focus primarily on the Linear Algebra and Calculus subject areas. Pearson, the world’s largest publisher of university textbooks, will be publishing it and chapters should start to become available this year. If you’d like to stay up to date on book-release details and anything else I’m working on — be it code, YouTube videos, live in-person lectures, or podcast episodes — you can sign up for my email newsletter on jonkrohn.com.
Finally, I recently began work on a book version of this content. There’s so much ML Foundations material to cover that I need to break it up into several books. The first book will be called the Mathematical Foundations of Machine Learning and focus primarily on the Linear Algebra and Calculus subject areas. Pearson, the world’s largest publisher of university textbooks, will be publishing it and chapters should start to become available this year. If you’d like to stay up to date on book-release details and anything else I’m working on — be it code, YouTube videos, live in-person lectures, or podcast episodes — you can sign up for my email newsletter on jonkrohn.com.
(06:31):
I mentioned tons of links in today’s episode and as always you can find those in the show notes. All right, that’s it for today. I hope that provides you with lots of helpful content for continuing your data science journey, deepening your data science journey. In the meantime, keep on rockin’ it out there; I’m looking forward to catching you on another round of SuperDataScience very soon.
I mentioned tons of links in today’s episode and as always you can find those in the show notes. All right, that’s it for today. I hope that provides you with lots of helpful content for continuing your data science journey, deepening your data science journey. In the meantime, keep on rockin’ it out there; I’m looking forward to catching you on another round of SuperDataScience very soon.
Share on
Related Podcasts

August 15, 2025
Oz Katz
- Artificial Intelligence, Data Science
- 24 MINS

August 12, 2025
Julien Launay
- Artificial Intelligence, Data Science, Python
- 72 MINS