SDS 446: Getting Started in Machine Learning

Podcast Guest: Jon Krohn

February 18, 2021

Welcome back to the FiveMinuteFriday episode of the SuperDataScience Podcast! 

Today I’m discussing how you can get started in machine learning.
 

In last week’s episode, I answered questions regarding future-proofing one’s career. This week, I’m tackling your questions on learning machine learning, specifically if you’re starting from scratch. 

  • What is the best free learning path for those starting out in machine learning?
    – It’s impossible for me to resist pointing out that SDS is a great place to start for this, by creating a free account and checking out the four learning paths we have available there. If you’re looking for a single standalone course, Machine Learning A to Z on Udemy for a small price. Remember I’m a little biased towards this. I’d also recommend Data Quest. Another user recommended my own machine learning course which looks at a lot of prerequisite materials that you might find as part of a graduate-level study in machine learning.
  • What is one of the hardest concepts for you (John) to understand in machine learning?
    – There’s plenty of tricky stuff out there. While it may be easy for me to execute support vector machine methods with high-level Python code in scikit-learn, the details of how the underlying kernel trick works to enable support vector machines is something I sometimes forget. I’ve also been slow to understand how many of the deep reinforcement learning concepts work in theory. From hands-on experience, I can appreciate the DeepQ algorithm but cannot picture the underlying Q learning table updates in my mind. There’s always more to learn in machine learning.

In next week’s episode, we’ll build on today’s machine learning topics by discussing how to work on becoming a leader in machine learning.

ITEMS MENTIONED IN THIS PODCAST:
DID YOU ENJOY THE PODCAST?
  • How can you examine your own blind spots in machine learning and find ways to continue your machine learning education?
  • Download The Transcript

Podcast Transcript

(00:05):
This is Five-Minute Friday on Getting Started in Machine Learning. 

(00:19):
During last Friday’s episode, I answered questions from podcast listeners on the futureproof-ness of a data science career. In today’s episode, I’m handling questions on how to learn machine learning, particularly if you’re starting from scratch. 
(00:34):
More specifically, Chad Coon reached out to me in a tweet on Twitter to ask me: “What is the best free learning path for those starting to learn machine learning?”
(00:45):
All right, this is kind of a spammy and super-biased way to start the episode, which isn’t something I set out to do here on purpose I swear, but when you ask about learning paths in machine learning I can’t resist mentioning that you can create a free account on SuperDataScience.com and check out the four specific learning paths we have there. Roughly ranked from novice to expert in machine learning, they are: The Data Analyst — or Business Intelligence Analyst Track; The Data Scientist Track; The AI Engineer Track; and the Data Science Manager Track.
(01:24):
You can check out everything that’s there for free and then there’s a small monthly subscription fee to get unlimited access to all of the hundreds and hundreds of hours of content in the SuperDataScience.com platform.
(01:36):
If you’re looking for a single, standalone course to get started, the Machine Learning A-Z course, again by SuperDataScience, is 44 hours long and available via Udemy for typically a couple dozen US dollars or less (note that Udemy pricing is highly variable day-to-day) but assuming you get a discount day it’s a steal at a few cents per hour of video. Again, as host of the SuperDataScience podcast I obviously have a lot of bias on the SuperDataScience courses.
(02:04):
Another option — and one that I’m finally in no way biased about — is that if you’d like to interactively execute example code directly within a platform, I think DataQuest is pretty darn cool. They have a free tier to get started and then charge about $50/month for their Premium tier, which gives you access to everything.
(02:23):
In a Tweet responding to Chad’s original Twitter question on the best free learning paths in ML, the software engineer Matej Kovac kindly recommended my own ML Foundations course, which introduces the foundational subjects — linear algebra, calculus, probability, statistics, and computer science — that you’d typically learn in grad school before moving on to machine learning itself. I do personally think it’s the best place to start if you have the time to invest in these foundations but, again, I would be a big proponent of my own course, wouldn’t I?
(02:59):
Anyway, I’ve finished filming more than a quarter of my ML Foundations content and the first eighth is online now with more going up every week; I anticipate it will all be publicly available by the summer. The content is available for free on YouTube, but in partnership with SuperDataScience, it’s also available as a Udemy course, which includes fully-worked solutions to every exercise and a certificate of completion; and eventually — after I have all the content recorded and uploaded — I’ll focus on adding interactive quizzes and handy study sheets to the Udemy course as well.
(03:31):
Once you have a firm grip on all the prerequisite linear algebra, calculus, and probability theory, a classic book to completely master the advanced theory of machine learning is a book called Deep Learning by Ian Goodfellow, Yoshua Bengio, and Aaron Courville. It’s 100% free at deeplearningbook.org.
(03:54):
Finally, on my personal website, at jonkrohn.com/resources, I maintain tons of free, categorized resources — data sets, blog posts, videos, books, and so on — for getting started in machine learning all the way through to being an expert in particular machine learning areas like financial time-series analysis or generally staying at the cutting-edge of all the academic research.
(04:18):
Feel free to tag me in a LinkedIn post or in a tweet to let me know of your favorite machine learning resource — I’d love to be enlightened and, if I think it’s appropriate, will add it to the resources page of my website or maybe even talk about it on a forthcoming episode.
(04:33):
Ok, well, that was a long answer to one question but let’s tackle one more. Let’s wrap this episode up with a related question from Jousef Murad, who’s a brilliant German engineer and host of the Engineered-Mind podcast that I happened to recently be a guest on. Jousef asked me in a tweet: “What is one of the hardest concepts for you to understand in ML?”
(04:59):
Haha, well, let’s see here… there’s lots that’s tricky. So, while it’s trivially easy for me to execute support vector machine methods with high-level Python code, say in scikit-learn, I’ve forgotten the details of how the underlying kernel trick works to enable support vector machines to work under the covers.
(05:21):
And I also have been very slow to understand how many deep reinforcement learning concepts work in theory. For example, again through hands-on experience I appreciate how well the Deep Q-Learning algorithm works for complex, sequential tasks like playing video games, but I cannot picture the underlying Q-learning table updates in my mind yet… which definitely annoys me.
(05:47):
There’s always always always more to learn in machine learning, and that is part of why I find this field so bloody exciting.
(05:54):
All right-y, we’re out of time for this week. On next week’s FiveMinuteFriday, building on what we talked about today — getting started in machine learning — I’ll be back to discuss how to be a leader in the field of machine learning.
(06:06):
If you’d like to ask me your own data science or machine learning questions or anything at all, feel free to tag me in a post on LinkedIn or on Twitter where my handle’s @JonKrohnLearns, and I’ll aim to answer them via social media or perhaps on an upcoming SuperDataScience episode! See ya there soon. 
Show All

Share on

Related Podcasts