Gifted author and software engineer, Wah Loon Keng, joins the podcast for an in-depth look at reinforcement learning. From its history to limitations, modern industrial applications, and its future in the coming decades, Keng provides an in-depth introduction to deep reinforcement learning and explores the latest research and applications in the field.
Thanks to our Sponsors
About Wah Loon Keng
Wah Loon Keng is a Senior AI Engineer at AppLovin. He co-authored Foundations of Deep Reinforcement Learning and created SLM Lab – a popular Deep RL framework.
Overview
Wah Loon Keng joined Jon Krohn live in Manhattan for an introduction to deep reinforcement learning that delves deep into its rich roots in gaming. If you’re new to reinforcement learning, you’ll learn that it is the third category of learning that differs from unsupervised and supervised learning. The algorithm interacts with the environment, which changes the data, and so every time you run your algorithm, you’re using a new dataset.
At its core, Keng says that reinforcement learning is essentially “learning from trial-and-error” and compares it to a game-playing agent. And when it comes to differentiating reinforcement learning from deep reinforcement learning, Keng says the difference lies in the fact that deep RL is learning from a training set and then applying that learning to a new data set. RL meanwhile is dynamically learning with a trial and error method to maximize the outcome.
The way deep reinforcement algorithms are set up today makes them unable to reason, which is a significant inefficiency. And so even the most straightforward information is challenging to translate, says Jon.
Now that Keng and Jon have defined RL and deep RL and differentiated them from unsupervised learning and supervised learning, Keng dove straight into RL’s early beginnings and breakthroughs. From the ’80s and ’90s with Actor-Critic algorithms and TD-Gammon through to the game-changing emergence of deep reinforcement learning in the past decade with approaches like Deep Q-Learning, AlphaGo, and MuZero, Keng impressively walks us through the most critical developments that lead us to where we are now.
After a thorough history of RL and gaming, it was time for Jon and Keng to cover the limitations of RL today. While the goal is eventually to get to real-world complexity, moving from training in the virtual world to training in the real world involves passing several hurdles–some of which relate to the problems of generalization and sample inefficiency, says Keng. Other limitations include the cost of collecting data, the cost of running the algorithm and then deploying it in the real world. Despite these limitations, you still see deep RL applications within the robotics and logistics industries. Specific examples can include the scheduling of trains and the management of inventory.
Finally, Keng discussed an open-source framework he helped co-developed, called SLM-Lab. Keng’s framework provides a “right-out-of-the-box” agent for users to run their algorithms with, and can also plug into different environment packages.
As an AI engineer on a small team, Keng often finds himself working from “end-to-end,” he says. From understanding a problem to figuring out how to solve it, and coding the solution himself, Keng can do it all. And as far as the tools he uses daily, he likes to keep things simple: Python, PyTorch, and Kubernetes for deployment.
In the future, Keng hopes to see more useful robots, but stresses that the issues with sample efficiency and generalization must be addressed to see RL become more useful in the industry.
In this episode you will learn:
- What is reinforcement learning? [4:50]
- Deep reinforcement learning vs reinforcement learning [13:17]
- A timeline of reinforcement learning breakthroughs [16:17]
- The limitations of deep RL today [39:53]
- Deep RL applications [53:10]
- Keng’s open-source SLM-Lab framework [57:51]
- Keng’s responsibilities as an AI engineer [1:02:17]
- What is the future of RL? [1:08:05]
Items mentioned in this podcast:
- Udemy Business
- Alternative Data Academy
- Foundations of Deep Reinforcement Learning by Wah Loon Keng and Laura Graesser
- Deep Learning Illustrated by Jon Krohn, Grant Beyleveld, Aglaé Bassens