Jason Warner, CEO of poolside, discusses the advantages of code-specialized LLMs over generalized models such as GPT-4, highlighting their significant value to the coding community. He also predicts a paradigm shift to an “AI-led, developer-assisted” approach, driven by the potential of specialized LLMs to accelerate the development of AGI through their training on complex, multi-step logic code.
About Jason Warner
Jason Warner is the Co-Founder and CEO of poolside. Prior to founding poolside, Jason was a Managing Director at Redpoint Ventures on the early growth team. He also served as the CTO at GitHub, where he was responsible for bringing products like GitHub Actions, Packages, Advanced Security, Connect, and Codespaces to market, as well as incubating experimental projects like GitHub Copilot. Before joining GitHub, Jason was the VP of Engineering at Heroku. Additionally, he oversaw Product Engineering for Ubuntu Desktop and Ubuntu Phone at Canonical. Jason graduated from Penn State University with a Bachelor's in Computer Science and received a Master of Science from Rensselaer Polytechnic Institute.
Overview
Jason explores the evolving landscape of artificial intelligence, focusing on the contrast between specialized large language models (LLMs) like poolside's and more generalized ones such as GPT-4. He likens specialized LLMs to a robust Ford F-150, highlighting their targeted efficiency in coding tasks, in contrast to the broader capabilities of a Toyota Camry-like GPT-4. Jason's comparison illuminates how specialized models are transforming coding interactions, offering tailored solutions that significantly outperform their generalist counterparts.
Jason also dives into the potential shift in the developer-AI dynamic, envisioning a future where AI leads, supported by developers. It’s a shift that inspired the naming of his company, poolside. This change, he believes, will be driven by the advanced capabilities of specialized LLMs trained on complex, multi-step logic code, like those developed by poolside. Such models promise to boost coder productivity and creativity, pushing the boundaries towards achieving artificial general intelligence (AGI) by redefining software development roles and enhancing AI-developer collaboration.
The episode highlights Jason’s ability to articulate complex ideas about AI’s future in coding. It emphasizes poolside’s unique approach, focusing on high-quality code and a training set refined for relevance and impact. This ensures their LLM understands not just code, but the broader context of software development, promising a significant leap in how AI aids in coding.
Items mentioned in this podcast:
- poolside
- Fresh Ink: Hello, Poolside! At the intersection of AI and Software comes the system that will build all future systems
- GPT-4
- Redpoint Ventures
- GitHub
- Perplexity
- RLHF
- Anthropic Constitutional AI
- AlphaGeometry
- poolside co-founder, Eiso Kant
- OpenAI’s Q*
- Bridgewater
- The Score Takes Care of Itself by Bill Walsh, Steve Jamison, Craig Walsh
- Simple Sabotage by Robert M. Galford, Bob Frisch and Cary Greene
Did you enjoy the podcast?
- What skills should coders develop to stay relevant in a landscape dominated by specialized AI?
Podcast Transcript
Jon Krohn: 00:00
This is episode number 754 with Jason Warner, co-founder and CEO at poolside.
00:27
Welcome back to the Super Data Science Podcast. Today we are extremely fortunate to have the exceptionally gifted, and exceptionally visionary, Jason Warner on the show. Jason is co-founder and CEO of poolside, a hot venture capital-backed startup that will shortly be launching its code-specialized large language model and accompanying interface that is designed specifically for people who code, like software developers and data scientists. Previously, Jason was managing director at the renowned Bay Area VC Redpoint Ventures. Before that, he held a series of senior software leadership roles at major tech companies, including being CTO of GitHub.
01:02
Today’s episode should be fascinating to anyone keen to stay abreast of the state-of-the-art in AI today and what could happen in the coming years. In today’s episode, Jason details why a code generation-specialized LLM like poolside will be far more valuable to humans who code than generalized LLMs like GPT-4 or Gemini. And he also fills us in on why he thinks AGI itself will be brought about by a code-specialized ML model like poolside’s. All right, let’s jump right into our conversation.
01:28
Jason, welcome to the Super Data Science Podcast. It's great to have you on the show. Where are you calling in from today?
Jason Warner: 01:34
Thanks for having me. I’m calling in from my home in Victoria, British Columbia, Canada.
Jon Krohn: 01:39
Nice. And you're not Canadian originally, but you've found my home nation and fallen in love with it. I'll add something I said to you before we started recording: if I ever moved back to Canada from New York, it would be to the West Coast, like where you are… So I always say Vancouver, which is on the mainland, but you're in Victoria, which, slightly confusingly, is on Vancouver Island.
Jason Warner: 02:05
It's a great place to live, a great place to raise kids. It's a wonderful place to be. And yeah, I'm an American, though I'm now a Canadian permanent resident, and if I'm traveling abroad, I'll say, "I'm a Canadian." It's easier that way. But yeah, for people who don't know it, Victoria is a special place.
Jon Krohn: 02:22
I'm going to have to check it out someday soon. It does sound amazing. So we know each other through Kyle Daigle. He was in episode number 730. He's the COO of GitHub, and you used to be the CTO, so you two overlapped there. And he highly recommended you as a guest. When I asked him if there was anyone I should speak to, he said, "You've got to talk to Jason. He's doing such exciting things." I looked at what you were doing and thought, yeah, we've got to get Jason on the show right away. So you're co-founder and CEO of poolside, which is funded by Redpoint Ventures.
03:01
So for a couple of years you worked at Redpoint, in between GitHub and founding poolside. It's coming up on a year since you founded it. So you're co-founder and CEO, and the thesis, from what I've read doing my research, and you're welcome to correct me, is that there is tremendous value in large language models and in-the-flow tools for developers. Now, I have a lot of experience using GPT-4 for code generation as part of ChatGPT, and you had this experience in venture capital, so obviously there's a market there. Fill me in, fill my audience in, on why a tool, a large language model specifically for code like you're working on at poolside, can be better than the general approach, given that I'm quite happy using something like GPT-4 in the ChatGPT interface.
Jason Warner: 04:04
So let's start with a different way of trying to describe this. Imagine, if you will, GPT-4, which is, as far as I'm concerned, the gold standard. It is by far the best in almost every way possible at what it does. But let's just call GPT-4, for the moment, the Toyota Camry. It is a vehicle. It is the bestselling sedan in the world, and is a general purpose vehicle. It can take you to work, go on vacation, haul your family around, go get groceries. But imagine all of a sudden, because it's the only vehicle in the world at the moment, you start abusing it for things that it really wasn't built for. You put a tow hitch on it, you start to pull loads, larger and larger loads over time. And out of necessity, you start using it on the farm or on the job site, or you start tuning the s*** out of this thing and start racing it on the racetrack.
05:09
That works because it's the only thing out there. And then you have other people coming into the world building other versions of sedans, saying they're slightly different. So Anthropic's building the Honda Accord; open source models might be the Hyundai Sonata, or whatever versions of these things exist. And developers are the ones who are putting the tow hitch on this, or trying to have it haul tons of hay, or using it on the job site, if you will. Well, we're introducing a new vehicle type. It's still a large language model with applications that are built on top of it, but it's a new vehicle type, and it's the Ford F-150. And it is specifically built for those environments and those orientations and those jobs. And you could still abuse us. You could say, "Hey, well I'm going to take you on a 10,000-mile road trip."
06:03
I'm like, "Hey, you know what? That's not what we're for." Yeah, we're going to be capable of doing that, but you might be more comfortable going to get the newly introduced other vehicle type, like the minivan. And that's what's going to happen in the world over the next couple of years: not one vehicle type is going to exist to serve all purposes. You're going to start to have specialty vehicles introduced. Now, we happen to believe that we are not a fully specialized vehicle; we're more of a general special-purpose one, the way a truck is a very different general special-purpose vehicle than something like a dump truck, as an example. So that's how I'm trying to explain to the world what we're doing. These general purpose models are all sedans. We're introducing the world to a truck for the very first time, and that's what we're doing.
Jon Krohn: 06:51
That is an amazing analogy, and you can tell that you have a lot of experience either listening to you or giving pitches, because that was probably, of the hundreds of guests that I’ve interviewed on the show, that was probably the clearest analogy describing what somebody’s doing in their AI company that I’ve heard. So, crystal clear. You’re building the F-150. When will my listeners, roughly, be able to get their hands on an F-150?
Jason Warner: 07:21
Q2 of this year. That’s when we want to launch.
Jon Krohn: 07:24
Oh wow.
Jason Warner: 07:27
This is a sprint now for us to get there. We’re working with design partners right now, and we’re showing them the applications that we’re building on top of everything, and we’re in the middle of model training at the moment. So Q2 is our target. Who knows? We just… When this is being recorded, things get announced. Meta just announced that they’re trying to soak up every single GPU in the world at the moment. So things might change a little bit, but that’s our target, Q2 this year.
Jon Krohn: 07:58
Very cool. Well that’s exciting. And are you able to let us get a glimpse as to how modeling outputs might be different with an F-150 versus trying to use a Camry for code generation? Or are you able to give some insights for us into what the user experience would be like using poolside as opposed to using a ChatGPT interface?
Jason Warner: 08:22
A bit.
Jon Krohn: 08:23
Makes perfect sense. Totally understand.
Jason Warner: 08:26
And so what we're trying to show is that there is a moment in time that we exist in right now, which is why an analogy like the Camry versus the Ford F-150 makes sense, because we are all entering this land at the same time, at the same starting point, but we're diverging in the woods. The further we diverge in the woods as we walk these different paths, the more obvious the terrain differences start to look. But we are literally only a couple of steps into this park together. OpenAI has, let's say, a couple of meters on everybody else at the moment, but the point being, we're looking roughly at the same landscape. From an experience perspective, what we care about is the future of software. So let me break down how I think about product development, and we'll get to why I'm going to answer the question this way.
09:23
I believe, similar to what Geoffrey Moore said a long time ago, there are areas of your product, in the moment and then long-term, where you think about differentiation, neutralization, and then optimization. Our model out of the gate will be differentiated. It'll be a different experience; you're going to feel it differently when people use the model. With the applications we build on top of it, which are akin to Copilot or ChatGPT, we're going to neutralize more than not on that experience. And then optimization, for us as a startup, is not something we think about on a regular basis. That's for the big guys to think about. So our model will be differentiated. Our experience out of the gate will be more neutralized. So if you've used GitHub Copilot, you kind of understand what we're building. If you've used ChatGPT, you'll understand our web interface. Perplexity is probably closer to how we're thinking about our web interfaces, but you get an idea of what to experience there.
10:17
From a model perspective, there's a massive, massive difference between us and others in that, one, something that's tuned for software will know more about software. And what I mean by this specifically is, if Anthropic and OpenAI… OpenAI has made famous reinforcement learning via human feedback. Anthropic has made famous reinforcement learning via constitutional AI, or algorithmic AI and things of that nature. We're introducing something that we call reinforcement learning via code execution feedback, taking advantage of the aspects of software and code that you might imagine: one, it's inspectable; two, it's runnable; and three, you have to compile or execute these things, and you can get deterministic feedback. And so inside of our training set, we've made very, very, very, very different decisions than general purpose models would make.
11:08
And this goes to the heart of why a truck is different than a sedan: we've made very different design decisions. We have included only high-quality code in the model. We have cut data sources out of the initial dataset, because it's a very different audience that's going to use this for a very different purpose. And this goes to the reinforcement learning side of the fence, too. We care very deeply that, yeah, we'll have human feedback as well, but the reinforcement learning is going to be all about what's produced from the software side. So we've taken about 50,000 high-quality, real-world projects out of the initial training dataset and are using them on the reinforcement side. We've made all the git commits executable. We have all three legs of the stool that we need: we've got the issue that's described in real language, we've got the code, we've got tests, we've got all of the things that we need, and we send this through a reinforcement learning platform on our model to see what it does.
12:03
And we start to make our model better. We're effectively teaching our model how to code, sequentially, over time. And this is how you're going to experience the difference here. For one, it's just going to be orders of magnitude better. If you just think about this as code completion, code suggestion, it'll be better. But more than that, it's going to know more, because it'll have experienced more of not just code, but software, the entirety of software. That said, there's one even further unique difference between us and how most developers in the world will experience this at some point, because most developers in the world work inside corporations. And for people who use GPT-4 today inside hobby projects or online, we're going to have a version of that as well. It'll be poolside; you'll be able to use it in a SaaS offering.
12:50
Our platform will be out there, and people can move over. Replit might want to switch to our model at some point. All these online tools might want to switch to our model at some point. But we're also going to bring our entire stack, the model, the platform, the applications, to enterprises, inside their environments, to fine-tune on their data. So all their code bases, all their knowledge repositories will be fine-tuned on in a way that we never see the code. We never see these things. They don't make it back to poolside; they never make it back into our models. It's bespoke to them. It's all for them. And we do this, obviously, because that's where every organization in the world is going to get the lift. Our model will be the best in the world for software development, but contextually it now has all of your information. It's aware of your code names, your coding styles, your own homegrown programming languages, your customized web frameworks, and any of those sorts of things.
13:49
Heck, at some point you’re going to change your policy management to hook into poolside because that’s going to be how these things work in the future. So I just spoke for a while trying to explain how people are going to feel differently when it manifests in the world, but it goes to why this should exist in the first place. The model will be better because it’s specifically oriented for software development. The go-to market will matter because now finally, for the first time, enterprises can actually use one of these things in their environments. And make no mistake, enterprises should not be using models that are ever going to train on their outputs. And thirdly, developers are going to finally get access to these things in enterprises.
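[Editor's note] Jason's "reinforcement learning via code execution feedback" hinges on the property he lists above: code is runnable, and running it yields deterministic feedback. A minimal sketch of that idea, purely illustrative and not poolside's actual system (the function name, reward values, and toy candidates are all assumptions), might look like this in Python:

```python
import subprocess
import sys
import tempfile

def execution_reward(candidate_code: str, test_code: str) -> float:
    """Run a candidate solution against its tests in a fresh subprocess.

    Returns 1.0 if the program exits cleanly (all asserts pass) and 0.0
    otherwise -- the deterministic pass/fail signal that code, unlike
    free-form text, can provide.
    """
    program = candidate_code + "\n\n" + test_code
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(program)
        path = f.name
    result = subprocess.run([sys.executable, path], capture_output=True, timeout=10)
    return 1.0 if result.returncode == 0 else 0.0

# A hypothetical model proposes two candidates for an absolute-value function:
candidates = [
    "def my_abs(x):\n    return x if x >= 0 else -x",  # correct
    "def my_abs(x):\n    return -x",                   # buggy
]
tests = "assert my_abs(3) == 3\nassert my_abs(-3) == 3"

rewards = [execution_reward(c, tests) for c in candidates]
print(rewards)  # [1.0, 0.0] -- the correct candidate is rewarded, the buggy one is not
```

In a real pipeline the reward would feed a reinforcement learning update rather than a print statement, and the tests would come from the kind of issue-plus-code-plus-tests triples Jason describes, but the deterministic verdict at the core is the same.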
Jon Krohn: 14:27
You just did my job there because my job is to summarize back to you what you just said, but you just did it perfectly. So tick, tick, tick. And so then I’ll just say that I love… I can see how from an investor’s perspective, having that enterprise play at the end there, fine-tuning on-prem on their data, that’s going to be nice and sticky. And it’s a brilliant, brilliant strategic choice to make there in developing your product. And I’m sure your investors will appreciate it as well. Awesome. So loved that introduction to poolside. It occurred to me as you were speaking that maybe I understand now the poolside name as well, which is maybe this idea of a developer sitting poolside, maybe with a cocktail or something, while code is being written for them and they can be enjoying life.
Jason Warner: 15:28
So I have a firm view… Let me back up a little bit for the folks who aren't familiar with me, and I'll give a little bit of background; a lot of this will make sense then, too. But you're dead on on the name, for what it's worth. My background: right now it's poolside. Before this, I was an investor at Redpoint, as we mentioned in the intro, where I was investing in AI and infrastructure things and then incubated poolside to roll out. Before that, I was the CTO at GitHub; two years pre-Microsoft acquisition, two years post. Before that, same thing at Heroku, which is a platform as a service, now with Salesforce, one of the most popular [inaudible 00:16:04] in the world, even to this day. And before that was Canonical, the people who make Ubuntu Linux. So my entire user base, customer base, for the past whatever, 15 years of my life, has exclusively been developers.
16:17
And all I really think about is the future of software. And I think about theoretically possible workflows, from every time a developer touches a key on their keyboard to what happens in production. The theoretical best possible flow there is: every time a developer touches a key, something happens in production, collapsing the entire chain down to as narrow, as collapsed a view as possible. And it's not possible to get to that level, but you can understand, when you think about the future of software, what it looks like, with poolside, with AI in general, and what the world is going to look like in the coming years. And make no mistake, bad metaphors aside, this is literally top-of-the-first-inning, first-pitch-thrown territory where we are right now.
17:07
And I don't mean "we" as in poolside; I mean we as an industry, as a world, with what's about to happen here. And from an investor standpoint, I don't think any investors really fully recognize this, because what I'm about to say might blow some people's minds, make them think I'm absolutely crazy. But we believe that we're in a world right now where we're effectively giving all the developers around the world AI assistance. So it's developer-led, AI-assisted. And these AI systems are going to get better and better and better. They're going to have more and more agency. We may or may not give them autonomy, but we're basically going to sit there and say, "Hey, here's the world's best pair programmer or assistant next to you." At the moment, if you give a senior developer any of the tools on the market, it's like a power tool.
17:51
If you give a junior developer any of these, it actually could be disastrous. So there's this wide gap between who is using these tools and how effective they are. In the coming years, what will happen is, as we add more agency to these things, and as these tools earn the trust of the humans involved, it will become AI-led, human-assisted. AIs will start to do more and more and more. You could envision a world in which, as an example, what poolside might do in the future, when I think about the future of software, is: you log in to poolside inside your enterprise, and all your Jira tickets, all of your GitHub PRs, all of your Valgrind runs or your debug runs, all your pre-production check flights, whatever internal systems and checkpoints you use, are all amalgamated into one view, and poolside's gone through and said, "Hey, I've taken some liberties on your side. I've decomposed a Jira ticket, or I saw somebody rejected your PR, so I've made a change. Would you like to resubmit it?"
18:55
So that's AI-led, human-assisted: you get to decide, the AI did something for me, do I want to go forward with it? And you can start to see how that's going to evolve, how that might feel to somebody. And this isn't about automatically writing documentation or automatically writing unit tests. All those things are features. They aren't actually real platforms; they're not real products. Those things will go away in a couple of years. Those are all feature sets of these autonomous AIs, and that will give way, eventually, when these AIs are so smart, to 90% of people in the world being able to write 90% of software. Not everything, but 90% of software will be able to be written exclusively by someone just interacting in an NLP style with these AIs. We call that AI-led, human-assisted.
Jon Krohn: 19:37
I totally follow you. I buy into your vision. I don’t think you’re crazy at all. I think that that is absolutely where we’re going with this. And then you’ve also now started to answer what was going to be my next topic area. You’re segueing into it perfectly. This is a guest-led, host-assisted episode, because the next thing that I wanted to ask you about is, I read, and so this is a quote from poolside. It says, “poolside aims to unlock humanity’s potential by pursuing AGI, artificial general intelligence, for software creation with the fundamental belief that the transitional path for humanity to AGI is by building for specific capabilities instead of a general purpose approach.”
20:26
So you've now started to give us some of the picture, I think, into why you believe that, and that belief is maybe not the most popular perspective among people who are developing AGI. DeepMind, for sure, at least at this time, is taking the tack of: let's build a system that can do one task well, then have that same algorithm do well on parallel tasks, and then let's see how it does on even less related tasks. So they are going from specific to general deliberately, and it sounds like you are taking the opposite approach. So fill us in more on your thinking there.
Jason Warner: 21:12
So there are a couple of general thoughts here which lead us to this. And I'm going to be super transparent, because the listeners here… I doubt there are too many investors who might be listening to this, but I think it's fun for people to understand how my mind works and how… Eiso Kant is my co-founder. And for those who don't know him, he should come on here and talk about some of the specifics that we're dealing with as well.
Jon Krohn: 21:36
I’d love that, for sure. And we also do… At least I know personally, at least in the New York area, I don’t know about Bay Area or other parts of the world, but in New York I frequently run into investors who listen to my show.
Jason Warner: 21:49
So, investors, if I insult you in the next couple of minutes, I apologize. How's that? But here's our view of this. First, let's just conceptually ask: does the world need another sedan? Yeah, there's always room for more sedans. There's always room for another one of those. But do we have a unique viewpoint? Is it worth our time, energy, and effort to go do this? Our unique viewpoint was that a new vehicle is needed. A new vehicle is needed because, one, of a market opportunity. Yes, 100%, we think of it as a market opportunity. In fact, we think that the software side of the fence is actually an unlimited TAM, just like the general intelligence side of this is effectively unlimited TAM. That allows us some liberties. One of the liberties is that we could self-fund this company to pursue AGI.
22:42
We don't have to rely on the magnanimity of these massive institutions to throw billions of dollars into us in that same way. We will have to build a real business; we will build a real business, and we can self-fund this to a degree. That's the practical side of that answer. That's the very methodical, "we're going to build one of those businesses" side of the answer. The other side of the answer is that we think it's at least as viable, and it's important that I state this, at least as viable, to start here. Now, we also happen to go a bit further: we think it's a more viable, probably faster, cheaper path. And the reason why is, if you deconstruct what it means to develop software, think about what it takes to actually build software. You have to have elements of planning. You have to solve for elements of understanding and reasoning, and you have to be able to hold these concepts in your head. It's not literally just about all of the quote/unquote "next token" type of stuff.
23:40
You actually have to hold these things. Your context windows have to get larger, because you've got to hold it all available to you in your head. Let's take AI out of it, let's take LLMs out of it for a second, and talk about the best developers we've ever met in our lives. What do we always say about them? They can hold these things in their head while they're rotating them or deconstructing them; they can zoom in, change a couple of things, go back out, and it largely works well. What are we actually describing when we talk about those folks? And a lot of the papers that have been released recently, like the AlphaGeometry one from DeepMind as an example, all point to the fact that when you have LLMs married with effectively some structured elements to use, you get a much quicker, broader path towards intelligence.
24:32
And so what is the area of the world where we have the most data that has some structure to it? Well, in my view, it's software, it's code. You can talk about mathematics or physics, and if you think about these from a pure, pure, pure, pure definitional perspective, pure conceptual math is all the way to one side. Then you've got physics, and you've got computer science, and they're all sub-elements of math. But we just happen to have all of this data on computer science, the most copious amounts of data in the world. And so that's part of our thinking there. It's exactly that; it's like we're growing into that area, and it's conceptually true. Now, we also hold some other views, which is that it's likely not going to come from this wave. We don't know the answer to this question, but it's not likely that you're going to get to that level in this wave.
25:21
We actually hold something probably a little bit closer to Yann LeCun's thinking here, which is LLMs as policy, moving on to a different architecture at some point. But the marriage of these things is actually what's going to get us to something reasoning, these AGI elements. But that's one of those views you start discussing and debating over beers at a conference. You can largely understand what I'm meandering around in this thought maze, though: by the very nature of what software is, it allows us to methodically structure our way to these points.
Jon Krohn: 25:54
It makes a ton of sense to me. You've absolutely sold me on this. A quick abbreviation explanation for our non-investor listeners: the TAM that Jason mentioned a couple of times there is the total addressable market, so, how much market is there for the product that you're building? And you quickly mentioned that, at the time of us recording this episode, the DeepMind AlphaGeometry paper had just come out. I'm planning on doing an episode dedicated to that coming up soon, but I'll have a link to that fascinating research; it's a huge advancement in AI reasoning and math. I'll have that DeepMind paper on AlphaGeometry in the show notes. But to now agree with what you were just saying, I totally get it. I hadn't thought about what you're saying before, but it makes perfect sense to me with what you're doing at poolside. So not only approaches like AlphaGeometry from DeepMind, but also the rumored Q* algorithm from OpenAI.
Jason Warner: 26:52
So Q*: what we know about Q*, we know a lot of folks over at OpenAI, so we can fortunately verify this. Q* and our approach are actually very similar in a lot of ways. And you can think about deconstructing what Q* has been rumored to be about. You can understand, when we start talking about planning and AlphaGo-style deconstructions of semantic trees, which are basically lines of code inside there, what we're going to experiment with. This is our version of research at poolside: autonomous code writing via these folded structures. This is what we're doing.
Jon Krohn: 27:32
That is absolutely fascinating. I didn't know that from doing the research for this episode, and wow, this has turned out to be an extremely fascinating episode. If listeners want to hear more about Q*, I dedicated episode number 740 to Q* and what's rumored about it. And so we know, based on papers that OpenAI has been publishing that seem to be related, and that Jason is nodding his head to, that it sounds similar to what y'all are up to at poolside, where you're deconstructing a multi-step problem into steps, and figuring out the best path to take at each of those individual steps.
28:14
And in some of the papers that OpenAI has published, which may or may not be related to Q* but probably are, they had to create a dataset of hundreds of thousands of step-by-step answers to math questions. There were tens of thousands of math questions with hundreds of thousands of individual solution steps, and those had to be created by humans. The advantage you're describing with taking a similar approach to code is that you don't need to create that by hand, because you already have it. You have way more abundant data, orders and orders of magnitude more abundant data. And because it's executable, you also know whether it's going to work.
Jason Warner: 29:00
Yep. When we dive into this and start talking about it, the lights go on for people. And for us, again, it's one of those moments where we do sound crazy, because we're taking a very different approach. But the entire purpose of what we're doing is that we're zigging when others are zagging, because, intentionally, we're not trying to build a sedan. We don't actually care about building the sedan; we care very deeply about building the truck. We think the truck is also a vehicle type that's going to serve a lot of purposes, and eventually they might converge. The Ford F-150 has a Raptor version, which you don't use on the farm. I'm abusing the analogy here, but you get what I mean. Software allows us a lot of liberties and a lot of advantages. And I think that we have underestimated how much advantage we can get out of this, which is also why small code-only models don't make sense.
30:00
Small code-only models for code completion don’t make sense anymore. If you exclusively care about code completion as a tool, then code-only models make sense. But if you care very deeply about what the future of software looks like, and the future of all of this stuff, you start to understand: poolside looks like OpenAI, just oriented toward a very specific, very different domain.
Jon Krohn: 30:24
And a domain that happens to potentially be a path to AGI. Very cool. So I want to be conscious of your time, and so this is absolutely fascinating. We could frankly talk about it for hours, and maybe we will have Eiso on in the near future to dig into this a bit more. But before I let you go, there’s another fascinating thing about what you’re doing today that I’d like to just talk about with the audience a little bit, and this is that you sit on the operating board of one of the oldest and most renowned hedge funds in the world, Bridgewater. So what is an operating board? And what do you do on that? How does that help this huge renowned hedge fund be more effective?
Jason Warner: 31:15
So this is new. I just joined officially late in 2023. Bridgewater is the world’s largest, oldest, and maybe one of the most storied hedge funds of all time, founded by Ray Dalio. Bridgewater is not a public entity, so the operating board is a way to mimic some of the public-company controls in a private entity as the company was transitioning from a founder-led organization with Ray to one that wouldn’t involve Ray in day-to-day business. If you go and look at who is on the operating board, it’s either current CEOs or presidents of major institutions, or former CEOs of some of the largest companies in the world, because it’s all about operations: understanding how to put the right controls, protocols, et cetera, in place.
32:08
My involvement with them is obviously going to be from the technical side of the fence and the artificial intelligence side of the fence. Bridgewater is probably one of the most talked-about funds in terms of the way they’ve used computation to do trading for a long time, but this wave is different. They have an initiative that is all about large language models and AI, and about applying those to their domain. And then there’s the pure technical side of the fence, too: scaling and operating technical businesses as a CTO for 20 years is what I’ve done, and applying intelligence to them is what every company in the world is going to have to do in the next 10 years. Bridgewater has a massive head start on that over most institutions in the world, but I lend a hand where I can.
Jon Krohn: 32:58
Fascinating. Very cool to think about and yeah, it can make a huge impact there for sure. Thank you, Jason. This has been… I knew this was going to be a good episode, but wow.
Jason Warner: 33:09
Thanks for having me.
Jon Krohn: 33:09
It’s just incredible. Before I let you go, I ask all of my guests for a book recommendation. Do you happen to have one for us?
Jason Warner: 33:16
Ah, book recommendation. I like to recommend some of the same ones a lot of people in Silicon Valley recommend, and there’s one I like from Bill Walsh called The Score Takes Care of Itself. I tend to think about this for people who are starting companies and operating them: you’ve got to focus on what matters and break it down. Everyone wants to talk about the score, but you’ve got to talk about what leads to those positive outcomes. So that’s a good one, but that’s not the one I want to recommend. The one I actually want to recommend is one I’ve been recommending for years, because I’m an organizational dynamics person. What I’ve done mostly in my career is product development, but I’ve come into organizations and made them better over time. And there’s this book called, I believe, Simple Sabotage.
34:03
And what it is, is a 1944 field manual from, I believe, the CIA, which was used for people behind enemy lines on how they could subtly sabotage their enemies. So it was all about, “Hey, if you’re working for us over there, what do you do? Well, here’s what you do in meetings. Here’s what you do with process. Here’s how you introduce bureaucracy. Here’s how you slow things down,” that sort of thing. And I think we do these things in our businesses all the time without knowing it. We’re literally doing them all the time. I think it’s worth everyone spending time reading this book and realizing that this was a weapon to use against your enemies, so guard against it creeping into your organizations as much as you possibly can.
Jon Krohn: 34:53
Wow, great tip. Great recommendation there. Simple Sabotage as well as earlier, The Score Takes Care of Itself. Jason, this has been a mind-blowing conversation for me. I’m sure a lot of our audience would love to know where they can follow your thoughts after this episode. What’s the best way to follow you?
Jason Warner: 35:11
Twitter. Jason C. Warner on Twitter is probably the best, although you’re going to get a mix of snarky comments, family updates, and then 30 tweet threads on how to make decisions inside companies.
Jon Krohn: 35:24
Nice. Sounds great. Jason, thank you so much for taking some of your precious time to speak to me and enlighten our audience. It’s so exciting to see what you’re doing at poolside, and hopefully in Q2, or not long thereafter, we’ll be able to check it out ourselves. It’s really tremendous what you’re up to, and I can’t wait to hear how the journey is coming along later on.
Jason Warner: 35:46
Thanks for having me, Jon. Appreciate it.
Jon Krohn: 35:48
Jason’s ability to convey massive, transformative ideas so clearly and concisely was inspiring to me and something for me to aspire toward. I hope you found today’s episode as fascinating and exciting as I did. Jason covered how the heavy-duty Ford F-150 of an LLM that poolside is training will be markedly more valuable to humans who code than the general-purpose Toyota Camry sedans of GPT-4, Gemini, and other generalized models. He also talked about how we’ll shift from a developer-led, AI-assisted paradigm to an AI-led, developer-assisted one in the coming years, and how the wealth of executable, multi-step logic code available to train an LLM on could be the shortest path to realizing virtuoso AGI. Wow, what a mind-expanding concept.
36:30
That’s it for today’s excellent episode. If you enjoyed it, consider supporting the show by sharing the episode with people who you think might like it, reviewing it on your favorite podcasting platform or YouTube, or subscribing if you don’t subscribe already. But most importantly, we hope you’ll just keep on listening. Until next time, keep on rocking it out there, and I’m looking forward to enjoying another round of the Super Data Science podcast with you very soon.