(00:05):
This is Five-Minute Friday on SparseGPT.
(00:27):
“Large language models,” or LLMs for short, are super powerful because they’ve been trained on tons of data and — because they have billions of model parameters — they’re capable of performing remarkably well on a remarkably broad range of natural-language tasks. The best-known LLM, for example — GPT-3 — has 175 billion model parameters. If you’d like to learn all of the key info about GPT-3 and its capabilities, check out episode #559 with Melanie Subbiah — one of the first authors of the GPT-3 paper.
(00:59):
Today’s episode isn’t specifically about GPT-3, however. It’s about how massive these large language models are and how we can prune them to compress them. Pruning gives us a number of advantages: it increases real-time inference speed in production, decreases the model’s size in memory and storage, decreases compute costs, and, through a common machine-learning concept called regularization, it could potentially even improve the model’s generalization to real-world data that are structurally different from the data the model was trained on. So, in terms of speed, memory footprint, cost, and maybe even generalization, pruning a model is a good thing.
(01:39):
While there are even bigger models today, and while the forthcoming GPT-4 is rumored to be several orders of magnitude larger, GPT-3 — as the most popular large language model today, with its 175 billion model parameters — serves as a solid exemplar of LLMs in general. For context, 175 billion model parameters take up about 320GB of memory, so making production inferences with just one copy of GPT-3 requires about five state-of-the-art Nvidia A100 GPUs, which have 80GB of memory each. Since each of these GPUs costs about $15k, running just one copy of GPT-3 in production requires roughly $75k worth of GPUs alone.
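If you want to sanity-check that arithmetic yourself, here’s a minimal back-of-the-envelope sketch in Python. It assumes 16-bit (2-byte) weights, which is roughly how you get from 175 billion parameters to the ~320GB figure, and it uses the 80GB-per-A100 and roughly $15k-per-GPU numbers quoted above; it ignores activation memory and other serving overhead.

```python
import math

# Rough back-of-the-envelope estimate of GPT-3's serving footprint,
# assuming 2 bytes per parameter (16-bit weights) and ignoring
# activation memory and framework overhead.
n_params = 175e9           # GPT-3 parameter count
bytes_per_param = 2        # fp16 / bf16 weights (an assumption)
gpu_memory_gb = 80         # one NVIDIA A100
gpu_price_usd = 15_000     # rough per-GPU figure quoted in this episode

weights_gb = n_params * bytes_per_param / 1e9          # ~350 GB (~320 GiB)
gpus_needed = math.ceil(weights_gb / gpu_memory_gb)    # 5 A100s
total_gpu_cost = gpus_needed * gpu_price_usd           # ~$75,000

print(f"~{weights_gb:.0f} GB of weights -> {gpus_needed} A100s -> ${total_gpu_cost:,}")
```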
(02:34):
Clearly, GPT-3’s remarkable capabilities come with a chunky price tag. Thankfully, an exciting new paper from researchers at IST Austria on a parameter-pruning technique called SparseGPT indicates that 100 billion parameters — more than half of GPT-3’s full 175-billion-parameter complement — can be removed without adversely impacting GPT-3’s accuracy. Wow! This is a massive improvement over comparable methodologies for pruning LLMs on the scale of GPT-3. Specifically, the previous top GPT pruning approach, called magnitude pruning, can only prune about 10% of GPT-3’s parameters before accuracy begins to take a hit. So the previous best approach manages about 10%; this new SparseGPT approach manages more than 50%. That’s incredible; that’s a big reduction in model size.
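For a concrete sense of what that baseline actually does, here’s a minimal magnitude-pruning sketch in Python with NumPy. This is just the generic idea of zeroing out the smallest-magnitude weights in a layer, not the authors’ code or the exact procedure benchmarked in the paper.

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the `sparsity` fraction of weights with the smallest magnitudes."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)
    if k == 0:
        return weights.copy()
    threshold = np.partition(flat, k - 1)[k - 1]   # k-th smallest magnitude
    pruned = weights.copy()
    pruned[np.abs(pruned) <= threshold] = 0.0
    return pruned

# Example: prune 50% of a random weight matrix
w = np.random.randn(8, 8)
w_sparse = magnitude_prune(w, sparsity=0.5)
print(f"{(w_sparse == 0).mean():.0%} of weights are now zero")
```

Magnitude pruning looks at each weight in isolation, which is part of why it can only reach about 10% sparsity on a model like GPT-3 before accuracy suffers.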
(03:32):
Countless different pruning methodologies exist. Some of these techniques are applied before model training, some are applied post-training, and others — historically the best-performing — are iterative, applied throughout model training. SparseGPT is noteworthy not only because it can remove more than half of GPT-3’s model parameters without impacting accuracy, but also because it’s easier to apply than those historically best-performing iterative approaches. Specifically, SparseGPT is applied just once, post-training, which is why its creators refer to it as a convenient “one-shot” pruning approach.
(04:13):
The details of how SparseGPT works are fairly mathematically complex and are laid out in the paper if you’re interested in reading more, but the general idea is that pruning is carried out layer by layer. Deep learning models like large language models contain many layers of artificial neurons, and with this kind of layer-by-layer pruning, each layer in the deep learning architecture is pruned separately; the final model is then stitched back together by recomposing the compressed layers. The complexity of this approach comes from the mathematics of stitching the layers back together in such a way that the outputs produced by the full-size model are preserved despite the internal structure of the network being changed so drastically.
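To make that layer-by-layer idea a little more tangible, here’s a heavily simplified sketch in Python with NumPy. It is not the actual SparseGPT algorithm, which uses approximate second-order information to do this far more efficiently at scale; it just illustrates the general recipe of masking weights in one layer and then refitting the surviving weights so that the layer’s outputs on some calibration inputs stay close to the dense layer’s outputs.

```python
import numpy as np

def prune_layer_with_reconstruction(W: np.ndarray, X: np.ndarray, sparsity: float) -> np.ndarray:
    """Toy layer-wise pruning: mask small weights, then refit the surviving
    weights (row by row, via least squares) so that the pruned layer's outputs
    stay close to the dense outputs W @ X on calibration inputs X.
    This mimics the spirit of layer-wise reconstruction, not SparseGPT itself."""
    Y = W @ X                                    # dense layer's outputs to preserve
    W_pruned = np.zeros_like(W)
    k = int((1 - sparsity) * W.shape[1])         # weights kept per output row
    for i, row in enumerate(W):
        keep = np.argsort(np.abs(row))[-k:]      # indices of the largest-magnitude weights
        # Refit only the kept weights so this row's output is reconstructed
        coeffs, *_ = np.linalg.lstsq(X[keep].T, Y[i], rcond=None)
        W_pruned[i, keep] = coeffs
    return W_pruned

# Toy example: one 64x128 layer, 50% sparsity, 256 calibration samples
rng = np.random.default_rng(0)
W = rng.normal(size=(64, 128))
X = rng.normal(size=(128, 256))
W_hat = prune_layer_with_reconstruction(W, X, sparsity=0.5)
err = np.linalg.norm(W @ X - W_hat @ X) / np.linalg.norm(W @ X)
print(f"Relative output error after 50% pruning: {err:.3f}")
```

Do that for each layer independently, stack the compressed layers back together, and you have the pruned model; the hard part, as described above, is doing this at GPT-3 scale in a way that keeps the overall outputs intact.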
(04:56):
Now, being able to halve the size of large language models while retaining accuracy is clearly exciting news with positive commercial and environmental implications, given how widely these models are used today to power myriad natural-language processing applications. Perhaps the most exciting news of all, then, is that the SparseGPT authors reckon that, combined with fine-tuning mechanisms and iterative pruning during training, their one-shot post-training SparseGPT approach could reduce model size by up to 90% without adversely impacting accuracy. So hopefully they can bring that to fruition soon. In dollar terms, that would mean using about $7,500 worth of GPUs to run GPT-3 in production instead of $75,000 worth — a tenth as much. All right, so super cool: roughly half of the parameters in big models like GPT-3 can be removed today without impacting accuracy, and hopefully we’ll be able to remove 90% without adversely impacting accuracy in the near future.
(06:06):
Thanks to Shaan Khosla, a data scientist on my team at my machine learning company Nebula, for inspiring today’s Five-Minute Friday episode on SparseGPT by providing a summary of the SparseGPT paper via his Let’s Talk Text Substack newsletter. He uses the newsletter to provide a weekly, easy-to-read summary of a recent key natural language processing paper, and you can subscribe if that’s something you’re interested in — we’ve provided a link to Shaan’s Substack in the show notes.
(06:33):
And finally, I’ve been mentioning this on-air a lot lately because I’m really excited about it, so hopefully this hasn’t been too annoying for our regular listeners, but if you’d like to learn more about large language models like the GPT series of models: coming up on March 1st, I’ll be hosting a virtual conference on natural language processing with large language models. It’ll be interactive and practical, and it’ll feature some of the most influential scientists and instructors in the large language model space as speakers, including Melanie Subbiah, one of the first authors of the GPT-3 paper, whom I mentioned at the outset of this episode. This half-day virtual conference will be held live on the O’Reilly platform, which many employers and universities provide free access to; otherwise, you can grab a free 30-day trial of O’Reilly using our special code SDSPOD23. We’ve got a link to that code ready for you in the show notes as well.
(07:27):
All right, I hope you enjoyed this informative episode on SparseGPT. Until next time, keep on rockin’ it out there, folks, and I’m looking forward to enjoying another round of the SuperDataScience podcast with you very soon.