Podcasts SDS 821: The Skills You Need to Be an Effective Data Scientist, with Marck Vaisman

72 minutes
Career Tips, Data Science

SDS 821: The Skills You Need to Be an Effective Data Scientist, with Marck Vaisman

Subscribe on Apple Podcasts, Spotify, Stitcher Radio or TuneIn

Marck Vaisman speaks to Jon Krohn about his paradigm for understanding core data practitioner types. Hear Marck detail the four data practitioner personas that he has identified in his research, why he believes the roadmaps that influencers like to promote as surefire ways to a data science career don’t work in practice, and why the term “data scientist” is still so elusive and hard to recruit for.

Thanks to our Sponsors:

Interested in sponsoring a Super Data Science Podcast episode? Email natalie@superdatascience.com for sponsorship information.

About Marck Vaisman

Marck is a data science expert with 15 years of experience, currently aiding customers in harnessing Azure for their data science and AI workloads. As an Adjunct Professor at Georgetown and George Washington University, he integrates industry knowledge into courses like Big Data and Cloud Computing. He’s also the founder of Data Community DC, promoting data science in the Washington DC area. An advocate for R programming, Marck has authored several data science publications, including “Analyzing the Analyzers”. He holds an MBA from Vanderbilt University and a B.S. in Mechanical Engineering from Boston University.

Overview

For Marck, data science isn’t just about making data visible; it’s about finding interesting ways to tell the stories they reveal to us. And in his latest booklet, “Analyzing the Analyzers”, Marck breaks down the five skill areas and four distinct personas of data professionals based on his survey findings. Those personas are: data businessperson, data creative, data developer, and data researcher. The five skill areas are: business machine learning, big data, operations research, programming, and statistics.

Marck was quick to note the porosity between these personas. People can see themselves in more than one category, and many people move between personas over time. Nevertheless, Marck says that these personas exist to present a more accurate picture of needs in the tech industry. Given the speed at which tools and libraries change and adapt, hiring for character types might be a more reliable method to ensure companies get the right people.

Marck and Jon also talk about what they see as an unnecessary complexity in many interviews for data science positions, where candidates are asked about details that would never come up in the jobs they are being recruited for. In their view, technical problems that have one answer can always be looked up and resolved. Part of the reason why interviews ask unnecessary questions is that data science is so hard to define. Marck says this is the nature of this new and dynamic field and that hopeful data scientists would do better to understand how to interpret and communicate data than master a Python library that may become obsolete in a few years.

Listen to the episode to hear details about these four data science personas, the top skills you need to be an effective data scientist, and why ‘mindset’ is so important in the tech industry.

In this episode you will learn:

How Marck started his work in defining data science roles [08:06]
The relationship between the four data practitioner personas [15:26]
About Marck’s “menu” for effective data science [40:43]
How recruiters can hire the best data scientist for the job [59:31]

Items mentioned in this podcast:

This episode is brought to you by Gurobi
AGNTCY
What Makes for an Effective Data Practitioner in 2024?
What Makes for an Effective Data Practitioner in 2024 Slides
SDS 790: Open-Source Libraries for Data Science at the New York R Conference
SDS 794: Exciting (and Frightening!) Trends in Open-Source AI
SDS 813: Solving Business Problems Optimally with Data, with Jerry Yurchisin
Web Summit
Analyzing the Analyzers by Sean Murphy, Marck Vaisman, and Harlan Harris
The Data Science Venn Diagram by Drew Conway
SuperDataScience
Intro to Data Structures and Algorithms
Data Structures and Algorithms, Level II: Hashing, Trees, Graphs
SDS special code for a free 30-day trial on O’Reilly: SDSPOD23
The Super Data Science Podcast Team

Follow Marck:

Follow Jon:

Episode Transcript:

Download The Transcript

Podcast Transcript

Jon Krohn: 00:00:00

This is episode number 821 with Marck Vaisman, Senior Cloud Solutions Architect at Microsoft.