The Test Set by Posit

The Answer Was Never Us — with Leilani Battle 27.07.2026 1h 14m

Leilani Battle studies how software shapes what we see and believe. The University of Washington professor and co-director of the UW Interactive Data Lab talks with Michael, Hadley, and Wes about an experiment that manipulated people using nothing but loading speed, and why AI models don't seem to recommend charts the way the community that studies charts actually does. Other highlights: rationality's blind spots and a thorough disc golf origin story. What's inside: Loading spinners that quie...

Curiosity, duty, and existential dread — with Joe Cheng 13.07.2026 1h 5m

Joe Cheng is the CTO of Posit and the creator of Shiny. He joins Michael and Hadley to talk about why he almost walked away from AI work entirely over ethics concerns and what it takes to lead a team that didn't necessarily choose you. Plus, why saying yes to everyone is a worse strategy than it sounds. Bonus: Hadley calls out Joe's people-pleasing in real time. What's inside: Joe's 2012 self-doubt spiral that accidentally created ShinyWhy Joe almost quit working on AI entirelyThe "loaded gun...

Confidently Incorrect — with Caitlin Colgrove 29.06.2026 1h

Caitlin Colgrove is the CTO of Hex, the data workspace for building and sharing data projects using SQL and Python that somehow counts a Sweetgreen chef as a power user. She joins Michael, Hadley, and Isabel to talk about what AI agents actually get wrong in data work (it's not the hallucinations, it's supreme overconfidence), why data teams aren't going anywhere, and how she thinks about building products for humans and agents at the same time. What's inside What Hex's Context Studio does, a...

The Bothness of It — with Alex Hillman 15.06.2026 1h 14m

Alex Hillman built one of America's first co-working spaces, wrote a business book in tweets, and recently handed his inbox to a Claude Code agent — not to draft emails, but to notice when a friendship is going cold. In this episode, Alex, Michael, Wes, and Hadley dig into marketing for people who hate marketing, what 20 years of email reveals about your relationships, and why the hardest part of AI-assisted coding was always before you wrote a single line. What's inside: Marketing is r...

The Code Doesn't Lie — with Mike Bostock 01.06.2026 1h 8m

Mike Bostock made D3 when the browser was still a joke. He built bl.ocks when people needed somewhere to share their work. Now he's building Observable — reactive notebooks with an AI that actually looks at what it made. In this episode: the three-GIF bar chart that launched 25 years of viz, why open source needs both intrinsic and extrinsic motivation, and why an agent that can't see its own output is likely to be confidently wrong. What's Inside The 1998 visualization library that could onl...

The Wonder-Driven Builder — with Paige Bailey 18.05.2026 45m

Paige Bailey is a developer relations engineering lead at Google DeepMind. She's a geophysicist-turned-AI-engineer who was once told by her professors that building open-source libraries was a waste of time. We talk about her path from planetary science to TensorFlow, why statisticians have a hidden edge in the age of AI, and what it means to be a curious generalist when the cost of building software is approaching zero. Bonus: installing solar-powered silent-film birdhouses as street art in ...

Widgets Are Lego Bricks (and Other Things People Are Sleeping On) — with Vincent Warmerdam 04.05.2026 1h 15m

Vincent Warmerdam has been the first full-time hire at a startup, a spacey punster who accidentally got himself a job, a bartender at an Amsterdam comedy theater, and a Dutch bike tour guide — and he'll tell you all of it was career development. Now doing DevRel at Marimo, Vincent makes the case for reactive notebooks, Lego-brick widgets, and why "number go up" is not a data science strategy. Also: chickens die. The model doesn't know. This matters more than you think. What's inside How a spa...

Everything's a Fad (Including This Podcast) — with Benn Stancil 20.04.2026 1h 35m

Benn Stancil built Mode Analytics, spent a decade in the data trenches, and now writes some of the sharpest, funniest essays in the data world. On The Test Set, he talks about the cultural shift from Nate Silver to Rick Rubin why AI might kill the analytics dashboard, and what happens when a thousand startups all build the same thing. Plus: boy bands as a model for collaboration, and why the best creative work starts with cheating. What's inside: Why the modern data stack was basically ...

Deeply Unsexy: SQL's Redemption Arc — with Tristan Handy 06.04.2026 1h 5m

dbt Labs CEO Tristan Handy drops into The Test Set to map the fault lines between the data science world and the enterprise data world — and explain why analytics engineers are basically pissed-off data analysts who decided to organize the bookshelf. We get into SQL's glow-up, the community magic of dbt Slack, what AI agents mean for data warehouses, and why everyone's building iOS apps with Claude now. What's inside: What analytics engineers *actually* doSQL's journey from deeply unsexy to i...

Your VP Is Doing a Rogue Analysis in Cursor Right Now — with Nell Thomas 23.03.2026 1h 22m

Nell Thomas has spent two decades in data — from equity research to the DNC to Facebook to leading a 400-person data org at Shopify. She walks Michael and Wes through the modern data stack role by role, gets honest about what AI is and isn't changing about data work, and admits the semantic layer has been her greatest leadership failure. Plus: Sneakers gets the respect it deserves. Episode Notes What does it actually look like to run data infrastructure for millions of merchants while the ent...

Sleeping Rats and Sociopathic Agents — with Phillip Cloud 09.03.2026 56m

Phillip Cloud has been shaping the Python data ecosystem since the early pandas days — and he has *opinions*. Now a principal engineer at NVIDIA leading the Ibis project, Phillip talks about how he stumbled into open source via an eye movement lab, why he prefers his coding agents cold and emotionless, and what happens when you ask an LLM for woodworking trig. Plus: terminal user interfaces, the file hierarchy standard hot take nobody asked for, and the pineapple-on-pizza hill he's willing to...

More productive but a lot less fun — with Charlie Marsh 23.02.2026 1h 35m

Charlie Marsh built Ruff, uv, and Ty — the tools that mass-fixed Python's worst pain points. Now he's grappling with what happens when agents start writing most of the code. In this episode, Charlie gets real about his team trusting his PRs less, the gnarly middle of coding with agents, and whether Python is even the right language for an agentic future. It's honest, a wee existential, and deeply relatable if you ship code for a living. Episode Notes Charlie Marsh is the founder and CEO of As...

Alenka Frim: What yoga teaches us about discipline and collaboration in data science 09.02.2026 1h 1m

Alenka Frim went from teaching yoga full-time to becoming a committer and PMC Member on Apache Arrow. In this episode, Alenka joins The Test Set hosts to talk about how Arrow grew from spec to critical infrastructure, and why she started contributing to a project she had never even used. She reflects on imposter syndrome, the discipline of showing up (on the mat and in GitHub), and how agents are changing what it means to write code. Plus: managing 4,000 open issues without losing your mind. ...

Emily Riederer: Column selectors, data quality, and learning in public 26.01.2026 58m

Emily Riederer writes Python with an R accent, and we’re all comfortable with it. In this episode, Emily reflects on her journey through R, Python, and SQL — from lessons learned in averaging default values (oops, we're not all rich!) to discovering that column selectors are way cooler than they sound. She weighs in on the delicate art of learning in public, why frustration often makes the best teacher, and how to find your niche by solving the boring problems. Oh, Oh, and the crew casually d...

Rebecca Barter: Persistent learning, tool building, and ‘Will code even exist?’ 12.01.2026 56m

Rebecca Barter, senior data scientist at Arine and adjunct assistant professor at the University of Utah, refuses to work on things she doesn’t care about. Lucky for us, she cares about a lot, most of all impact. In this episode, Rebecca joins The Test Set to talk about learning fast, building better tools, and staying motivated and adaptable. She shares how moving between R, Python, SQL, and dashboards reshaped how she thinks about expertise. Plus a reflection on her recent posit::conf talk,...

Marco Gorelli: Narwhals, ecosystem glue, and the value of boring work 15.12.2025 51m

You’ve probably used Narwhals without realizing it. It’s the compatibility layer helping apps and libraries like Plotly play nice with Pandas, Polars, Arrow, and more — while keeping computation native instead of converting everything to Pandas. In this episode, Marco Gorelli explains how his weekend experiment turned into essential ecosystem infrastructure and why data types, not APIs, are where interoperability gets tricky. Plus what it takes to build trust and community around an open-sour...

Kelly Bodwin — Quarto hacks, AI in the classroom, and why R should stay weird 01.12.2025 51m

In this episode, we’re joined by Kelly Bodwin — candy corn defender, board game enthusiast, and Associate Professor of Statistics and Data Science at Cal Poly. We discuss her path from English and French to statistics, how she builds teaching tools and navigates AI in the classroom, and what it takes to keep a programming community weird in the best possible way. Episode notes Kelly is curious, collaborative, and unafraid to lean in on quirky. Kelly shares how she balances teaching three cour...

James Blair: Part 2 — Solutions engineering, critical thinking, and staying human 17.11.2025 42m

This episode is Part 2 of our conversation with James Blair. He explains how he found his “accidental perfect fit” as a solutions engineer and how that role became a pipeline into product management. Get a peek into the AI-powered tooling he’s now building for the Posit ecosystem, and hear how he’s using Claude Code, Positron Assistant, and DataBot to generate synthetic, industry-specific demos on the fly — plus, why the real magic is keeping humans firmly in the loop. Episode notes Th...

James Blair: Part 1 — Portfolios, practice, and staying curious 03.11.2025 29m

In Part 1 of our conversation with James Blair, we trace his delightfully non-linear path from childhood robotics dreams to journalism to R, with a few stops in between. We hear about the Shiny app that changed his career, plus a candid roundtable with Michael, Hadley, and Wes about whether a data-science master’s still pays off in the age of AI. Episode notes This is a story about staying hands-on and fiercely inquisitive — whether analyzing bike telemetry or in teaching data science. Jame...

Julia Silge: Part 2 — Glue work, licensing, and open source in the age of LLMs 20.10.2025 28m

In part two of our conversation with Julia Silge, we discuss how work actually ships: the boundaries, the glue, and the tools that turn noise into signal. From there, we go macro and wonder what the LLM era means for humanity’s contributions, plus how licensing is evolving to protect sustainability without abandoning openness. Episode notes Both practical and philosophical, this conversation spans workplace energy, team connective tissue, and the big questions LLMs have us asking in a shift...

The Test Set by Posit

Episodes

Popular in

The Test Set by Posit

Episodes

Similar podcasts

Popular in