thu · ep 05 · jun 18
SEASON 01 · SHIPPED 04 · UPCOMING 01 · VENUE YOUTUBE LIVE

SHOW US YOUR [ AGENT ] SKILLS

[ vanishing / gradients ] × PyMC LABS
EXCEL WORLD CHAMPIONSHIPS × EUROVISION

Long-time Python, ML, AI and data builders show how they're actually using agents today. Not vibe coding. Not demos. The real workflows: agent skills, harnesses, voice-memo memory, background reviewers, from people whose software you've been using for years.

SEASON 01 · EPISODES 04 · BUILDERS 17 · SKILLS 34 · WORKFLOWS 68
NEXT EPISODE · JUN 18, 2026 · thursday · live on youtube

EP 05 · COPILOTS & CODING AGENTS

Three builders on coding agents, search, and context engineering
HOSTS HUGO BOWNE-ANDERSON · THOMAS WIECKI
REGISTER ▶
EP 04 · MAY 2026 · FULL EPISODE ON YOUTUBE · HIGHLIGHT REEL SOON

EVALS, SEARCH & SPORTS

Three builders on LLM evals, search relevance, and Bayesian sports modelling — the real workflows they run.

HAMEL HUSAIN · parlance labs
CHRIS FONNESBECK · pymc labs · mets / brewers / yankees
DOUG TURNBULL · search · shopify / reddit
▣ HIGHLIGHT REEL
Coming soon: the skills and workflows from this episode, packaged into the companion repo with repo links and timestamps — same as EP 01 & EP 02. Subscribe to get them when they land.
EP 03 · MAY 2026 · RUNTIME 3H 18M · 22 SKILLS · 30 WORKFLOWS

GOING TO EUROPE

Long-time European builders show their real agent setups: agent skills, harnesses, and the workflows they actually run.

PAUL IUSZTIN · decoding ai
ELEANOR BERGER · elite ai coding
ALAN NICHOL · rasa
VINCENT WARMERDAM · marimo
NICOLAY GEROLD · amp
MATTHEW HONNIBAL · spacy / explosion
INES MONTANI · spacy / explosion
▣ HIGHLIGHT REEL
01 · Reads a Python codebase and tightens every try/except so the try covers only what can fail and the except catches the right exception. · Matthew Honnibal · 00:12:09
02 · Reads production code, finds where it is fragile, and writes post-mortems for bugs that have not happened yet but a plausible change could introduce. · Matthew Honnibal · 00:14:10
03 · Measures test-suite strength by introducing deliberate bugs one at a time and reporting which ones no test caught. · Matthew Honnibal · 00:14:10
04 · here-now skill · Publishes HTML pages, files, and whole sites to live URLs without leaving the terminal. · Eleanor Berger · 00:45:55
05 · Drives Anki through the AnkiConnect API, gating every note- or card-modifying operation behind explicit confirmation. · Eleanor Berger · 00:49:46
06 · Hands a coding agent a full frontend design language so it builds production-grade interfaces instead of generic ones. · Eleanor Berger · 00:50:02
07 · Reads your YouTube Watch Later playlist, summarises every video from its transcript, and publishes each summary as a secret gist. · Eleanor Berger · 00:52:57
08 · Introspects a thread that went sideways, traces each misstep to the instruction behind it, and proposes edits biased toward deletion. · Nicolay Gerold · 01:59:04
09 · Encodes a builder's design judgment for programmatic video, so Claude turns a few minutes of recorded audio into a finished explainer. · Alan Nichol · 02:46:00
Still to come from this episode: Paul Iusztin's writing loop — diff a hand-edit against the agent's draft, extract the signal, and fold it back into a markdown style profile the agent reads next time — plus Vincent Warmerdam on notebooks as a shared canvas for humans and agents (his marimo-pair skill already shipped in EP 02).
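For a concrete picture of highlight 01 in this reel (tightening try/except), here is a minimal Python sketch. It is not Matthew Honnibal's skill, only an illustration of the before/after shape such a pass might produce. The function names `load_port_loose` and `load_port_tight`, the `port` key, and the `8080` default are all hypothetical.

```python
import json


def load_port_loose(text):
    """Before: the try wraps everything and the except hides every failure."""
    try:
        config = json.loads(text)
        return int(config["port"])
    except Exception:
        # Swallows malformed JSON, a missing key, and a non-numeric port alike.
        return None


def load_port_tight(text, default=8080):
    """After: the try covers only the parse; the except names one exception."""
    try:
        config = json.loads(text)
    except json.JSONDecodeError:
        # Only malformed JSON falls back to the (hypothetical) default.
        return default
    # A missing key or a bad value now raises instead of being masked.
    return int(config["port"])
```

In the tight version a missing `port` key surfaces as a `KeyError` at the call site, which is the point: only the failure you planned for is handled.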
EP 02 · MAY 2026 · RUNTIME 2H 14M · 05 SKILLS · 28 WORKFLOWS

HIDDEN DOORS, EVAL-DRIVEN CHARTS & LOCAL-FIRST INFERENCE

Not vibe coding — four builders, four ways to put scaffolding around an agent. Hilary's weekly gremlins, Bryan's eval-driven chart library, Eric's human-in-the-loop EDA in Marimo, and Tom's local-first inference at 120 t/s on Qwen 35B.

EP 02 · HILARY MASON — gremlins: cron-scheduled agents for bad ideas
HILARY MASON · hidden door
BRYAN BISCHOF · theory ventures
ERIC MA · moderna
TOMASZ TUNGUZ · theory ventures
▣ HIGHLIGHT REEL
01 · Interview the user's intent, ask for three variations at different magnitudes of change, score against a rubric you wrote up front. · Hilary Mason · 01:01:00
02 · A coding agent drives a reactive Marimo notebook through a bash bridge into the Python kernel, for human-in-the-loop EDA. · Eric Ma · 00:11:57
03 · agentic-eda workflow · Human-in-the-loop EDA: agent renders the next plot, human picks the next question, every claim backed by an artifact. · Eric Ma · 00:23:27
04 · Build an agent-facing chart library by generalising eval failures into features; the package can never regress on an eval it once passed. · Bryan Bischof · 01:25:11
05 · Three agent personas pull from a bad-ideas backlog, pitch and critique each other, and write design docs for moonshots no roadmap would schedule. · Hilary Mason · 01:14:20
Still to come from this episode: Tom Tunguz's public-company financial analysis skill — local Qwen 35B pulls earnings data and assembles an HTML walkthrough in ~2.5 minutes — plus the local-first inference stack it sits on.
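Highlight 04 in this reel says the package "can never regress on an eval it once passed". One way to read that is as a ratchet over eval results. The sketch below is an assumption about how such a gate could work, not Bryan Bischof's implementation. The names `find_regressions` and `ratchet` and the eval names in the usage note are hypothetical.

```python
def find_regressions(previously_passed, current_results):
    """Return, sorted, the evals that passed before but fail or are missing now."""
    return sorted(
        name for name in previously_passed
        if not current_results.get(name, False)
    )


def ratchet(previously_passed, current_results):
    """Refuse any regression; otherwise grow the set of evals that must keep passing."""
    regressions = find_regressions(previously_passed, current_results)
    if regressions:
        raise RuntimeError("regressed on: " + ", ".join(regressions))
    newly_passed = {name for name, ok in current_results.items() if ok}
    return set(previously_passed) | newly_passed
```

Usage, under the same assumptions: start with `passed = set()`, call `passed = ratchet(passed, {"bar_chart": True, "log_axis": False})` after each eval run, and persist `passed` between runs. An eval enters the set the first time it passes and can only leave by failing the build.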
EP 01 · APR 2026 · RUNTIME 1H 32M · 07 SKILLS · 10 WORKFLOWS

ROBOREV, THE LOST ART OF VERIFICATION & VOICE-MEMO MEMORY

Three builders, three layers of structure around an agent. Wes's RoboRev daemon reviewing every commit at ~1.3B tokens/day, Jeremiah's personal skill folder bending agents to his voice and verb vocabulary, and Randy's LLM-as-judge verifier loop encoding Tufte's rules into a daily chart-making skill.

EP 01 · WES MCKINNEY — a million lines of code in six months: spicytakes.org & the agentic engineering stack
WES MCKINNEY · pandas / posit
JEREMIAH LOWIN · prefect / fastmcp
RANDY OLSON · goodeye labs
▣ HIGHLIGHT REEL
01 · explain skill · Agent narrates what it just did, like a teammate handing off. · Jeremiah Lowin · 00:46:14
02 · Replies to GitHub contributors in your voice, no "Great work, but rejected" sandwiches. · Jeremiah Lowin · 00:54:08
03 · ship-it skill · Re-trains "ship it" to mean open a PR, not merge. · Jeremiah Lowin · 00:54:52
04 · Turns a one-line idea into a Tufte-style chart, with an LLM-as-judge verifier loop. · Randy Olson · 01:12:37
05 · Turns guest headshots into short 8-bit pixel-art video clips for livestream intros and cutaways. · Show Us Your Agent Skills · ep 01
Still to come from this episode: Wes McKinney's stack writeup — agents reviewing agents (every commit read by agents 4–5 times before merge), a fleet of long-running Superpowers sessions (one ran 14 hours and 45 tasks unattended), and "off the rails?" review (no line-level reading, just whether the agent strayed structurally).

SELECTED SKILLS & WORKFLOWS

A selection of skills and workflows from the streams, packaged into the companion repo: hugobowne/show-us-your-agent-skills

$ npx skills add https://github.com/hugobowne/show-us-your-agent-skills
skill · explain · EP 01 · Agent narrates what it just did, like a teammate handing off.
Jeremiah Lowin · Prefect / FastMCP · youtube · 00:46:14

skill · Replies to GitHub contributors in your voice. No "great work, but rejected" sandwiches.
Jeremiah Lowin · Prefect / FastMCP · youtube · 00:54:08

skill · ship-it · EP 01 · Re-trains "ship it" to mean open a PR, not merge.
Jeremiah Lowin · Prefect / FastMCP · youtube · 00:54:52

skill · One-line idea → Tufte-style chart with an LLM-as-judge verifier loop.
Randy Olson · Goodeye Labs · r/dataisbeautiful · youtube · 01:12:37

skill · Turns guest headshots into 8-bit pixel-art video clips for livestream intros and cutaways.
Show Us Your Agent Skills · ep 01 on youtube

skill · Interview intent, ask for three variations at different magnitudes, score against a rubric.
Hilary Mason · Hidden Door · youtube · 01:01:00

skill · Agent drives a reactive Marimo notebook through a bash bridge into the Python kernel.
Eric Ma · Moderna · youtube · 00:11:57

workflow · Human-in-the-loop EDA. Every claim backed by an artifact.
Eric Ma · Moderna · youtube · 00:23:27

workflow · Build an agent-facing chart library by generalising eval failures into features; the package can never regress on an eval it once passed.
Bryan Bischof · Theory Ventures · youtube · 01:25:11

workflow · Three agent personas pull from a bad-ideas backlog, pitch and critique each other, and write design docs for moonshots no roadmap would schedule.
Hilary Mason · Hidden Door · youtube · 01:14:20

YOUR HOSTS

two builders · two hosts · on a mission to find out what people at the top of the game are actually doing
HUGO BOWNE-ANDERSON · host · vanishing gradients
AI builder, consultant, educator of 6+ million students; ex-Yale, ex-Max Planck.
THOMAS WIECKI · host · pymc labs
Co-creator of PyMC. Founder of PyMC Labs. Has built Bayesian models for hedge funds, Fortune 500s, and indie tinkerers for over a decade.
DON'T MISS THE NEXT EPISODE.

Episode announcements, the workflows we cut for time, and what long-time builders are actually doing with agents. Free, no spam.

▶ SUBSCRIBE ON SUBSTACK ↗

Doing something weird with agents? Nominate yourself or someone else ↗