thu · ep 03 · may 21
SEASON 01 · SHIPPED 02 · UPCOMING 02 · VENUE YOUTUBE LIVE

SHOW US YOUR [ AGENT ] SKILLS

[ vanishing / gradients ] × PyMC LABS
EXCEL WORLD CHAMPIONSHIPS × EUROVISION

Long-time Python, ML, AI and data builders show how they're actually using agents today. Not vibe coding. Not demos. The real workflows: agent skills, harnesses, voice-memo memory, background reviewers, from people whose software you've been using for years.

SEASON 01 · EPISODES 02 · BUILDERS 07 · SKILLS 12 · WORKFLOWS 35
NEXT EPISODE · MAY 21 · 2026 · thursday · live on youtube

EP 03 · GOING TO EUROPE

Six long-time European builders show their real agent setups
SPECIAL GUEST · INES MONTANI (spaCy / Explosion) may pop in.
HOSTS HUGO BOWNE-ANDERSON · THOMAS WIECKI
REGISTER ▶
AND AFTER THAT · MAY 29 · 2026 · friday · ep 04 · live on youtube
EP 02 · MAY 2026 · RUNTIME 2H 14M · 05 SKILLS · 25 WORKFLOWS

HIDDEN DOORS, EVAL-DRIVEN CHARTS & LOCAL-FIRST INFERENCE

Not vibe coding: four builders, four ways to put scaffolding around an agent. Hilary's weekly gremlins, Bryan's eval-driven chart library, Eric's human-in-the-loop EDA in Marimo, and Tom's local-first inference at 120 tokens/s on Qwen 35B.

EP 02 · HILARY MASON — gremlins: cron-scheduled agents for bad ideas
HILARY MASON · hidden door
BRYAN BISCHOF · theory ventures
ERIC MA · moderna
TOMASZ TUNGUZ · theory ventures
▣ HIGHLIGHT REEL
01 · Interview the user's intent, ask for three variations at different magnitudes of change, score against a rubric you wrote up front. · Hilary Mason · 01:01:00
02 · A coding agent drives a reactive Marimo notebook through a bash bridge into the Python kernel, for human-in-the-loop EDA. · Eric Ma · 00:11:57
03 · agentic-eda workflow · Human-in-the-loop EDA: agent renders the next plot, human picks the next question, every claim backed by an artifact. · Eric Ma · 00:23:27
04 · Build an agent-facing chart library by generalising eval failures into features; the package can never regress on an eval it once passed. · Bryan Bischof · 01:25:11
05 · Three agent personas pull from a bad-ideas backlog, pitch and critique each other, and write design docs for moonshots no roadmap would schedule. · Hilary Mason · 01:14:20
Still to come from this episode: Tom Tunguz's public-company financial analysis skill — local Qwen 35B pulls earnings data and assembles an HTML walkthrough in ~2.5 minutes — plus the local-first inference stack it sits on.
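
One way to read Bryan's rule that the package "can never regress on an eval it once passed" is as a ratchet over eval results. The Python sketch below illustrates that reading only; the function names, the in-memory ledger, and the eval names are hypothetical and are not taken from the episode or the companion repo.

```python
def find_regressions(previously_passed: set[str], results: dict[str, bool]) -> list[str]:
    """Return evals that passed in the past but do not pass in this run.

    An eval missing from `results` is treated as failing (an assumption),
    so deleting an eval cannot silently hide a regression.
    """
    return sorted(name for name in previously_passed if not results.get(name, False))


def update_ledger(previously_passed: set[str], results: dict[str, bool]) -> set[str]:
    """Ratchet step: refuse regressions, otherwise grow the set of passed evals."""
    regressions = find_regressions(previously_passed, results)
    if regressions:
        raise AssertionError(f"regressed on evals that once passed: {regressions}")
    newly_passed = {name for name, ok in results.items() if ok}
    return previously_passed | newly_passed


# Hypothetical eval names, for illustration only.
ledger = update_ledger(set(), {"bar_chart_labels": True, "log_axis": False})
ledger = update_ledger(ledger, {"bar_chart_labels": True, "log_axis": True})
```

In this sketch a failing eval that has never passed is allowed (it is a candidate feature), while any eval already in the ledger must keep passing.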
EP 01 · APR 2026 · RUNTIME 1H 32M · 07 SKILLS · 10 WORKFLOWS

ROBOREV, THE LOST ART OF VERIFICATION & VOICE-MEMO MEMORY

Three builders, three layers of structure around an agent. Wes's RoboRev daemon reviewing every commit at ~1.3B tokens/day, Jeremiah's personal skill folder bending agents to his voice and verb vocabulary, and Randy's LLM-as-judge verifier loop encoding Tufte's rules into a daily chart-making skill.

EP 01 · WES MCKINNEY — a million lines of code in six months: spicytakes.org & the agentic engineering stack
WES MCKINNEY · pandas / posit
JEREMIAH LOWIN · prefect / fastmcp
RANDY OLSON · goodeye labs
▣ HIGHLIGHT REEL
01 · explain skill · Agent narrates what it just did, like a teammate handing off. · Jeremiah Lowin · 00:46:14
02 · Replies to GitHub contributors in your voice, no "Great work, but rejected" sandwiches. · Jeremiah Lowin · 00:54:08
03 · ship-it skill · Re-trains "ship it" to mean open a PR, not merge. · Jeremiah Lowin · 00:54:52
04 · Turns a one-line idea into a Tufte-style chart, with an LLM-as-judge verifier loop. · Randy Olson · 01:12:37
05 · Turns guest headshots into short 8-bit pixel-art video clips for livestream intros and cutaways. · Show Us Your Agent Skills · ep 01
Still to come from this episode: Wes McKinney's stack writeup — agents reviewing agents (every commit read by agents 4–5 times before merge), a fleet of long-running Superpowers sessions (one ran 14 hours and 45 tasks unattended), and "off the rails?" review (no line-level reading, just whether the agent strayed structurally).
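
Randy's chart skill above pairs generation with an LLM-as-judge verifier loop. As a rough Python sketch of that general pattern (not his implementation): a generator proposes a draft, a judge returns the rules it violates, and the violations are fed back until the judge is satisfied or a round limit is hit. The `generate` and `judge` callables would wrap model calls in practice; the toy versions and the two rule names here are hypothetical stand-ins.

```python
from typing import Callable

Draft = dict[str, object]


def verifier_loop(
    idea: str,
    generate: Callable[[str, list[str]], Draft],
    judge: Callable[[Draft], list[str]],
    max_rounds: int = 3,
) -> tuple[Draft, bool]:
    """Regenerate until the judge reports no violations or rounds run out."""
    if max_rounds < 1:
        raise ValueError("max_rounds must be at least 1")
    feedback: list[str] = []
    for _ in range(max_rounds):
        draft = generate(idea, feedback)  # the draft sees the last round's violations
        feedback = judge(draft)           # empty list means the judge accepts the draft
        if not feedback:
            return draft, True
    return draft, False


# Toy stand-ins (assumptions): a chart "spec" dict and two made-up style rules.
def toy_generate(idea: str, feedback: list[str]) -> Draft:
    spec: Draft = {"title": idea, "gridlines": True, "effects_3d": True}
    for rule in feedback:
        if rule in spec:
            spec[rule] = False  # "fix" whatever the judge flagged
    return spec


def toy_judge(spec: Draft) -> list[str]:
    return [key for key in ("gridlines", "effects_3d") if spec[key]]


chart, accepted = verifier_loop("coffee vs sleep", toy_generate, toy_judge)
```

Returning the accepted flag alongside the draft lets a caller decide whether to publish, retry, or escalate to a human when the round limit is reached.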

SELECTED SKILLS & WORKFLOWS

a selection from the companion repo · hugobowne/show-us-your-agent-skills ↗

A selection of skills and workflows from the streams, packaged into the companion repo.

$ npx skills add https://github.com/hugobowne/show-us-your-agent-skills
skill
explain
EP 01
Agent narrates what it just did, like a teammate handing off.
Jeremiah Lowin · Prefect / FastMCP ↗ youtube · 00:46:14
skill
Replies to GitHub contributors in your voice. No "great work, but rejected" sandwiches.
Jeremiah Lowin · Prefect / FastMCP ↗ youtube · 00:54:08
skill
ship-it
EP 01
Re-trains "ship it" to mean open a PR, not merge.
Jeremiah Lowin · Prefect / FastMCP ↗ youtube · 00:54:52
skill
One-line idea → Tufte-style chart with an LLM-as-judge verifier loop.
Randy Olson · Goodeye Labs · r/dataisbeautiful ↗ youtube · 01:12:37
skill
Turns guest headshots into 8-bit pixel-art video clips for livestream intros and cutaways.
Show Us Your Agent Skills ↗ ep 01 on youtube
skill
Interview intent, ask for three variations at different magnitudes, score against a rubric.
Hilary Mason · Hidden Door ↗ youtube · 01:01:00
skill
Agent drives a reactive Marimo notebook through a bash bridge into the Python kernel.
Eric Ma · Moderna ↗ youtube · 00:11:57
workflow
Human-in-the-loop EDA. Every claim backed by an artifact.
Eric Ma · Moderna ↗ youtube · 00:23:27
workflow
Build an agent-facing chart library by generalising eval failures into features; the package can never regress on an eval it once passed.
Bryan Bischof · Theory Ventures ↗ youtube · 01:25:11
workflow
Three agent personas pull from a bad-ideas backlog, pitch and critique each other, and write design docs for moonshots no roadmap would schedule.
Hilary Mason · Hidden Door ↗ youtube · 01:14:20
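
The rubric step in Hilary's skill above (three variations at different magnitudes, scored against a rubric written up front) can be sketched as a weighted sum. This Python example is illustrative only: the criteria, weights, and ratings are invented, and in a real workflow the ratings would come from a human or a judging agent.

```python
# Hypothetical rubric, fixed before any variation is generated: criterion -> weight.
RUBRIC: dict[str, int] = {"clarity": 5, "fidelity_to_intent": 3, "novelty": 2}


def score(ratings: dict[str, int], rubric: dict[str, int]) -> int:
    """Weighted sum of per-criterion ratings; raises KeyError if a criterion is unrated."""
    return sum(weight * ratings[criterion] for criterion, weight in rubric.items())


def pick_best(variations: dict[str, dict[str, int]], rubric: dict[str, int]) -> str:
    """Return the name of the highest-scoring variation."""
    return max(variations, key=lambda name: score(variations[name], rubric))


# Made-up ratings (1 to 5) for small, medium, and large magnitudes of change.
variations = {
    "small": {"clarity": 4, "fidelity_to_intent": 5, "novelty": 1},
    "medium": {"clarity": 4, "fidelity_to_intent": 4, "novelty": 3},
    "large": {"clarity": 2, "fidelity_to_intent": 3, "novelty": 5},
}
best = pick_best(variations, RUBRIC)
```

Writing the rubric before generating anything keeps the scoring from drifting toward whichever variation happens to look best afterwards.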

YOUR HOSTS

two builders · two hosts · on a mission to find out what people at the top of the game are actually doing
HUGO BOWNE-ANDERSON
host · vanishing gradients
AI builder, consultant, educator of 6+ million students; ex-Yale, ex-Max Planck.
THOMAS WIECKI
host · pymc labs
Co-creator of PyMC. Founder of PyMC Labs. Has built Bayesian models for hedge funds, Fortune 500s, and indie tinkerers for over a decade.
DON'T MISS THE NEXT EPISODE.

Episode announcements, the workflows we cut for time, and what long-time builders are actually doing with agents. Free, no spam.

▶ SUBSCRIBE ON SUBSTACK ↗

Doing something weird with agents? Nominate yourself or someone else ↗