thu · ep 05 · jun 18
SEASON 01 · SHIPPED 04 · UPCOMING 01 · VENUE YOUTUBE LIVE

SHOW US YOUR [ AGENT ] SKILLS

[ vanishing / gradients ] × PyMC LABS
EXCEL WORLD CHAMPIONSHIPS × EUROVISION

Long-time Python, ML, AI and data builders show how they're actually using agents today. Not vibe coding. Not demos. The real workflows: agent skills, harnesses, voice-memo memory, background reviewers, from people whose software you've been using for years.

SEASON 01
EPISODES 04
BUILDERS 17
SKILLS · WORKFLOWS 34 · 68
NEXT EPISODE · JUN 18, 2026 · thursday · live on youtube

EP 05 · COPILOTS & CODING AGENTS

Three builders on coding agents, search, and context engineering
HOSTS HUGO BOWNE-ANDERSON · THOMAS WIECKI
REGISTER ▶
EP 04 · MAY 2026 · 03 GUESTS · FULL EPISODE

HOW TO EVALUATE AGENTIC WORKFLOWS

Skill scepticism, plan review, implementation review, agentic search, and hidden holdout tests.

EP 04 · HOW TO EVALUATE AGENTIC WORKFLOWS - skill scepticism, review loops, and hidden holdout tests
▣ HIGHLIGHT REEL
01
Read public skills like code: check provenance, maintenance, and constraints, then fork the idea instead of importing a shortcut. · Hamel Husain
00:22:32
02
Turn cheap experimentation into checkpoints: plan first, review red and yellow flags, implement, then review the code before trust. · Chris Fonnesbeck
01:05:53
03
Give the agent room to mutate a search ranker, but keep the final score hidden so improvements have to survive a real holdout. · Doug Turnbull
01:41:07
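
Highlight 03 above describes letting an agent change a ranker freely while the final score stays hidden. A minimal Python sketch of that idea follows. Everything in it is an assumption made for illustration: the `HoldoutJudge` class, the toy documents and queries, and the precision-at-1 metric are hypothetical and are not taken from Doug Turnbull's harness.

```python
from typing import Callable, Dict, List

# A ranker takes a query and candidate docs and returns the docs in ranked order.
Ranker = Callable[[str, List[str]], List[str]]


def precision_at_1(ranker: Ranker, judgments: Dict[str, str], docs: List[str]) -> float:
    """Fraction of queries whose top-ranked doc is the judged-relevant one."""
    hits = sum(1 for query, relevant in judgments.items()
               if ranker(query, docs)[0] == relevant)
    return hits / len(judgments)


class HoldoutJudge:
    """Keeps the holdout judgments private and only answers accept or reject."""

    def __init__(self, holdout: Dict[str, str], docs: List[str], baseline: Ranker):
        self._holdout = holdout
        self._docs = docs
        self._best = precision_at_1(baseline, holdout, docs)

    def accept(self, candidate: Ranker) -> bool:
        # The numeric score never leaves this method, so it cannot be optimised directly.
        score = precision_at_1(candidate, self._holdout, self._docs)
        if score > self._best:
            self._best = score
            return True
        return False


# Toy data (hypothetical). DEV is visible to the agent; HOLDOUT is not.
DOCS = ["red shoes", "blue shoes", "red hat"]
DEV = {"blue shoes": "blue shoes"}
HOLDOUT = {"red hat": "red hat", "shoes blue": "blue shoes"}


def baseline(query: str, docs: List[str]) -> List[str]:
    return list(docs)  # no ranking at all


def memorized(query: str, docs: List[str]) -> List[str]:
    # An overfit "improvement": hard-codes the answers it saw on DEV.
    top = DEV.get(query, docs[0])
    return [top] + [d for d in docs if d != top]


def overlap(query: str, docs: List[str]) -> List[str]:
    # A general improvement: rank by shared terms with the query.
    terms = set(query.split())
    return sorted(docs, key=lambda d: -len(terms & set(d.split())))


judge = HoldoutJudge(HOLDOUT, DOCS, baseline)
dev_scores = {r.__name__: precision_at_1(r, DEV, DOCS) for r in (memorized, overlap)}
memorized_accepted = judge.accept(memorized)
overlap_accepted = judge.accept(overlap)
print(dev_scores, memorized_accepted, overlap_accepted)
```

Both candidates look identical on the visible dev queries, but only the one that generalises is accepted by the judge. Returning a boolean rather than the score is one possible design choice for limiting how much the agent can learn about the holdout.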
EP 03 · MAY 2026 · 06 GUESTS · RUNTIME 3H 18M

FROM SKILLS TO AGENT HARNESSES

Research memory, local boxes, debug panes, live notebooks, video generation, and code repair.

EP 03 · FROM SKILLS TO AGENT HARNESSES - research memory, debug surfaces, notebooks, video, and code repair
▣ HIGHLIGHT REEL
01
A narrow audit pass for the failure mode agents love: broad try blocks and exception handlers that make bad code look green. · Matthew Honnibal
00:12:09
02
Instead of hunting today's bugs, write incident reports for the failures a future reasonable edit could cause. · Matthew Honnibal
00:14:10
03
Deliberately break the code, one mutation at a time, to find the bugs your test suite would let through. · Matthew Honnibal
00:14:10
04
here-now skill
Collapse publishing into one instruction: the agent ships HTML pages and small sites to live URLs without a GitHub Pages detour. · Eleanor Berger
00:45:55
05
Drive Anki through its local API with confirmation checks, so an agent can maintain flashcards without silently mutating memory. · Eleanor Berger
00:49:46
06
Give coding agents a design language: fewer generic panels, more interfaces that look like someone meant it. · Eleanor Berger
00:50:02
07
Asked once from a phone; the agent invented browser login, transcript fetching, caching, and secret gist summaries. · Eleanor Berger
00:52:57
08
Use failed threads as harness training data: trace missteps to instructions, then delete or sharpen the rule that caused them. · Nicolay Gerold
01:59:04
09
Record a few minutes of audio; let the skill carry video taste, timing rules, frame checks, and avatar compositing. · Alan Nichol
02:46:00
10
research skill
Turn trusted sources into a durable research wiki, so future agents query accumulated context instead of starting over. · Paul Iusztin
02:19:52
11
Run a personal agent on a separate Mac mini, with Discord as the front door and autonomy earned inside a hard perimeter. · Eleanor Berger
00:47:50
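
Highlight 01 above describes a narrow audit pass for broad try blocks and exception handlers. One way such a pass could be sketched is with Python's standard `ast` module. This is an illustrative assumption, not Matthew Honnibal's actual skill: the function name `find_broad_handlers`, the choice of which handlers count as "broad", and the sample source are all hypothetical.

```python
import ast
from typing import List, Tuple

# Exception types treated as "too broad" in this sketch (an assumption).
BROAD_NAMES = {"Exception", "BaseException"}


def find_broad_handlers(source: str) -> List[Tuple[int, str]]:
    """Return (line, reason) for each except clause that swallows too much."""
    findings = []
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.ExceptHandler):
            continue
        if node.type is None:
            findings.append((node.lineno, "bare except"))
        elif isinstance(node.type, ast.Name) and node.type.id in BROAD_NAMES:
            # A broad handler that re-raises is not hiding the failure, so skip it.
            reraises = any(isinstance(inner, ast.Raise)
                           for stmt in node.body
                           for inner in ast.walk(stmt))
            if not reraises:
                findings.append((node.lineno, f"swallows {node.type.id}"))
    return sorted(findings)


# Hypothetical code under audit.
SAMPLE = '''
def load(path):
    try:
        return open(path).read()
    except Exception:
        return ""

def parse(text):
    try:
        return int(text)
    except ValueError:
        return None

def run(job):
    try:
        job()
    except:
        pass
'''

findings = find_broad_handlers(SAMPLE)
for line, reason in findings:
    print(f"line {line}: {reason}")
```

The narrow `except ValueError` in `parse` is left alone; only the handlers in `load` and `run` are reported. A real audit would likely also need to handle tuples of exception types, which this sketch ignores.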
EP 02 · MAY 2026 · 04 GUESTS · RUNTIME 2H 14M

BUILDING AGENTS THAT IMPROVE THE WORKFLOW

Prompt refinement, eval-driven charts, human-in-the-loop EDA, and local-first inference.

EP 02 · BUILDING AGENTS THAT IMPROVE THE WORKFLOW - prompt refinement, eval-driven charts, EDA, and local-first inference
▣ HIGHLIGHT REEL
01
Interview intent first, then generate risky variants and score them against a rubric written before the run. · Hilary Mason
01:01:00
02
Drop the agent inside a live Marimo kernel, so plots, widgets, markdown, and corrections happen in one reactive notebook. · Eric Ma
00:11:57
03
agentic-eda workflow
The human chooses the next scientific question; the agent renders evidence fast enough to keep exploration one plot at a time. · Eric Ma
00:23:27
04
Turn every failed chart eval into a library feature, so the package cannot regress on a case it already learned. · Bryan Bischof
01:25:11
05
Schedule three bad-idea personas to pitch, critique, and write moonshot docs no product roadmap would allow. · Hilary Mason
01:14:20
06
Default local: fast Qwen on a laptop, private workflows, offline flights, and cloud calls only for named exceptions. · Tomasz Tunguz
02:07:42
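
Highlight 04 above describes a package that cannot regress on an eval case it has already learned. A minimal sketch of that ratchet in Python is shown here. It is an assumption about how such a guard could work, not Bryan Bischof's library: the `ratchet` function, the toy `bar_spec` chart builder, and the case names are hypothetical.

```python
from typing import Callable, Dict, List, Set

# An eval case is a zero-argument check that returns True when it passes.
EvalCase = Callable[[], bool]


def ratchet(cases: Dict[str, EvalCase], passed_before: Set[str]) -> Set[str]:
    """Run every case; fail loudly if a previously passing case no longer passes."""
    now_passing = {name for name, case in cases.items() if case()}
    regressions = sorted(passed_before - now_passing)
    if regressions:
        raise AssertionError(f"regressed on: {regressions}")
    # Cases that pass for the first time join the protected set.
    return passed_before | now_passing


def bar_spec(values: List[float]) -> dict:
    """Toy chart builder (hypothetical) that always starts the y-axis at zero."""
    return {"mark": "bar", "y_min": 0, "y_max": max(values)}


cases = {
    "bar_axis_starts_at_zero": lambda: bar_spec([3, 5])["y_min"] == 0,
    "bar_has_title": lambda: "title" in bar_spec([3, 5]),  # not yet supported
}

protected = ratchet(cases, set())
print(sorted(protected))

# Simulate a later edit that breaks the protected behaviour.
broken_cases = dict(cases, bar_axis_starts_at_zero=lambda: False)
try:
    ratchet(broken_cases, protected)
    regression_caught = False
except AssertionError:
    regression_caught = True
print(regression_caught)
```

A case that has never passed (`bar_has_title`) does not block anything; it only becomes binding once a feature makes it pass. The protected set would need to be persisted between runs, for example in a file checked into the repository.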
EP 01 · APR 2026 · 03 GUESTS · RUNTIME 1H 32M

THE AGENTIC SOFTWARE FACTORY

RoboRev, agent memory, personal commands, and LLM-as-judge chart checks.

EP 01 · THE AGENTIC SOFTWARE FACTORY - RoboRev, agent memory, personal commands, and LLM-as-judge checks
▣ HIGHLIGHT REEL
01
explain skill
When ten agents are running, each one explains the change like a colleague, not a diff bot. · Jeremiah Lowin
00:46:14
02
A tiny etiquette layer for OSS maintenance: reject clearly, sound human, and stop wrapping "no" in fake praise. · Jeremiah Lowin
00:54:08
03
ship-it skill
One phrase, one override: in Jeremiah's world "ship it" means open the PR, never merge it. · Jeremiah Lowin
00:54:52
04
Run search, chart variants, linting, and an LLM-as-judge Tufte test until the graphic actually carries the story. · Randy Olson
01:12:37
05
The show's own production skill: turn guest photos into pixel art, animate them, and feed the retro livestream system. · Show Us Your Agent Skills
ep 01
06
Scale agentic engineering with commits every turn, RoboRev reading every line, and a review queue agents must drain. · Wes McKinney
00:27:14
07
second-brain workflow
Feed daily voice memos into editable agent memory, turning personal context into a substrate future sessions can use. · Jeremiah Lowin
00:35:50

GUEST DOSSIERS

one page per builder: segment video, field notes, artifacts, and the workflow they showed

SELECTED SKILLS & WORKFLOWS

a selection from the companion repo · hugobowne/show-us-your-agent-skills ↗

A selection of skills and workflows from the streams, packaged into the companion repo.

$ npx skills add https://github.com/hugobowne/show-us-your-agent-skills
skill
explain
EP 01
Agent narrates what it just did, like a teammate handing off.
Jeremiah Lowin · Prefect / FastMCP ↗ youtube · 00:46:14
skill
Replies to GitHub contributors in your voice. No "great work, but rejected" sandwiches.
Jeremiah Lowin · Prefect / FastMCP ↗ youtube · 00:54:08
skill
ship-it
EP 01
Re-trains "ship it" to mean open a PR, not merge.
Jeremiah Lowin · Prefect / FastMCP ↗ youtube · 00:54:52
skill
One-line idea → Tufte-style chart with an LLM-as-judge verifier loop.
Randy Olson · Goodeye Labs · r/dataisbeautiful ↗ youtube · 01:12:37
skill
Turns guest headshots into 8-bit pixel-art video clips for livestream intros and cutaways.
Show Us Your Agent Skills ↗ ep 01 on youtube
skill
Interview intent, ask for three variations at different magnitudes, score against a rubric.
Hilary Mason · Hidden Door ↗ youtube · 01:01:00
skill
Agent drives a reactive Marimo notebook through a bash bridge into the Python kernel.
Eric Ma · Moderna ↗ youtube · 00:11:57
workflow
Human-in-the-loop EDA. Every claim backed by an artifact.
Eric Ma · Moderna ↗ youtube · 00:23:27
workflow
Build an agent-facing chart library by generalising eval failures into features; the package can never regress on an eval it once passed.
Bryan Bischof · Theory Ventures ↗ youtube · 01:25:11
workflow
Three agent personas pull from a bad-ideas backlog, pitch and critique each other, and write design docs for moonshots no roadmap would schedule.
Hilary Mason · Hidden Door ↗ youtube · 01:14:20

YOUR HOSTS

two builders · two hosts · on a mission to find out what people at the top of the game are actually doing
Hugo Bowne-Anderson
host · vanishing gradients
AI builder, consultant, educator of 6+ million students; ex-Yale, ex-Max Planck.
Thomas Wiecki
host · pymc labs
Co-creator of PyMC. Founder of PyMC Labs. Has built Bayesian models for hedge funds, Fortune 500s, and indie tinkerers for over a decade.
DON'T MISS THE NEXT EPISODE.

Episode announcements, the workflows we cut for time, and what long-time builders are actually doing with agents. Free, no spam.

▶ SUBSCRIBE ON SUBSTACK ↗

Doing something weird with agents? Nominate yourself or someone else ↗