The "confident idiot" problem (News)

← Back to Podcasts

Podcast Episode

The "confident idiot" problem (News)

Show: The Changelog: Software Development, Open Source

ai-ml intermediate • 7 minutes • Episode page ↗

Published: 2025-12-08 • Added: 2026-01-04 • Transcript: rss

Key Insights

Jerod reminds listeners that the final call for “State of the Log” voicemail submissions is open now, giving producers a week to send in recordings before BMC works on the remixes.
He highlights the “confident idiot” problem in AI: using one LLM to grade or validate another (e.g., GPT‑4o grading GPT‑3.5) creates a circular dependency that can amplify sycophancy and hallucinations rather than reduce them.
The discussion argues that fixing probabilistic AI outputs with more probability—“vibe checks” or layered model judgments—is a losing strategy because the same underlying flaws propagate through the chain.
As a counter‑measure, the Steer SDK (an open‑source Python library) is introduced to treat LLMs as software components governed by explicit, hard‑coded rules instead of treating them as magical black boxes.

Episode Sections

00:00:00 The Confident Idiot Problem - Jerod warns that using LLMs to audit each other creates a circular, sycophantic loop and argues AI systems need strict, hard‑coded rules rather than vague “vibe checks.”
00:03:01 Claude Code Stumbles on SpaceJam - A discussion highlights how Claude Code (Opus 4.1) repeatedly fails to accurately recreate the 1996 Space Jam website, exposing limitations and trust issues in AI‑driven frontend development.
00:06:15 Reviving Lost Services, Gaming Linux Rise - The hosts lament Google's discontinued tools and celebrate Bazzite, a Fedora‑based distro that’s poised to accelerate mainstream Linux gaming.

Transcript

# The "confident idiot" problem (News) **Show:** [The Changelog: Software Development, Open Source](https://changelog.com/podcast) **Published:** 2025-12-08 **Duration:** 7 minutes **Audio:** [Listen](https://op3.dev/e/https://cdn.changelog.com/uploads/news/173/changelog-news-173.mp3) ## Summary - Jerod reminds listeners that the final call for “State of the Log” voicemail submissions is open now, giving producers a week to send in recordings before BMC works on the remixes. - He highlights the “confident idiot” problem in AI: using one LLM to grade or validate another (e.g., GPT‑4o grading GPT‑3.5) creates a circular dependency that can amplify sycophancy and hallucinations rather than reduce them. - The discussion argues that fixing probabilistic AI outputs with more probability—“vibe checks” or layered model judgments—is a losing strategy because the same underlying flaws propagate through the chain. - As a counter‑measure, the Steer SDK (an open‑source Python library) is introduced to treat LLMs as software components governed by explicit, hard‑coded rules instead of treating them as magical black boxes. ## Sections - **[0:00]** The Confident Idiot Problem - Jerod warns that using LLMs to audit each other creates a circular, sycophantic loop and argues AI systems need strict, hard‑coded rules rather than vague “vibe checks.” - **[3:01]** Claude Code Stumbles on SpaceJam - A discussion highlights how Claude Code (Opus 4.1) repeatedly fails to accurately recreate the 1996 Space Jam website, exposing limitations and trust issues in AI‑driven frontend development. - **[6:15]** Reviving Lost Services, Gaming Linux Rise - The hosts lament Google's discontinued tools and celebrate Bazzite, a Fedora‑based distro that’s poised to accelerate mainstream Linux gaming. ## Full Transcript

0:00 Transcript for Changelog News #173 Jerod Santo:

What up, nerds? 0:11 I'm Jerod and this is Changelog News for the week of Monday, December 8th, 2025. 0:17 We're quickly approaching last call for [state of the "log"](https://changelog.com/topic/sotl) voicemails! 0:21 We record in a week and have to give BMC time to make the remixes, so if you're thinking about [sending one](https://changelog.fm/sotl) in (you should), now's the best time! 0:33 Submit yours today at changelog.fm/sotl Ok, let's get into this week's news.

Break:

Jerod Santo:

[The "confident idiot" problem](https://steerlabs.substack.com/p/confident-idiot-problem) _Or, "Why AI needs hard rules, not vibe checks"_ If you've been following the *how-do-we-actually-use-ai-in-production* conversation stream, you've probably heard people propose a strategy where one LLM checks another LLM's results. 0:53 But will that work? 0:55 > We are told to ask GPT-4o to grade GPT-3.5. 0:59 We are told to fix the “vibes.” > > But this creates a dangerous circular dependency. 1:05 If the underlying models suffer from sycophancy (agreeing with the user) or hallucination, a Judge model often hallucinates a passing grade. 1:14 > > We are trying to fix probability with more probability. 1:18 That is a losing game. 1:20 One possible way of dealing with these "confident idiots" we've introduced into our software stacks the last few years is to "stop treating agents like magic boxes and start treating them like software." Hence, the [Steer SDK](https://github.com/imtt-dev/steer) was created. 1:36 > Steer is an open-source Python library that intercepts agent failures (hallucinations, bad JSON, PII leaks) and allows you to inject fixes via a local dashboard without changing your code. 1:48 Another way of dealing with these "confident idiots" in our software stacks... 1:52 remove them. 1:53 If that's possible...

Break:

Jerod Santo:

[Bun is joining Anthropic](https://bun.com/blog/bun-joins-anthropic) The company behind Bun, which is the open source runtime for Claude Code, is joining Anthropic. 2:04 We discussed the big acqui(sition|hire) on [last week's Friends](https://changelog.am/120), but at the time I hadn't quite considered this move and how contrary it is to Anthropic's party line that AI agents are replacing software engineers. 2:18 From Anthropic's announcement: > We’ve been a close partner of Bun for many months. 2:24 Our collaboration has been central to the rapid execution of the Claude Code team, and it directly drove the recent launch of Claude Code’s native installer. 2:34 We know the Bun team is building from the same vantage point that we do at Anthropic, with a focus on rethinking the developer experience and building innovative, useful products. 2:46 Bun is open source. 2:48 Why not just fork it and have a Claude Code powered engineer make all the necessary changes/upgrades to the runtime that Anthropic needs? 2:57 Because there's no getting there from here. 3:00 At least not yet. 3:01 Jarred Sumner and the Bun team's *expertise* is what's so valuable. 3:06 Even to Anthropic.

Break:

Jerod Santo:

[Claude can't recreate classic Space Jam site](https://j0nah.com/i-failed-to-recreate-the-1996-space-jam-website-with-claude) Jonah Glover tried to recreate everyone's favorite [1996 website](https://www.spacejam.com/1996/) by giving Claude Code (running Opus 4.1) a screenshot of the site and all the associated assets. 3:22 It failed (repeatedly) in all the ways I would expect from my own frontend / design attempts with the tool. 3:30 Jonah's finding, which is quite relatable: > Once Claude's version existed, every grid overlay, every comparison step, every "precise" adjustment was anchored to his layout, not the real one. 3:41 At the end of all this, I'm left with the irritating fact that, like many engineers, he's wrong and he thinks he's right. 3:50 > > What this teaches me is that Claude is actually kind of a liar, or at least Claude is confused. 3:59 However, for the drama, I'll assume Claude is a liar. 4:03 I've been giving Claude Code a lot of props lately, but I've also been giving it a lot of tasks it just can't quite accomplish. 4:13 This process starts off as fun and interesting, but each time it ends in failure I am perplexed by all the possible failure paths. 4:22 Was it me and my prompting? 4:25 Was it the agent? 4:26 Was it the model? 4:28 Or perhaps I'm asking for things that aren't easily accomplished with today's tech? 4:33 (I can be quite demanding.) This makes me yearn for the days when the only one to blame for my failures was me...

Break:

Jerod Santo:

It's now time for sponsored news! 4:46 [Depot's Advent of Code 2025](https://depot.dev/events/advent-of-code-2025) Depot is running a community leaderboard for Advent of Code 2025 and they're **putting real money behind it**. 4:56 The top five finishers each direct $1,000 to a registered charity of their choice. 5:01 If you pick a charity supporting STEM education or the developer ecosystem, Depot adds a 50% bonus. 5:08 They've already generated $7,500 in donations. 5:10 The format: 12 days of puzzles, unlocking daily at midnight EST starting December 1st. 5:16 Solve at your own pace. 5:18 There's no time limit. 5:20 Any language, any skill level. 5:22 Each day brings a two-part programming challenge from Eric Wastl's Advent of Code. 5:27 To join Depot's private leaderboard, request access on their events page. 5:31 They'll send you a code. 5:33 Whether you're competing for the top 5 or just want to sharpen your skills alongside other devs, it's a good excuse to write some code this month. 5:44 Check it out at [depot.dev/events/advent-of-code-2025](https://depot.dev/events/advent-of-code-2025) or just follow the link in the newsletter and your chapter data

Break:

Jerod Santo:

[Google *unkills* JPEG XL?](https://tonisagrista.com/blog/2025/google-unkills-jpegxl/) > In a dramatic turn of events, the Chromium team has reversed its "Obsolete" tag, and has decided to support the format in Blink (the engine behind Chrome/Chromium/Edge). 6:05 Given Chrome’s position in the browser market share, I predict the format will become a de factor standard for images in the near future. 6:15 We're used to things being [killed by Google](https://killedbygoogle.com)... 6:18 but *unkilled*?! 6:19 This is a trend I can get behind. 6:22 Unkill requests! 6:23 It's time to bring back [Zeitgeist](https://web.archive.org/web/20210312230138/https://www.lifewire.com/google-zeitgeist-3481903), [Dodgeball](https://en.wikipedia.org/wiki/Dodgeball_(service)), and [Google Reader](https://en.wikipedia.org/wiki/Google_Reader)...

Break:

Jerod Santo:

[The next generation of Linux gaming](https://bazzite.gg/) If the mythical "Year of the Linux Desktop" is ever to materialize, it will first be preceded by a sea change in gaming options for the venerable open source OS. 6:43 The gaming sea change appears to be in full swing, with Steam on Linux hitting an all-time high over 3% usage [last month](https://www.phoronix.com/news/Steam-Linux-November-2025). 6:52 Enter Bazzite, a Fedora-based Linux distro that's hyper-focused on making gaming awesome: > Bazzite is designed for Linux newcomers and enthusiasts alike with Steam pre-installed, HDR & VRR support, improved CPU schedulers for responsive gameplay, and numerous community-developed tools and tweaks to streamline your gaming and streaming experience. 7:11 The project began back in 2023, but it appears to be maturing and aiming at sustainability by setting up ways to donate with its [latest update](https://universal-blue.discourse.group/t/bazzite-fall-update-fedora-43-xbox-allies-legion-go-2-nvidia-gtx/10948): > As Bazzite matures, we begin to tackle more ambitious projects, such as proper secure boot, support for more handheld devices, and conference attendance, which means more costs for us. 7:34 And we would gladly appreciate the help in covering them!

Break:

Jerod Santo:

That's the news for now, but go and subscribe to the Changelog Newsletter for the full scoop of links worth clicking on. 7:48 Such as: - [Why I ignore the spotlight as a staff engineer](https://lalitm.com/software-engineering-outside-the-spotlight/) - [Vanilla CSS is all you need](https://www.zolkos.com/2025/12/03/vanilla-css-is-all-you-need) - [What happens when you take an XKCD joke too literally](https://stacktower.io) Get in on the newsletter at changelog.news Have a great week! 8:04 Like, subscribe, and leave us a 5-star review if you dig the show, and I'll talk to you again real soon.

--- *Topics: ai-ml* *Format: news* *Difficulty: intermediate*