Learning Library

← Back to Library

AI‑Driven Interactive Decision Instruments

16m • Unknown Channel • ai-ml • deep-dive • advanced • Watch on YouTube ↗

Key Points

The workplace is transitioning to a new “operating surface” where AI tools like ChatGPT‑5, Claude, and Gemini turn traditional documents, spreadsheets, and slides into interactive, decision‑making artifacts.
The biggest bottleneck in modern companies is not generating ideas but proving and executing decisions, which AI‑enhanced interactive artifacts can streamline by making decisions auditable, executable, and rapid.
These AI‑driven artifacts act as “front‑end instruments” that combine simple inputs, UI elements, tests, and approvals, replacing multiple meetings and decks with a single, tweakable surface.
The shift is enabled by easy distribution of single‑file canvases, low cost of AI services already bundled in employee plans, and built‑in governance features that capture and lock decision logic.
While such instruments won’t replace large‑scale, highly refined processes like Amazon’s weekly business reviews, they are powerful for most teams, allowing rapid creation, reuse, and remixing of decision‑making tools.

Sections

Full Transcript

# AI‑Driven Interactive Decision Instruments **Source:** [https://www.youtube.com/watch?v=SLYKFHtKR90](https://www.youtube.com/watch?v=SLYKFHtKR90) **Duration:** 00:16:20 ## Summary - The workplace is transitioning to a new “operating surface” where AI tools like ChatGPT‑5, Claude, and Gemini turn traditional documents, spreadsheets, and slides into interactive, decision‑making artifacts. - The biggest bottleneck in modern companies is not generating ideas but proving and executing decisions, which AI‑enhanced interactive artifacts can streamline by making decisions auditable, executable, and rapid. - These AI‑driven artifacts act as “front‑end instruments” that combine simple inputs, UI elements, tests, and approvals, replacing multiple meetings and decks with a single, tweakable surface. - The shift is enabled by easy distribution of single‑file canvases, low cost of AI services already bundled in employee plans, and built‑in governance features that capture and lock decision logic. - While such instruments won’t replace large‑scale, highly refined processes like Amazon’s weekly business reviews, they are powerful for most teams, allowing rapid creation, reuse, and remixing of decision‑making tools. ## Sections - [00:00:00](https://www.youtube.com/watch?v=SLYKFHtKR90&t=0s) **From Docs to Interactive AI Artifacts** - The speaker argues that modern workplaces are shifting from static documents and spreadsheets to AI‑driven, editable, and executable front‑end artifacts—enabled by models like ChatGPT‑5, Claude, and Gemini—that turn decisions into fast, auditable instruments rather than mere paperwork. - [00:03:18](https://www.youtube.com/watch?v=SLYKFHtKR90&t=198s) **Transforming Static Deliverables into Instruments** - The speaker urges replacing slow, document‑centric processes with programmable, auditable “instruments” that bundle explicit schemas, logic, UI controls, testing, and change tracking to accelerate decision‑making and boost team trust. - [00:06:23](https://www.youtube.com/watch?v=SLYKFHtKR90&t=383s) **From Static Docs to Live Artifacts** - The speaker explains that while implementing an access‑review runner is technically straightforward, the real difficulty lies in cultural resistance and discipline needed to adopt interactive, continuously updated artifacts instead of traditional static documents. - [00:09:48](https://www.youtube.com/watch?v=SLYKFHtKR90&t=588s) **Shift Incentives to Decision Instruments** - The speaker proposes rewarding employees for creating and using dynamic, gate‑driven “instrument” blocks that speed decision‑making—altering performance reviews and promotion criteria—while pointing out that tool makers like Notion are already rebranding from static document editors to execution surfaces. - [00:13:52](https://www.youtube.com/watch?v=SLYKFHtKR90&t=832s) **Shift from Authoring to Runtime Value** - The speaker highlights a paradigm shift where the worth of AI-driven artifacts is realized during execution rather than creation, urging teams to experiment with and measure the adoption of AI “instruments” like GPT‑5 canvas in everyday workflows. ## Full Transcript

0:00We are moving to a new operating surface 0:02at work. It is not just a function of 0:05chat GPT5. I know I've talked a lot 0:08about chat GPT5 the last few days. 0:11That's because 700 million 800 million 0:13people have it and we all use it. Now I 0:15want to go beyond that. I want to talk 0:16about the idea that we have a new 0:18operating surface at work that yes is 0:20exemplified by chat GPT5 but is also 0:24going to be exemplified by Claude, by 0:26Gemini and others over the next few 0:28months. The stakes are really high. The 0:31real drag in modern companies is not 0:33creativity where AI has been attacking 0:36over the last two years. It is so easy 0:38now to get a hundred ideas, a thousand 0:40ideas. The real bottleneck is the cost 0:44of proving a decision. We do that in 0:47docs. We do that in spreadsheets. We do 0:50that in slides. And you know what's 0:52happening over and over now? Chats 0:53become docs. Chats become spreadsheets. 0:56Chats become slides. And the challenge 0:58is that we are still bolting on our old 1:02decisionmaking to this new way of 1:05working. I want to suggest to you that 1:07with chat GPT5, we crossed a Rubicon. We 1:11now have for the first time an 1:13incredibly easy way for anyone who is 1:15not a coder to create interactive 1:18artifacts that collapse that chain. 1:21Interactive artifacts that make 1:23decisions executable, auditable, and 1:26fast. My thesis is very simple. The unit 1:29of work is shifting from static 1:32deliverables to instruments of work. 1:34Front-end artifacts that you can open 1:36and tweak and run. An instrument will 1:38couple some small typed inputs, a little 1:41UI, maybe some tests or an audit in very 1:45clean ways to look at the results. A 1:47good instrument will replace several 1:49meetings and a deck with one surface and 1:52a very quick decision. Why is this 1:54happening now? What makes this possible 1:56today? Number one, distribution. It is 1:59easy now to have single file canvases 2:02that can travel as easily as slides 2:04across internal tools. You can share 2:06that chat GPT5 canvas so easily. Claude 2:09does this too. You can share the cloud 2:10canvas really easily. There's no 2:12infrastructure to run that. Cost is also 2:15so cheap. If you have all of your 2:17employees on a chat plan anyway, and the 2:19canvas comes with it, it's essentially 2:21free. Governance is also easy because 2:24tests and approvals live in the 2:26instrument. You can actually capture 2:28what you did and then if you really want 2:30to lock it, just stick a screenshot of 2:32that UI and stick it somewhere. It's 2:35easy to log your decisions. The nice 2:37thing is you can also compound and 2:38remix. So these artifacts are not dead, 2:40they're living. You can remix a weekly 2:42business review artifact and make it 2:44better next time. You can reuse it. Now, 2:46I will add a caveat here. As someone who 2:49worked on weekly business reviews at 2:51Amazon, I am not trying to pretend that 2:54one artifact produced by one person in 2:57chat GPT replaces a weekly business 3:00review instrument that has been honed by 3:02business analysts over years for a team 3:06at Amazon scale. What I am saying is 3:08that most teams don't operate at that 3:10scale and most decisionmaking doesn't 3:13require that formula review process. 3:15There's a whole class of practical work 3:18done decisions that are right now being 3:20made very slowly with documents with 3:22artifacts. They don't need to be and 3:25that is the new class of work that I'm 3:28talking about when I talk about this 3:29motion from static deliverables to 3:32instruments. So what is an instrument? 3:33It has an input, a very explicit schema 3:36and sample fixtures. It has logic 3:38functions that you can read and test. 3:40The code is visible. Edge cases are 3:42declared. It has a UI, a display first 3:45scoreboard that has a few knobs you can 3:48touch and dial. It has tests, so gates 3:51are at the top and if it doesn't work, 3:53it doesn't run. It has an audit encoded 3:55in it and ideally it will have an 3:57export. The key is making sure that you 4:00take those instrument components 4:02seriously. For example, if you have 4:04inputs, logic, and UI, but you have no 4:06audit trail, you can't really see what 4:07changed when people started to mess with 4:09your dashboard. And in the course of a 4:11meeting there can be a lot of changes 4:12and adjustments. So I want to suggest to 4:14you that the strategic impact of getting 4:17this this work done is really 4:20understated because we haven't really 4:22lived it yet. The real challenge here is 4:25actually moving from a world where we 4:27have high latency and high friction and 4:30low trust for a lot of work even among 4:33good functioning teams because people 4:35can't remember the slack and they've 4:37been trained over and over again not to 4:38just trust the meeting but to get stuff 4:40into a doc. get to a world where you 4:43have more leverage, where trust 4:45increases because you can actually see 4:47it in the interactive artifact, where 4:50you can just generate free evidence by 4:52just running the artifact again with new 4:54data points. You want to get to a point 4:57where you have a portfolio of artifacts 4:59that replaces your portfolio of 5:01PowerPoint decks, something that lets 5:03you run the business with artifacts. 5:06Now, this is not going to work for 5:07everybody. Again, I've said it before. 5:10This doesn't replace BAS at scale. This 5:12doesn't replace excellent SAS tools for 5:14scaled up companies, but it does allow 5:17you get a lot of practical work done. 5:19I've created a dozen instruments to get 5:21you started. And they're designed to 5:23work together as a coherent operating 5:25system for smallcale teams and something 5:27that gives large- scale teams ideas 5:29about how to run fast for those in 5:31between the cracks where the real work 5:33gets done stuff. Let me lay them out for 5:36you briefly and then you'll get like 5:38full prompts for them in the substack. 5:40Run the business. You get a WBR 5:42scorecard and you get a data quality 5:44sentinel. Something that like helps you 5:46obsess over your data quality and check 5:48your data quality where it matters. For 5:50shipping decisions, I've got an 5:51experiment decision pad and I've got a 5:53launch gate for you. For reliability, 5:55I've got an incident commander dash and 5:57I've got so radar. Again, these are very 6:00configurable, right? You can adjust the 6:02prompts the way you want. You don't like 6:03my WBR? Don't use my WBR. Use the prompt 6:06and make it yours. That's the whole 6:07point. Revenue and risk. You have deals. 6:10You have contract risk triage. These are 6:12things you can actually put into on the 6:14sales side. Customers, you have customer 6:16health triage. You have a pricing and 6:18mix simulator for your people side. You 6:20can get a hiring and funnel health one. 6:22You can get an access review runner. And 6:23this is just the beginning. These are 6:25designed to form an ops operating system 6:28from the get-go and a loop you can put 6:30in anywhere. But this is just the 6:32beginning of getting your head around 6:34the idea that you have instruments now 6:35and not static artifacts. I want to 6:38suggest to you that there are real 6:41challenges with rolling this out that 6:43are not technical. In fact, the 6:45technical piece is mostly done. It is 6:46relatively easy to get these to start to 6:49work. Instead, the risks show up in 6:51culture terms. People overtrust these, 6:54right? Like sometimes they will have 6:55shallow data in here and they will not 6:58show their thresholds and they will not 7:00pay too much attention to the artifact 7:02because they're not used to it and they 7:04will allow a lot of sprawl. They'll 7:06remix the artifacts in 50 different 7:08versions. This requires discipline just 7:10like having a document standard requires 7:12discipline. It's not like it's a free 7:14ticket and you don't have to 7:16administrate the artifact. The power 7:18lies in the fact that it collapses a 7:20bunch of other work into one clean 7:22interactive artifact that makes 7:23decisionmaking faster. So the work 7:25really changes as a result and you have 7:27to be ready to impose that on the 7:29culture as a leader. So meetings can run 7:32inside the artifact. You can record 7:34them, use granola, use otter, use 7:36whatever your recording uh AI is of 7:38choice and then you have the record of 7:40the meeting. You have the artifact 7:42itself and you're done. That's the whole 7:44thing. You need to get to a point where 7:45you are encouraging people to test 7:47artifacts, version them, and make sure 7:50that people are using artifacts 7:53appropriately in appropriate context and 7:55not just overusing them in ways that 7:57aren't helpful. You don't want 16 7:59different versions of the same meetings 8:00artifact running around because then 8:02people will not trust it. And so there's 8:04a certain level of experimentation you 8:06want to encourage when you're trying to 8:08get an artifact to gel. And then you 8:10need to converge and actually pick the 8:12winner and anoint that and standardize 8:13it and discourage further 8:15experimentation while you focus on other 8:17parts of the workflow you want to align 8:18out. Part of how you do that is by 8:20challenging the operators, the runners 8:23of the business to own the outcomes 8:25associated with these artifacts. If you 8:26have someone in sales who is running the 8:29meeting that the artifact is associated 8:32with, they own the artifact. If you have 8:34someone in legal, they own the artifact 8:36for the legal review. You get the idea. 8:38The key here is that you have artifacts 8:41that tie to consistent organizational 8:44patterns, to product launches, to 8:45incidents, to pricing, to access to 8:47hiring, to weekly business reviews. And 8:50you keep them as consistent and 8:51versioned as possible. And yes, for 8:53bigger teams, for bigger orgs, of 8:55course, you're going to have like, you 8:56know, the weekly business review version 8:59for this team versus that team because 9:01they're like 50 person teams and they 9:03have different business units and they 9:04have different metrics. I get that. 9:06That's why prompts are easy to mix 9:08together. The point though is that you 9:10want to manage that cadence of 9:11evolution. You want to manage who owns 9:13it. You want to assign ownership just 9:15like you do with other good culture 9:16changes. And you want to map the 9:18instruments to the meeting cadence in a 9:20way that gives everyone predictability. 9:22That's what builds trust. That's what 9:24builds trust. I would suggest if you 9:26want to move this way that you stand up 9:29an instrument studio, a place to 9:31maintain schemas, tests, export 9:34standards, what counts as good that you 9:36assign a bar raiser to review the 9:40prompts that are used for any new 9:41version so that people maintain those 9:44standards and you get better over time. 9:46And I want you to challenge people and 9:48change the incentive to reward 9:50decisioning that ships through gates, 9:52not through decks or through docks. You 9:55want people to start to think in terms 9:57of how they can accelerate decisioning 9:59and how they can leverage instruments 10:01instead of flat docks to do so. So 10:04change your incentives. Maybe change how 10:06you do performance reviews. This could 10:07get as far as looking at promotion 10:09readiness is looking at whether someone 10:10can articulate and define a new artifact 10:15in a way that's useful for their team. 10:17Just giving you an example there. I want 10:19to close with one other piece. We've 10:22talked a lot about how this changes 10:23things for business runners, right? 10:25People who run the business, operators. 10:27What happens for tool builders, people 10:29who build Word, Docs, Notion, Sheets, 10:32people who are essentially building the 10:33static docs of the past. I want to 10:35suggest to you that Notion's already 10:37aware of this and others are too because 10:39they know the product is no longer a 10:41document editor. It's an execution 10:43surface. That's why Notion's homepage is 10:45what do you want to make today? There's 10:47going to be a ton of competition for 10:49this space. And as a builder of a 10:51business, you are going to be spoiled 10:53for riches in how you actually convert 10:56your team over to instruments. If you're 10:59in the tool building business and you're 11:01used to static docs, one of the really 11:03interesting ways you can involve here is 11:05choose to ship primitives for 11:07instruments. Shipped inputs as blocks, 11:09logic blocks, tests and gates, stable 11:12exports, things that people need to 11:15compose instruments for various 11:17workflows. I think that's a really 11:19interesting opportunity I haven't seen 11:20anybody fully grasp yet. You want to be 11:22in a place where you can have somewhat 11:25opinionated building blocks if you're in 11:27the document creation space because if 11:30you just have free text, people will 11:32abuse that. Whereas if you have inputs 11:34with regular schemas and people can 11:36choose those, you're going to get much 11:37more useful downstream blocks. Give 11:40people helpful limitations to help them 11:43build composable instruments. And so 11:45part of what I'm doing when I suggest 11:47the prompts down in the substack is I'm 11:49trying to hold to and key to an 11:52opinionated schema, a schema you can 11:54stick with over time because you need 11:57that consistency to build useful 11:59artifacts. One of the interesting 12:01implications of all of this is that we 12:03are moving to a world where policy is 12:05code. So a business rule is literally 12:08encoded in Typescript somewhere or a 12:10business rule is encoded in an artifact 12:12somewhere. approvals happen on the 12:15surface. We don't really have this yet, 12:18but we want to get to a world where we 12:20have lightweight e signatures and not 12:22just a buttonclick go no-go. Right now, 12:25it's just going to be a buttonclick go 12:26no-go that's encoded in the artifact and 12:28you screenshot it. That's fine for 12:30getting started, getting into the 12:32instrument world. I want to see a world 12:34where we actually have artifacts that 12:35start to evolve into those mini 12:38applications. And I expect to see that 12:40over the next 6 months to a year. One of 12:42the keys is making sure that these are 12:44everywhere. You can link them if it's 12:47Claude Claude and and the artifacts or 12:49if it's chat GPT and the canvas, you can 12:52link them anywhere. They're public 12:53facing links. So, link them in the 12:55invites. Link them in the chat. Link 12:57them in the issue. Make sure that these 13:00are interoperable and everyone sees 13:02them. And make sure, and I'm just going 13:04to advise this because I've seen this 13:05happen. Make sure that you do have those 13:07screenshots because people can go in, 13:09click on them, and change things 13:11afterward. right now and remix the 13:12artifacts, which is great if you're chat 13:14GPT and you're trying to encourage 13:16innovation, but it's perfect hell if you 13:18want to make sure you have a steady 13:19state. And so until these artifacts 13:21evolve with a little bit more miniapp 13:24internal checkpoints, you be the one 13:26that takes those screenshots and encodes 13:28this is what we talked about, this is 13:29what we decided. So you can always go 13:31back to that state relatively easily. In 13:34fact, at this point, honestly, you 13:35cannot just encode the screenshot. You 13:37can frankly grab a code snippet and it 13:39becomes an even more immutable record of 13:41what happened. I want to suggest to you 13:43that there are some business model 13:44shifts here. First, we are moving to a 13:46world where AI needs to be visible and 13:48governed from AI being in the shadows. 13:50That means models need to have authors 13:52for these artifacts. They need to have 13:54run summaries and audits. You need to be 13:56able to generate tests from the code 13:58that show what you did. We are at the 14:00beginning of this. you can use some 14:01fancy prompt work and you can get 14:03something like this going in GPT5 with 14:05canvas. There's going to be more. I also 14:07want to suggest to you that value is 14:09starting to acrue at runtime, not author 14:12time. That's a very profound shift. So 14:13think about it for a second. It's not 14:15the authoring of the artifact that 14:16matters the way authoring the PRD 14:18mattered when I came up as a product 14:20person. It is it is the way value occurs 14:23at the time you run the artifact and 14:25have the conversation. And so the value 14:27is in the active instrument itself. And 14:29there's sort of a profound implication 14:31there if you're building in the product 14:32space. One of the things that I want to 14:34suggest is that you lean into adoption 14:37of these instruments if you're doubtful 14:38but want to try and be willing to make 14:42the first step trivial and imperfect. 14:45Hey, let's replace the deck this week. 14:47It might not be perfect, but let's see 14:49how it goes and have the conversation. 14:51That's a two-way door. We can do that. 14:53See if you can start to measure the 14:54share of meetings that run on an 14:56instrument versus the share of meetings 14:58that run on something flat. See if you 15:00can start to optimize to support 15:03instruments being used more and more as 15:05you have internal AI teams that want to 15:07support you. This is so much more 15:09interesting work than just building the 15:12chatbot to talk with the HR policy 15:14manual, which for whatever reason seems 15:16to be the default thing that people who 15:19greenlight AI teams internally always do 15:22first. No, do something interesting like 15:25this that actually accelerates the 15:26business. Think about AI as a outcome 15:30driver directly, not just a chatbot. 15:33Move the center of gravity from defining 15:35a narrative into execution. That's what 15:38these instruments do. Instruments don't 15:40actually kill documents. They just they 15:43demote them, right? Docs will capture 15:45the narrative, the context, the story. 15:46You're going to turn the meeting minutes 15:48automatically into a document and a 15:50story. Instruments are what give you the 15:53decision and capture the record and then 15:56you can move on. Instruments are what 15:57let you go faster. And we've never 15:59really had that before. So my challenge 16:01to you is figure out how you can ship 16:03higher quality decisions and prove that 16:06they're better quality. And that's 16:08exactly what I focused on with these 12 16:10prompts to build these 12 instruments. 16:12Better quality decisioning, fewer slide 16:14decks. Welcome to a world where we have 16:16a new way of working. We're all going to 16:18learn about it