GenAI PM
concept3 mentions· Updated Feb 23, 2026

red/green TDD

A test-driven development pattern adapted for coding agents. It emphasizes an iterative failure/success loop that can make agentic coding more reliable.

Key Highlights

  • red/green TDD adapts classic test-driven development into a workflow that makes coding agents validate their output step by step.
  • The pattern asks agents to write tests first, confirm failure, implement code, and rerun tests until they pass.
  • For AI PMs, it offers a practical way to turn vague feature requests into measurable acceptance criteria.
  • Newsletter mentions tied the rise of red/green TDD to the broader inflection point in AI coding during late 2025 and early 2026.

red/green TDD

Overview

red/green TDD is a test-driven development pattern adapted for coding agents. In this workflow, an agent writes tests first, runs them to confirm they fail (the “red” phase), then implements code and reruns the tests until they pass (the “green” phase). In the AI coding context, this creates a disciplined feedback loop that helps agents stay grounded in explicit requirements instead of generating code that only appears correct.

For AI Product Managers, red/green TDD matters because it turns agentic coding into a more measurable and reliable process. Rather than asking an agent to “build the feature” in one step, teams can structure work around verifiable outcomes, reducing silent failures and making iteration easier to supervise. It has emerged as part of a broader set of agentic engineering patterns associated with modern coding agents and lightweight project scaffolding such as “thin templates.”

Key Developments

  • 2026-02-23: Simon Willison’s Agentic Engineering Patterns was highlighted as a hub for building and operating agentic systems, with red/green TDD called out as one of the coding-agent patterns it collects.
  • 2026-04-03: A newsletter mention explained the operational prompt behind red/green TDD: instruct agents to write tests first, run them to confirm failure, implement the code, and rerun tests to confirm success. This framed the pattern as a practical workflow for tools like Claude Code and GPT-5.4.
  • 2026-04-04: Lenny Rachitsky shared Simon Willison’s view that late 2025 marked an inflection point for AI coding, with autonomous coding agents, benchmark progress, and red/green TDD combined with “thin templates” helping make agentic software development more dependable.

Relevance to AI PMs

  • Specify work in acceptance-test form: PMs can translate feature requirements into concrete tests or failure conditions, giving coding agents a clearer target and reducing ambiguity in execution.
  • Improve reliability in agent workflows: Red/green TDD creates a built-in validation loop, which is useful when shipping AI-assisted code changes that need stronger guardrails than one-shot generation.
  • Measure agent performance more concretely: Teams can evaluate coding agents based on whether they produce failing tests first, close the failure gap, and reach a passing state consistently across tasks.

Related

  • Simon Willison: A key source associated with documenting agentic engineering patterns, including red/green TDD for coding agents.
  • Lenny Rachitsky: Amplified the idea by sharing Willison’s framing of the AI coding inflection point and the growing importance of this workflow.
  • Claude Code: One of the coding-agent tools mentioned in connection with using red/green TDD in practice.
  • GPT-5.4: Referenced as part of the new generation of coding agents capable of following red/green TDD workflows more reliably.
  • agentic-engineering-patterns: The broader pattern library where red/green TDD appears as a specific operational technique.
  • test-driven development / red green test-driven development / red-green TDD: Common aliases and adjacent formulations of the same concept, adapted here specifically for AI coding agents.

Newsletter Mentions (3)

2026-04-04
#8 𝕏 Lenny Rachitsky shares Simon Willison’s insight that November 2025 was the inflection point for AI coding, unleashing autonomous coding agents benchmarked by Pelican Benchmark and driving red/green TDD with “thin templates.

GenAI PM Daily April 04, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 17 insights for PM Builders, ranked by relevance from X, Blogs, and LinkedIn. Claude subscriptions will no longer cover usage on third-party tools like OpenClaw. #8 𝕏 Lenny Rachitsky shares Simon Willison’s insight that November 2025 was the inflection point for AI coding, unleashing autonomous coding agents benchmarked by Pelican Benchmark and driving red/green TDD with “thin templates.

2026-04-03
Invoking the prompt “red/green TDD” directs agents to write tests first, run them to confirm failure, implement the code, then rerun tests to confirm success.

▶️ Why AI came for coders first, automation timelines, and how we’re inside the AI inflection Lennys Podcast Simon Willison details agentic engineering patterns—using coding agents like Claude Code and GPT-5.4 for red/green TDD, thin project templates, and public GitHub hoarding—to boost software productivity and reliability. GPT-5.1 and Claude Opus 4.5 released in November 2025 advanced coding agents from “mostly working” to “almost always following instructions,” enabling engineers to churn out up to 10,000 lines of code per day. Invoking the prompt “red/green TDD” directs agents to write tests first, run them to confirm failure, implement the code, then rerun tests to confirm success. Willison’s GitHub repositories include simonw/tools with 193 HTML/JavaScript client-side utilities and simonw/ressearch with 75 AI-driven research projects to hoard reusable code experiments.

2026-02-23
#3 📝 Simon Willison Agentic Engineering Patterns - A guide collecting patterns for building and operating agentic systems. It serves as a hub for specific patterns such as red/green TDD for coding agents.

#3 📝 Simon Willison Agentic Engineering Patterns - A guide collecting patterns for building and operating agentic systems. It serves as a hub for specific patterns such as red/green TDD for coding agents. #5 📝 Simon Willison Research WebMCP + Chrome DevTools Protocol Demo - Demo of WebMCP, a proposed browser API for exposing structured, callable tools to AI agents, showing how to register and interact with WebMCP tools from a Python client over the Chrome DevTools Protocol.

Stay updated on red/green TDD

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free