tool10 mentions· Updated Jun 18, 2026

GPT 5.4

A GPT model variant used here for scientific reasoning and agentic chemistry experimentation. The newsletter frames it as a model capable of proposing experimental improvements and driving benchmarked workflows.

Key Highlights

GPT 5.4 is positioned as a long-context, tool-using model for coding, agents, and complex reasoning workflows.
Newsletter coverage ties GPT 5.4 to a 1M-token context window, auto-compaction, and stronger coding performance.
The model appears across products like Codex, ChatGPT, Amp Deep mode, and OpenClaw rather than only as a standalone chatbot.
A standout use case paired GPT-5.4 with Molecule.one’s Maria to improve medicinal chemistry reaction yields across 10,080 experiments.
For AI PMs, GPT 5.4 is most relevant as a workflow engine that should be evaluated on measurable task and product outcomes.

GPT 5.4

Overview

GPT 5.4 is a GPT model family from OpenAI that the newsletter repeatedly frames as a high-capability tool for coding, long-context reasoning, agentic workflows, and scientific experimentation. Across mentions, it appears not just as a chatbot model, but as an execution engine inside products and workflows such as Codex, Amp Deep mode, OpenClaw agents, ChatGPT, and chemistry experimentation systems. Reported capabilities include a 1M-token context window with auto-compaction, stronger tool use, faster coding performance, and variants like GPT-5.4 mini, nano, and Thinking.

For AI Product Managers, GPT 5.4 matters because it shows how frontier models are increasingly becoming workflow components rather than standalone interfaces. In the newsletter, it is used to generate product features from short specs, support red/green TDD engineering patterns, run role-based agents, and even propose chemistry experiment improvements that were validated in lab workflows. That makes GPT 5.4 especially relevant for PMs evaluating where long-context reasoning, tool-calling, coding automation, and agent orchestration can create measurable product or operational advantage.

Key Developments

2026-03-06: OpenAI introduced GPT-5.4, positioning it as a new model release with improved capabilities and broader applications over prior versions.
2026-03-07: Dharmesh Shah highlighted GPT 5.4’s launch details, including a 1M-token context window, auto-compaction, smarter tool-calling, and up to 1.5× faster coding performance. The same coverage also referenced GPT-5.4 Thinking and one-shot app-building examples via Codex.
2026-03-08: Dharmesh Shah described GPT 5.4 as strong for both product management reasoning and back-end architecture, emphasizing long-range execution and precision.
2026-03-09: GPT-5.4 appeared in roundup coverage as a newly shipped OpenAI model, reinforcing its status as a major platform release.
2026-03-18: OpenAI announced GPT-5.4 mini and GPT-5.4 nano, smaller variants designed for lower-latency and more efficient deployment while preserving advanced capabilities. Coverage noted use in ChatGPT, Codex, and the API, optimized for coding, computer use, multimodal understanding, and subagents.
2026-03-27: Amp added GPT-5.4 to its new Deep agent mode, tuning the model toward more Codex-like, long-form, code-focused reasoning and deeper planning.
2026-03-30: Claire Vo used GPT-5.4 alongside Opus-4.6 and Sonnet-4.6 within OpenClaw, configuring role-based agents connected to Telegram bots for business outreach and family operations automation.
2026-04-03: Simon Willison cited GPT-5.4 as part of modern agentic engineering workflows, especially for red/green TDD, reusable templates, and rapid software iteration.
2026-04-06: OpenAI’s Codex team demonstrated using GPT 5.4, Codex Spark, and the Codex app’s plan mode to generate and iterate product features from short specs, backed by an open-source Rust harness supporting parallel agent tasks.
2026-06-18: In a major scientific workflow example, GPT-5.4 paired with Molecule.one’s Maria proposed using TEMPO as an additive for Chan–Lam coupling optimization and generated experimental grids executed across 10,080 reactions. Reported outcomes included mean yield improvement from 16.6% to 25.2%, with broad substrate-level gains and human bench confirmation for most tested pairs.

Relevance to AI PMs

1. Evaluate where long-context + tool use unlock real workflows. GPT 5.4 is repeatedly associated with 1M-token context, auto-compaction, and strong tool-calling. PMs can use this as a benchmark for products involving large documentation sets, enterprise knowledge bases, codebases, or multi-step research tasks.

2. Design agentic product experiences, not just chat features. The newsletter shows GPT 5.4 embedded in Codex, Amp, OpenClaw, and scientific systems. For PMs, the tactical takeaway is to scope products around planning, tool execution, subagents, and iteration loops rather than a single conversational UI.

3. Measure success through workflow outcomes. GPT 5.4 is described in terms of throughput, coding speed, feature generation, and experimental yield improvement. AI PMs should define KPIs around task completion, quality lift, cycle time reduction, and benchmarked performance instead of relying only on subjective model preference.

OpenAI: Creator of GPT-5.4 and its mini/nano variants; central to the model family’s rollout and platform positioning.
ChatGPT / chatgpt: One of the surfaces where GPT-5.4 was reported as available, highlighting end-user access and product packaging.
Codex / openais-codex / codex-spark: Closely linked execution environments where GPT 5.4 was used for code generation, planning, and rapid iteration.
Amp: Integrated GPT-5.4 into Deep mode for more agentic, code-oriented reasoning.
OpenClaw: Used GPT-5.4 as one of several models powering role-based personal and business agents.
Claude Code, Claude Opus 4.5, Opus-4.6, Sonnet-4.6: Competing or complementary agentic coding models frequently mentioned alongside GPT-5.4 in engineering workflows.
Simon Willison: Referenced GPT-5.4 in practical agentic engineering patterns such as red/green TDD and reusable project scaffolds.
Dharmesh Shah: Early commentator emphasizing GPT 5.4’s PM, coding, and long-context strengths.
Molecule.one / Maria: Demonstrated GPT-5.4 in near-autonomous chemistry experimentation and benchmarked lab optimization.
LifeSciBench: Relevant as a scientific benchmarking context for evaluating model performance in life sciences-style tasks.
GPT-5.1 and GPT-5.3: Useful comparison points in the newsletter’s narrative around the evolution of coding and reasoning models.

Newsletter Mentions (10)

2026-06-18

“A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry - GPT‑5.4 paired with Molecule.one’s Maria proposed using TEMPO as an additive to improve Chan–Lam coupling of primary sulfonamides and generated experimental grids that were run (10,080 reactions) in Maria Lab.”

#1 📝 OpenAI News A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry - GPT‑5.4 paired with Molecule.one’s Maria proposed using TEMPO as an additive to improve Chan–Lam coupling of primary sulfonamides and generated experimental grids that were run (10,080 reactions) in Maria Lab. Across two cycles the mean yield rose from 16.6% to 25.2%, yields improved for 88% of boronic acids and 83% of sulfonamides tested, the share of reactions >30% yield increased from 15.6% to 37.5%, and human bench repeats confirmed higher yields for 11 of 14 substrate pairs (most showing >2× increases).

2026-04-06

“Alex and Romain demonstrate how the Codex team uses GPT 5.4, the Codex Spark model, and the Codex app’s plan mode—backed by an open-source Rust harness—to one-shot generate and iterate code features like a NASA Artemis iOS screen and a 2D game at up to 1,200 edits per second.”

#2 ▶️ How OpenAI's Codex Team Builds with Codex (43 Min) | Alex & Romain Peter Yang Alex and Romain demonstrate how the Codex team uses GPT 5.4, the Codex Spark model, and the Codex app’s plan mode—backed by an open-source Rust harness—to one-shot generate and iterate code features like a NASA Artemis iOS screen and a 2D game at up to 1,200 edits per second. The Codex team writes specs in under 10 bullet points when implementing new features, relying on Codex to handle most of the coding work. In “fast mode” with Codex Spark, live edits to a 2D game rendered at an average throughput of 1,200 code changes per second. The Codex app, VS Code extension, and CLI all communicate with the same open-source Rust-based harness, allowing multiple parallel agent tasks independent of a single workspace folder.

2026-04-03

“Simon Willison details agentic engineering patterns—using coding agents like Claude Code and GPT-5.4 for red/green TDD, thin project templates, and public GitHub hoarding—to boost software productivity and reliability.”

▶️ Why AI came for coders first, automation timelines, and how we’re inside the AI inflection Lennys Podcast Simon Willison details agentic engineering patterns—using coding agents like Claude Code and GPT-5.4 for red/green TDD, thin project templates, and public GitHub hoarding—to boost software productivity and reliability. GPT-5.1 and Claude Opus 4.5 released in November 2025 advanced coding agents from “mostly working” to “almost always following instructions,” enabling engineers to churn out up to 10,000 lines of code per day. Invoking the prompt “red/green TDD” directs agents to write tests first, run them to confirm failure, implement the code, then rerun tests to confirm success. Willison’s GitHub repositories include simonw/tools with 193 HTML/JavaScript client-side utilities and simonw/ressearch with 75 AI-driven research projects to hoard reusable code experiments.

2026-03-30

“Claire Vo installed OpenClaw via a one-line Homebrew script on separate macOS machines (three Mac minis and one MacBook Air), configured nine role-based agents (Polly, Finn, Sam, etc.) using Opus-4.6, Sonnet-4.6 and GPT-5.4 models, and linked them to Telegram bots for automating her business outreach and family scheduling.”

#1 ▶️ How OpenClaw’s AI agents run this founder’s business, family and life | Claire Vo Lennys Podcast Claire Vo installed OpenClaw via a one-line Homebrew script on separate macOS machines (three Mac minis and one MacBook Air), configured nine role-based agents (Polly, Finn, Sam, etc.) using Opus-4.6, Sonnet-4.6 and GPT-5.4 models, and linked them to Telegram bots for automating her business outreach and family scheduling. She ran “brew install openclaw” in iTerm, chose personal use, selected Opus-4.6, Sonnet-4.6 and GPT-5.4, then registered each agent as a Telegram bot via BotFather. Agent “Sam” performs a daily sweep of her CRM for product-led growth signups, enriches leads with Exa People Search, drafts and sends outreach emails via Telegram, replacing a human assistant who worked 10 hours/week. She enabled macOS Screen Sharing and Remote Login on her Mac minis to SSH into and view the agent GUIs from her laptop over Wi-Fi, removing the need for dedicated monitors, keyboards or mice.

2026-03-27

“Amp has placed GPT-5.4 into its new Deep agent mode, tuning the model to behave more like Codex for longer-form, more code-focused reasoning.”

#6 📝 Ampcode Chronicle GPT‐5.4 in Deep - Amp has placed GPT-5.4 into its new Deep agent mode, tuning the model to behave more like Codex for longer-form, more code-focused reasoning. The update emphasizes deeper planning and agentic behavior for coding tasks.

2026-03-18

“OpenAI introduces GPT-5.4 mini and nano - OpenAI announces GPT-5.4 mini and nano, smaller variants of the GPT-5.4 family designed for more efficient deployment while retaining advanced capabilities.”

#1 📝 OpenAI News Introducing GPT-5.4 mini and nano - OpenAI announces GPT-5.4 mini and nano, smaller variants of the GPT-5.4 family designed for more efficient deployment while retaining advanced capabilities. The release targets use cases needing lower latency and resource usage. Also covered by: @Simon Willison #2 𝕏 OpenAI released GPT-5.4 mini today in ChatGPT, Codex and the API—optimized for coding, computer use, multimodal understanding and subagents.

2026-03-09

“OpenAI shipped GPT-5.4, Anthropic released a free AI course library, and a browser-based spy-satellite simulator debuted.”

GenAI PM Daily March 09, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 9 insights for PM Builders, ranked by relevance from X, YouTube, and LinkedIn. OpenAI Ships GPT-5.4 Model #1 𝕏 There's An AI For That : OpenAI shipped GPT-5.4, Anthropic released a free AI course library, and a browser-based spy-satellite simulator debuted. A rogue AI agent went off-script, AI fakes flooded Iran war coverage, and Claude Cowork got a full walkthrough.

2026-03-08

“in Dharmesh Shah Dharmesh Shah finds GPT 5.4 excels as both PM (reasoning, long-range execution) and back-end architect (deep thinking, precise execution).”

in Dharmesh Shah Dharmesh Shah finds GPT 5.4 excels as both PM (reasoning, long-range execution) and back-end architect (deep thinking, precise execution). He sees Lovable as the go-to UX designer for polished prototypes and Opus 4.

2026-03-07

“OpenAI Releases GPT-5.4 with 1M Token Context #1 in Dharmesh Shah announces OpenAI’s GPT 5.4 launch, featuring a 1 million-token context window with auto-compaction, smarter tool-calling for on-demand skill loading, and up to 1.5× faster coding performance—enabling new HubSpot data-dictionary use cases.”

GenAI PM Daily March 07, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 25 insights for PM Builders, ranked by relevance from LinkedIn, YouTube, X, and Blogs. OpenAI Releases GPT-5.4 with 1M Token Context #1 in Dharmesh Shah announces OpenAI’s GPT 5.4 launch, featuring a 1 million-token context window with auto-compaction, smarter tool-calling for on-demand skill loading, and up to 1.5× faster coding performance—enabling new HubSpot data-dictionary use cases. Also covered by: @LlamaIndex 🦙 #2 ▶️ What the New ChatGPT 5.4 Means for the World AI Explained GPT-5.4 Thinking, released 48 hours after GPT-5.3 Instant, demonstrated one-shot creation of an animated league table for Stockport County FC using OpenAI’s Codex on Windows and Mac.

2026-03-06

“OpenAI Introduces GPT-5.4 Model #1 📝 OpenAI News Introducing GPT-5.4 - Announcement of GPT-5.4 as a new product release, highlighting improvements and new capabilities over prior models. The post introduces features and potential applications of GPT-5.4.”

GenAI PM Daily March 06, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 25 insights for PM Builders, ranked by relevance from Blogs, X, LinkedIn, and YouTube. OpenAI Introduces GPT-5.4 Model #1 📝 OpenAI News Introducing GPT-5.4 - Announcement of GPT-5.4 as a new product release, highlighting improvements and new capabilities over prior models. The post introduces features and potential applications of GPT-5.4. Also covered by: @There's An AI For That , @Kevin Weil 🇺🇸 #2 𝕏 claire vo 🖤 GPT-5.4 just went live in @chatprd with a 1M-token context window, more human-like dialogue than 5.2/5.3, and chef’s-kiss tool use for deep investigations. She flags it still defaults to bullet points, needs front-end/UX polish, and has latency/stability TBD.

Claude Codetool

Anthropic’s coding product/blog referenced in a customer story about Cognition’s use of Claude Fable 5. For AI PMs, it highlights enterprise coding adoption narratives.

OpenAIcompany

OpenAI is the company behind GPT models and ChatGPT, and it appears here as the launcher of GPT-5.6 Luna and the relauncher of its Bio Bug Bounty. For AI PMs, it signals continued productization of frontier models and safety programs.

Cursortool

A code editor and AI agent workspace that introduced Side Chats and cloud agent hooks in this newsletter. For AI PMs, it shows how copilots are evolving into persistent, context-aware agent threads.

Simon Willisonperson

A developer and AI commentator quoted here in relation to OpenAI’s clarification of ChatGPT Work behavior. He is relevant as an interpreter and critic of product messaging.

Codextool

A ChatGPT-related coding/product mode discussed as a voice-and-tone setting rather than a separate product. For PMs, it highlights how users mentally bucket product experiences.

OpenClawtool

An AI assistant or agent instance used in a public prompt-injection challenge and later in startup support automation. It is relevant to AI PMs as an example of both security testing and customer support automation.

ChatGPTtool

OpenAI's consumer AI assistant and chat product. Here it is the delivery surface for GPT-Live voice features and rollout.

Claire Voperson

A product leader and commentator cited in the newsletter multiple times. She appears in the Gusto shipping story and in discussion of AI-first product development.

Dharmesh Shahperson

A product and startup leader cited here for advising teams to use SQL instead of LLM inference when data can be directly queried. He is presented as giving practical PM guidance.

Ampcompany

A coding agent/product whose interface is described as a capability dial rather than named modes. The newsletter covers its model-routing and reasoning-effort configuration.

Opus 4.6tool

A model used as the underlying engine for an assistant tested against prompt injection. The newsletter notes its explicit anti-prompt-injection rules as a sign that defense measures are improving.

chatprdtool

An AI-first product management tool or startup referenced by Claire Vo. The newsletter uses it in a discussion of shipping an AI-first version of an app without traditional PM tooling.

Lovabletool

A no-code AI app builder referenced here as the platform used to build a production-grade SaaS product. For PMs, it illustrates how agentic coding is changing build-vs-buy and software creation economics.

Sonnet-4.6tool

A Claude model used in the newsletter's example to run Python code and analyze a floor plan. It is discussed as part of an agentic workflow inside Claude Cowork.

red/green TDDconcept

A test-driven development pattern adapted for coding agents. It emphasizes an iterative failure/success loop that can make agentic coding more reliable.

Stay updated on GPT 5.4

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free

GPT 5.4

Key Highlights

GPT 5.4

Overview

Key Developments

Relevance to AI PMs

Related

Newsletter Mentions (10)

Related

Stay updated on GPT 5.4