GenAI PM
person16 mentions· Updated May 10, 2026

Jason Zhou

An AI builder or practitioner mentioned for launching `/goal` support in CodeX and Hermes agents. He is cited as recommending workflow guardrails like interview mode and clear stop conditions.

Key Highlights

  • Jason Zhou is repeatedly cited for shipping practical agent workflows, especially around Codex, Claude Code, and autonomous coding loops.
  • He introduced `/goal` patterns that combine iterative execution with guardrails like interview mode, stop conditions, and state management.
  • His examples show how AI agents can automate concrete business workflows such as support-ticket handling and fraud detection.
  • He emphasizes the enabling stack for reliable agents: memory environments, verification loops, atomic tooling, and better tool-use schemas.
  • His work is especially relevant to AI PMs building developer tools, agent UX, and operational automation products.

Jason Zhou

Overview

Jason Zhou is an AI builder and practitioner frequently cited for hands-on work with coding agents, autonomous workflows, and agent tooling. Across newsletter mentions, he appears as an operator who ships practical integrations and demos rather than just commenting on trends—for example, connecting OpenAI Codex with Claude Code, introducing `/goal`-style autonomous loops, and showing how agents can handle concrete business tasks like support-ticket triage and referral-fraud detection.

For AI Product Managers, Jason Zhou matters because his work sits at the intersection of agent UX, developer tooling, and operational reliability. His examples consistently surface the product patterns that make autonomous systems usable in production: state management, tool-use design, guardrails such as interview mode and stop conditions, token efficiency, and verification-oriented loops. In short, he represents the applied edge of agentic product design.

Key Developments

  • 2026-02-16: Jason Zhou introduced and explained WebMCP, including setup via HTML attributes or React, to make websites agent-ready. He also tied the concept to Chrome Beta 146 and a WebMCP debugger workflow.
  • 2026-02-23: He highlighted Anthropic’s advanced tool calling as an underrated capability, emphasizing programmatic invocation, dynamic filtering, built-in search, and practical examples.
  • 2026-02-23: He also pointed to benchmark gains in tool-use reliability, noting that concrete examples for complex tools improved JSON accuracy from 72% to 90%.
  • 2026-03-06: Jason Zhou argued that breakthroughs in memory environments, verification loops, and atomic tooling made always-on, long-running autonomous agents newly viable.
  • 2026-03-30: He highlighted RTK (Rust Token Killer), an open-source optimization tool that can reduce Claude Code token usage by up to 60% by stripping noise and redundant content.
  • 2026-04-01: He launched direct Codex integration in Claude Code, enabling code-review workflows through plugin commands tied to openai-codex.
  • 2026-04-23: Jason Zhou built a Crewlet agent that detected referral farming by tracing a spike in temp-email signups back to a referral link, then supported remediation through SQL-based resets.
  • 2026-04-24: He built an AI agent that read a support ticket and autonomously submitted a PR in about 10 minutes, demonstrating end-to-end operational automation.
  • 2026-04-30: He shared lesser-known Claude Code setup tricks, including use of the hidden `.claude/rules/` directory, reinforcing his focus on practical workflow tuning.
  • 2026-05-04: Jason Zhou unveiled Codex’s `/goal` command, described as a stateful loop that sets goals, tests, self-corrects, and repeats until completion or budget exhaustion.
  • 2026-05-10: He launched `/goal` support in CodeX and Hermes agents for one-step autonomous coding and recommended workflow guardrails including interview mode, clear stop conditions, and a goal-buddy to manage state and goal files.

Relevance to AI PMs

1. He offers concrete design patterns for agent products. Jason Zhou’s examples go beyond “AI can do X” and show how to structure autonomy with loops, state files, stop conditions, and companion tools like goal-buddy. PMs can use these patterns when defining agent flows, success criteria, and failure boundaries.

2. He demonstrates measurable reliability improvements. His commentary on tool calling, verification loops, and example-based tool schemas is useful for PMs designing evaluation plans. The takeaway is tactical: better tool specs, tighter examples, and structured validation can materially improve completion quality.

3. He connects agent UX to operational outcomes. From support-ticket automation to fraud detection and code-review integrations, his work illustrates where agents create business value fastest: repetitive workflows with clear tools, narrow objectives, and verifiable outputs.

Related

  • codex / openai-codex: Central to Jason Zhou’s work on direct integration in Claude Code and the `/goal` autonomous loop.
  • claude-code / claude / clauderules: Closely tied to his workflow tips, plugin setup, token optimization, and hidden rules-directory practices.
  • hermes-agents / goal / goal-buddy: Connected to his stateful autonomous coding workflows and recommendations for managing goals and execution state.
  • anthropic / tool-calling: Relevant to his emphasis on advanced tool invocation patterns and schema/example quality.
  • memory-environments / verification-loops / atomic-tooling: These concepts underpin his view that long-running autonomous agents became practical.
  • rtk: Supports his focus on token efficiency and improving developer-agent economics.
  • webmcp / chrome-beta-146: Tied to his work on making websites directly agent-compatible.
  • crewlet / support-ticket / multi-agent-system / autonomous-agents: Reflect his applied experiments in business-task automation and agent orchestration.

Newsletter Mentions (16)

2026-05-10
#1 𝕏 Jason Zhou launched `/goal` support in CodeX and Hermes agents for one-step autonomous coding, advising use of interview mode, clear stop conditions, and a goal-buddy to manage state and goal files.

GenAI PM Daily May 10, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 11 insights for PM Builders, ranked by relevance from X, Blogs, and LinkedIn. PromptLayer’s multi-step agent evaluation framework #1 𝕏 Jason Zhou launched `/goal` support in CodeX and Hermes agents for one-step autonomous coding, advising use of interview mode, clear stop conditions, and a goal-buddy to manage state and goal files. #2 📝 PromptLayer Blog What Is Agent Evaluation? A Practical Guide for AI Teams - Agent evaluation tests whether an AI agent reliably completes tasks across real inputs, edge cases, and new versions by scoring not just final outputs but multi-step behavior via black-box, trajectory, and component-level evaluations, using metrics like task completion rate, tool selection accuracy, unsupported-claim rate, latency/cost per step, and regression pass rate. PromptLayer offers tracing with span-level context, reusable datasets, batch evaluations, backtesting, regression testing, automated evaluation triggers on new prompt versions, and flexible pipelines including code execution, human input, conversation simulation, regex checks, and LLM assertions. #3 in Udi Menkes built his new product’s entire data flow in a single interactive HTML file—complete with diagrams, in-page navigation, and color-coded complexity—letting his team understand it in minutes instead of hours. #4 𝕏 Garry Tan suggests diagramming your AI agent codebases and architecture in plain ASCII, then relentlessly questioning each component to clarify design and accelerate product development. #5 𝕏 Boris Cherny says Claude Code’s switch to a native installer means npm-only stats undercount its real usage. On Thursday it hit its second-highest signup day ever with 15× growth since Jan 1—now you can ask Claude to debug your SQL. #6 𝕏 Boris Cherny is enhancing Claude Code’s UX for snappier performance and adding debug logs so users can self-serve hang diagnostics. #7 𝕏 Harrison Chase calls LangSmith an org-wide platform for building AI agents that speeds up cross-functional collaboration and tightens feedback loops. #8 𝕏 Santiago showcases a step-by-step guide for constructing Python-powered multi-agent systems from scratch, leveraging MCP and A2A patterns to incrementally add complexity and enable collaborative AI agents. #9 𝕏 Garry Tan spends $2K/mo on Openclaw AI tokens to turbocharge product development and startup insights. He’s “tokenmaxxing” now with a goal to make these capabilities affordable for everyone in 18 months. #10 𝕏 Harrison Chase argues that treating AI agents as systems to measure and iteratively improve isn’t just a technical challenge—it demands intentional human collaboration and team processes. #11 in Peter Yang warns that unedited AI-generated markdown can compound small errors over time—what starts as 5% “slop” quickly balloons into an overwhelming pile of confusing, unverified content. Found this valuable? Share it with another PM - they can subscribe at genaipm.com Unsubscribe • Switch to Weekly

2026-05-04
OpenAI Codex unveils /goal stateful loop command #1 𝕏 Jason Zhou unveils Codex’s new /goal command, introducing a stateful Ralph-loop that iteratively sets goals, tests, self-corrects, and repeats until the mission is complete or the budget runs out.

GenAI PM Daily May 04, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 12 insights for PM Builders, ranked by relevance from X, YouTube, and LinkedIn. OpenAI Codex unveils /goal stateful loop command #1 𝕏 Jason Zhou unveils Codex’s new /goal command, introducing a stateful Ralph-loop that iteratively sets goals, tests, self-corrects, and repeats until the mission is complete or the budget runs out. #2 ▶️ Everything You Need to Know About Context Engineering in 40 Minutes | Ravi Mehta Peter Yang Use 3-layer context engineering (functional spec, Figma wireframe, JSON data enriched via Claude and a custom Cloud Code MCP server) to generate a high-fidelity music genre detail page prototype in Reforge Build that can be instantly re-themed by swapping the data.json file.

2026-04-30
#18 𝕏 Jason Zhou reveals three lesser-known Claude Code setup tricks—beyond the usual `npm install`—including the discovery and use of the hidden `.claude/rules/` directory.

#18 𝕏 Jason Zhou reveals three lesser-known Claude Code setup tricks—beyond the usual `npm install`—including the discovery and use of the hidden `.claude/rules/` directory. #19 𝕏 Harrison Chase predicts that by 2026 closed-model costs will be prohibitively high and he’s optimizing deepagents for peak performance on OSS models.

2026-04-24
Jason Zhou built an AI agent that reads a support ticket and autonomously submits a PR in just 10 minutes, instantly automating customer crediting.

#10 𝕏 Jason Zhou built an AI agent that reads a support ticket and autonomously submits a PR in just 10 minutes, instantly automating customer crediting. #11 𝕏 claire vo 🖤 GPT-5.5 ran a 6-hour autonomous validation, migrated 2M+ records with just one unhandled exception, closed all security issues for a zero-issue pen test, and even reverse-engineered a Divoom MiniToo Bluetooth speaker’s image encoding.

2026-04-23
#7 𝕏 Jason Zhou built a Crewlet agent that detected referral farming by spotting a spike in temp email signups and tracing them to one referral link in the database.

#7 𝕏 Jason Zhou built a Crewlet agent that detected referral farming by spotting a spike in temp email signups and tracing them to one referral link in the database. He then flagged the fake accounts and reset their credits via a SQL script.

2026-04-01
Jason Zhou launched direct Codex integration in Claude Code, enabling CC code reviews via four simple plugin commands—/plugin marketplace add openai/codex-plugin-cc, /plugin install codex@openai-codex, /reload-plugins, and /codex:setup.

𝕏 Jason Zhou launched direct Codex integration in Claude Code, enabling CC code reviews via four simple plugin commands—/plugin marketplace add openai/codex-plugin-cc, /plugin install codex@openai-codex, /reload-plugins, and /codex:setup.

2026-03-30
#3 𝕏 Jason Zhou highlights RTK (Rust Token Killer), an open-source tool that cuts Claude Code tokens by up to 60% by stripping noise, merging repeated content, and removing blank lines and progress bars.

#3 𝕏 Jason Zhou highlights RTK (Rust Token Killer), an open-source tool that cuts Claude Code tokens by up to 60% by stripping noise, merging repeated content, and removing blank lines and progress bars.

2026-03-06
Jason Zhou says December 2025’s LLM breakthrough—fueled by memory environments, verification loops and atomic tooling—enables always-on, long-running autonomous agents.

GenAI PM Daily March 06, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 25 insights for PM Builders, ranked by relevance from Blogs, X LinkedIn, and YouTube. OpenAI Introduces GPT-5.4 Model #1 📝 OpenAI News Introducing GPT-5.4 - Announcement of GPT-5.4 as a new product release, highlighting improvements and new capabilities over prior models. The post introduces features and potential applications of GPT-5.4. Also covered by: @There's An AI For That , @Kevin Weil 🇺🇸 #16 𝕏 Jason Zhou says December 2025’s LLM breakthrough—fueled by memory environments, verification loops and atomic tooling—enables always-on, long-running autonomous agents.

2026-02-23
𝕏 Jason Zhou hails Anthropic’s advanced tool calling—featuring programmatic invocation, dynamic filtering, built-in search and real-world use examples—as underrated gold in his quick 3-minute breakdown.

GenAI PM Daily February 23, 2026 | Today's top 12 insights for PM Builders, ranked by relevance from X, LinkedIn, Blogs, and YouTube. Anthropic Launches Advanced Tool Calling #1 𝕏 Jason Zhou hails Anthropic’s advanced tool calling—featuring programmatic invocation, dynamic filtering, built-in search and real-world use examples—as underrated gold in his quick 3-minute breakdown. #8 𝕏 Jason Zhou shows that providing LLMs with concrete tool-use examples for complex tools with many optional fields and dependencies boosts JSON output accuracy from 72% to 90% in Anthropic’s benchmarks.

2026-02-16
Jason Zhou walks through configuring webMCP via HTML attributes or a React setup to instantly make websites agent-ready.

#4 𝕏 Jason Zhou walks through configuring webMCP via HTML attributes or a React setup to instantly make websites agent-ready. He invites you to @aibuilderclub_ for a deeper breakdown and live walkthrough in his upcoming weekly call. Also covered by: @AI Jason #10 in Jason Zhou introduced WebMCP, a new Chrome 146 API that lets websites dynamically load and communicate with agents to perform page-specific actions, and shared steps to try it via Chrome Beta 146 and the WebMCP debugger tool.

Related

Claude Codetool

A coding environment for Claude mentioned for its keyboard shortcut that opens a full-featured editor for prompt writing. It is highlighted as making long prompts far easier to manage.

Anthropiccompany

The company behind Claude, mentioned as working with Peter Yang and Alex Albert on Claude's next iteration. It is referenced in the context of model design, harness design, and feedback evaluation.

Claudetool

Anthropic's AI assistant/model used here in multiple contexts: as the product being built next, as a system used to cluster feedback into synthetic evals, and as a tool that non-technical staff use.

Codextool

OpenAI’s coding agent/product that can run against local or remote development environments and surface live state for review and approval. For AI PMs, it’s a strong example of agentic coding workflows moving into mobile and enterprise contexts.

Devintool

An autonomous software engineering agent from Cognition that can investigate and fix issues. PMs use it as an example of agentic coding and security remediation.

Gemini 3tool

A Gemini model variant used here to power agentic workflow examples and multi-agent systems. It is relevant to AI PMs as an example of frontier model capability enabling more complex automated workflows.

OpenAI Codextool

An AI coding assistant/orchestrator used to run stateful goal loops and automate coding workflows. It is presented here as a PM-relevant tool for agentic software development.

WebMCPtool

A W3C-backed browser extension that exposes website functionality to MCP-capable agents. It lets developers register site functions as structured tools in the browser.

GPT-5.2 Protool

An OpenAI model variant discussed here for its ability to collaborate with HarmonicMath on near-autonomous proof generation. For AI PMs, it highlights stronger reasoning and math capabilities in advanced LLMs.

multi-agent systemconcept

An architecture where multiple specialized agents collaborate instead of one general-purpose agent. The newsletter includes debate over whether this is necessary versus using a single tool-loaded agent.

Crewlettool

A company referenced for experimenting with Slack bot-based monitoring and collaboration. It is cited as an example of per-channel task outcome tracking in workplace AI workflows.

Stay updated on Jason Zhou

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free