Opus 4.6
Anthropic’s latest Opus-class model release with a 1 million-token context window. It is positioned for long-context planning, coding, and agentic task execution.
Key Highlights
- Opus 4.6 is Anthropic’s flagship Opus-class model release focused on long-context reasoning, coding, and agentic task execution.
- Its biggest product milestone was general availability of a 1 million-token context window with standard pricing.
- Real-world examples link Opus 4.6 to autonomous growth experimentation, multi-agent automation, and coding workflows.
- The model also surfaced operational risks, including hallucinated tool actions and inefficient context loading without strong instructions.
- For AI PMs, Opus 4.6 is most relevant as a case study in building reliable long-context and action-taking AI products.
Overview
Opus 4.6 is Anthropic’s latest Opus-class model release, commonly referenced as Claude Opus 4.6 or Opus-4.6. It is positioned as a high-capability model for long-context planning, coding, and agentic task execution, with a major differentiator being its 1 million-token context window. In practice, that makes it useful for workflows that require reasoning across large codebases, long documents, multiple transcripts, or multi-step operational tasks without constantly truncating or reloading context.
For AI Product Managers, Opus 4.6 matters because it shows how frontier models are moving from chat-style assistance into systematic execution: growth experimentation, coding, agent orchestration, and long-horizon task handling. Newsletter mentions tie it to real operational use cases ranging from autonomous growth loops at Anthropic to multi-agent personal automation setups and model comparisons in coding environments. At the same time, reports of hallucinated actions and inefficient context loading reinforce an important PM lesson: stronger models expand product possibilities, but they also increase the need for guardrails, context management, and validation layers.
Key Developments
- 2026-02-07: Claude Opus 4.6 was compared with GPT-5.3 Codex in a build-off to create a Poly Market-style product, highlighting Opus’s agent-team approach versus Codex’s ability to be steered mid-execution.
- 2026-02-08: Boris Cherny announced a /fast mode for Opus in Claude Code, describing it as an experimental higher-compute, higher-cost mode built and tested on top of Opus 4.6 for urgent incident response and accelerated work.
- 2026-02-09: Mike Krieger said Anthropic Labs had been using a faster version of Opus 4.6 running 2.5× faster, calling it a major unlock and signaling excitement about broader rollout.
- 2026-02-16: An autonomous Claude Code agent used the OpenCode CLI via OpenRouter to run multiple models in parallel—including Opus 4.6, GLM5, Minimax 2.5, and Gemini 3 Pro—for creative code generation and media workflow automation.
- 2026-02-25: Carl Vellotti criticized Opus 4.6 for loading too many files for simple questions and for rarely spawning context-saving agents, prompting practical discussion around context management and better `CLAUDE.md` instructions.
- 2026-03-04: Guillermo Rauch shared a cautionary example in which Opus 4.6 hallucinated a fake GitHub repo ID and triggered an unintended Vercel deployment, underscoring the need for strict validation of model-generated API actions.
- 2026-03-14: Anthropic announced that 1 million-token context was generally available for Opus 4.6 and Sonnet 4.6, with standard pricing across the full window and no added long-context premium.
- 2026-03-22: Peter Yang said the new 1M-token context window felt like a capability jump from Opus 4.6 to Opus 4.7, emphasizing how much larger context can change perceived model performance.
- 2026-03-30: Claire Vo used OpenClaw with role-based agents powered by Opus-4.6, Sonnet-4.6, and GPT-5.4 to automate business outreach and family scheduling across multiple Macs and Telegram bots.
- 2026-04-06: Anthropic’s growth team launched CASH (Claude Accelerates Sustainable Hypergrowth), using Claude with Opus 4.6 to automate growth experimentation from opportunity identification through post-launch analysis; the team reported results comparable to junior PM-level win rates on copy and UI tweaks, and noted that reliability improved materially after upgrading from Opus 4.5 to Opus 4.6.
Relevance to AI PMs
1. Design for long-context products and workflows.
Opus 4.6’s 1M-token window makes it viable for features that span large specs, long customer transcripts, analytics exports, code repositories, and historical decisions. AI PMs can use this to prototype assistants for roadmap synthesis, research summarization, implementation planning, and cross-document reasoning.
2. Validate agentic execution, not just answer quality.
The CASH and OpenClaw examples show Opus 4.6 being used in multi-step workflows where the model identifies opportunities, builds, analyzes, or takes action across tools. PMs should evaluate success on task completion, reliability, handoff behavior, and error recovery—not only on benchmark scores or chat responses.
3. Invest in guardrails and context hygiene early.
Reports of hallucinated deployments and wasteful file loading show that more capable models can still behave unsafely or inefficiently. Tactically, PMs should require action validation, constrain tool permissions, define context-loading rules, and use instruction files like `CLAUDE.md` to shape model behavior in production systems.
Related
- Anthropic: Creator of Opus 4.6 and the broader Claude model family.
- Claude: The product ecosystem where Opus 4.6 is deployed, including chat and coding workflows.
- Claude Code / Anthropic CLI / ClaudeMD: Developer-facing environments and instruction patterns frequently mentioned alongside Opus 4.6 for coding and agent execution.
- CASH: Anthropic’s internal growth automation system that reportedly became reliable after moving from Opus 4.5 to Opus 4.6.
- OpenClaw: Multi-agent setup used with Opus-4.6, Sonnet-4.6, and GPT-5.4 for role-based automation.
- Sonnet-46: Sibling Anthropic model that also received the 1M-token context upgrade.
- Opus-47: Referenced implicitly through commentary that the 1M context upgrade felt like a jump to a new version class.
- 1M-token-context-window / context-management / agentic-task-handling: Core concepts that define why Opus 4.6 stands out operationally.
- OpenRouter / OpenCode / GLM5 / Minimax-25 / Gemini-3-Pro / GPT-54 / GPT-53-Codex: Related model-routing and comparison ecosystem showing how Opus 4.6 is evaluated in multi-model workflows.
- Vercel / Guillermo Rauch / Simon Willison / Peter Yang / Boris Cherny / Mike Krieger / Greg Isenberg: People and companies connected through product commentary, comparisons, launch notes, and cautionary examples involving Opus 4.6.
Newsletter Mentions (11)
“Anthropic’s growth team launches CASH (Claude Accelerates Sustainable Hypergrowth) using Claude with Opus 4.6 to fully automate growth experimentation—from opportunity identification to post-launch analysis—achieving junior PM-level win rates on copy and UI tweaks.”
#12 ▶️ Head of Growth (Anthropic): “Claude is growing itself at this point” Lennys Podcast Anthropic’s growth team launches CASH (Claude Accelerates Sustainable Hypergrowth) using Claude with Opus 4.6 to fully automate growth experimentation—from opportunity identification to post-launch analysis—achieving junior PM-level win rates on copy and UI tweaks. Anthropic’s ARR jumped from $1 billion at the start of 2025 to $19 billion by February 2026 (10× YoY growth), hitting $4 billion mid-2025 and $9 billion end-2025—a $18 billion increase in 14 months. CASH was initiated a few months ago but only began delivering reliable results after upgrading from Opus 4.5 to Opus 4.6, automating four stages of growth work (opportunity ID, build, QA/brand compliance, and analysis). Co-work’s desktop app runs a scheduled task each morning on ~20–25 Hex chart links and Slack MCP transcripts, then uses Claude to summarize top concerns and insights in Slack.
“Claire Vo installed OpenClaw via a one-line Homebrew script on separate macOS machines (three Mac minis and one MacBook Air), configured nine role-based agents (Polly, Finn, Sam, etc.) using Opus-4.6, Sonnet-4.6 and GPT-5.4 models, and linked them to Telegram bots for automating her business outreach and family scheduling.”
#1 ▶️ How OpenClaw’s AI agents run this founder’s business, family and life | Claire Vo Lennys Podcast Claire Vo installed OpenClaw via a one-line Homebrew script on separate macOS machines (three Mac minis and one MacBook Air), configured nine role-based agents (Polly, Finn, Sam, etc.) using Opus-4.6, Sonnet-4.6 and GPT-5.4 models, and linked them to Telegram bots for automating her business outreach and family scheduling. She ran “brew install openclaw” in iTerm, chose personal use, selected Opus-4.6, Sonnet-4.6 and GPT-5.4, then registered each agent as a Telegram bot via BotFather. Agent “Sam” performs a daily sweep of her CRM for product-led growth signups, enriches leads with Exa People Search, drafts and sends outreach emails via Telegram, replacing a human assistant who worked 10 hours/week. She enabled macOS Screen Sharing and Remote Login on her Mac minis to SSH into and view the agent GUIs from her laptop over Wi-Fi, removing the need for dedicated monitors, keyboards or mice.
“#12 𝕏 Peter Yang says the new 1M-token context window feels like a version bump from Opus 4.6 to 4.7, delivering a noticeable performance and capacity boost.”
A model capability note highlights the impact of longer context windows. #12 𝕏 Peter Yang says the new 1M-token context window feels like a version bump from Opus 4.6 to 4.7, delivering a noticeable performance and capacity boost.
“1M context is now generally available for Opus 4.6 and Sonnet 4.6. Standard pricing now applies”
Claude now offers a 1 million-token context window in its Opus 4.6 and Sonnet 4.6 models, and this upgrade is generally available to all users. Also covered by: @Claude #2 📝 Simon Willison 1M context is now generally available for Opus 4.6 and Sonnet 4.6 - Anthropic announced 1M token context availability for Opus 4.6 and Sonnet 4.6; standard pricing now applies across the full 1M window with no long-context premium.
“Guillermo Rauch recounts how an AI model (Opus 4.6) hallucinated a fake GitHub repo ID and inadvertently used Vercel’s API to deploy random code, underscoring the need for strict validation of AI-generated requests.”
Opus 4.6 is discussed in the context of an unsafe deployment action caused by hallucination.
“#23 in 🥞 Carl Vellotti calls out Opus 4.6 for needlessly loading eight files to answer a two-sentence question and rarely spawning context-saving agents.”
#23 in 🥞 Carl Vellotti calls out Opus 4.6 for needlessly loading eight files to answer a two-sentence question and rarely spawning context-saving agents. He shares a “Context Management” snippet to drop into your CLAUDE.md to fix it.
“All About AI Uses an autonomous Claude Code agent on a Mac Mini to invoke the OpenCode CLI via OpenRouter on four models (GLM5, Minimax 2.5, Gemini 3 Pro, Opus 4.6) in parallel to generate HTML demos of a retro space game, convert them with Remotion into a grid-style MP4 video, and draft a post on X.”
#2 ▶️ How to Run OpenCode Inside an Autonomous Claude Code AI Agent All About AI Uses an autonomous Claude Code agent on a Mac Mini to invoke the OpenCode CLI via OpenRouter on four models (GLM5, Minimax 2.5, Gemini 3 Pro, Opus 4.6) in parallel to generate HTML demos of a retro space game, convert them with Remotion into a grid-style MP4 video, and draft a post on X. Executed “open code run --model openrouter GLM5 'Should I walk or drive to the car wash? It’s 50 m away'” via Cloud Code CLI, receiving “you should walk to the car wash,” and then ran “open code run --model openrouter Gemini-3-Pro …” obtaining “drive. You can’t wash the car if you leave it behind.” Created a Cloud Code skill file open code test skill.md to launch four OpenRouter models (GLM5, Minimax-2.5, Gemini-3-Pro, Opus-4.6) in parallel on the prompt “create a full screen animated retro arcade space battle scene,” saving outputs as llm-test/game- .html.
“Mike Krieger has been building with Labs’ fast Opus—Claude Opus 4.6 running 2.5× faster—and calls it a “crazy unlock.””
#5 𝕏 Mike Krieger has been building with Labs’ fast Opus—Claude Opus 4.6 running 2.5× faster—and calls it a “crazy unlock.” He’s now excited to roll it out beyond Anthropic. Also covered by: @Guillermo Rauch
“Boris Cherny launched the /fast mode in Opus, using significantly more compute than Opus 4.6 and incurring higher costs for incident response and accelerated work on critical projects, and announced his team built and tested this experimental fast mode for Opus 4.6 with Claude over the past few weeks ( tweet ).”
GenAI PM Daily February 08, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 20 insights for PM Builders, ranked by relevance from X, Blogs, YouTube, and LinkedIn. Anthropic Launches Fast Mode for Claude Code #4 𝕏 Boris Cherny launched the /fast mode in Opus, using significantly more compute than Opus 4.6 and incurring higher costs for incident response and accelerated work on critical projects, and announced his team built and tested this experimental fast mode for Opus 4.6 with Claude over the past few weeks ( tweet ). #15 ▶️ The Two Models that will Dominate AI Discussions Just Got Released (Claude Opus 4.6 + GPT 5.3 Codex) AI Explained Benchmark comparison shows Claude Opus 4.6 outperforms GPT 5.2 by about 140 ELO points on the GDP val white-collar work benchmark, while GPT 5.3 Codex achieves 77.3% on TerminalBench 2.0 extra-high settings versus 65.4% for Opus 4.6 Max.
“Comparison of Claude Opus 4.6 (Anthropic CLI) and GPT-5.3 Codex (OpenAI Mac desktop app) by building a Poly Market competitor to showcase Opus’s agent teams and Codex’s mid-execution steering.”
#7 ▶️ Claude Opus 4.6 vs GPT-5.3 Codex Greg Isenberg Comparison of Claude Opus 4.6 (Anthropic CLI) and GPT-5.3 Codex (OpenAI Mac desktop app) by building a Poly Market competitor to showcase Opus’s agent teams and Codex’s mid-execution steering. GPT-5.3 Codex built a Poly Market competitor in 3 minutes and 47 seconds, scaffolding a core LMSR market-maker engine, REST API router, responsive front end, and passing 10/10 unit and integration tests.
Related
Anthropic's coding assistant used for programming and automation tasks. The newsletter references it for building a custom approval device and for writing and research workflows inside AI agents.
AI company behind Claude. The newsletter references Claude usage and later notes Anthropic may have reached product-market fit.
Anthropic's model family used for agent orchestration and developer workflows. In this newsletter it is highlighted as powering CodeRabbit's agent orchestration system.
A creator mentioned again as raising seed funding and choosing AI agents for onboarding and role learning. He is also the source credit on the Ryan Carson item.
Independent AI commentator and developer known for practical analysis of LLM products. Here he argues Anthropic and OpenAI have found product-market fit.
CEO of Vercel and a prominent web platform builder. The newsletter credits him with launching an AI Gateway plugin for WordPress.
An AI agent workflow system used to automate founder and operator tasks with cron jobs, skills, and integrations. The newsletter cites it as part of a solo-founder operating stack alongside Codex and Devin.
Vercel is the hosting platform used for the rapid prototype demo. It remains a common deployment choice for AI-built web apps and landing pages.
An operator and creator cited for a playbook on building vertical AI agent startups. He is mentioned as laying out a workflow-first approach: map the industry process manually before automating it.
A Claude Code maintainer or product figure credited here with shipping the new `/usage` command. The mention is relevant for PMs tracking feature-level product changes in developer tools.
A newer OpenAI model release with improved natural dialogue, longer context, and stronger tool use. It is discussed as a model now available in Cursor and chatprd.
A model-routing platform used to call multiple LLMs through a common interface. Here it is used to run four models in parallel for comparison and generation tasks.
A coding agent mentioned as supporting context forking, where users can rewind or branch from prior turns.
OpenAI’s coding-focused model/release highlighted for benchmark performance, steerability, and speed improvements. The newsletter frames it as a strong coding agent option with multiple benchmark scores.
A Claude model used in the newsletter's example to run Python code and analyze a floor plan. It is discussed as part of an agentic workflow inside Claude Cowork.
A plan or configuration associated with GPT 5.5 in the benchmark discussion. It is mentioned as the mode under which GPT 5.5 achieved its score.
A Gemini model variant used in a real workflow library project. The newsletter mentions it as one of the tools used to build the ChatPRD index.
A project context file format referenced as something agents can import to understand a codebase or workspace. It is described as enabling immediate context ingestion without manual setup.
Stay updated on Opus 4.6
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free