Opus 4.6
Anthropic’s latest Opus-class model release with a 1 million-token context window. It is positioned for long-context planning, coding, and agentic task execution.
Key Highlights
- Opus 4.6 is Anthropic’s flagship Opus-class model release focused on long-context reasoning, coding, and agentic task execution.
- Its biggest product milestone was general availability of a 1 million-token context window with standard pricing.
- Real-world examples link Opus 4.6 to autonomous growth experimentation, multi-agent automation, and coding workflows.
- The model also surfaced operational risks, including hallucinated tool actions and inefficient context loading without strong instructions.
- For AI PMs, Opus 4.6 is most relevant as a case study in building reliable long-context and action-taking AI products.
Overview
Opus 4.6 is Anthropic’s latest Opus-class model release, commonly referenced as Claude Opus 4.6 or Opus-4.6. It is positioned as a high-capability model for long-context planning, coding, and agentic task execution, with a major differentiator being its 1 million-token context window. In practice, that makes it useful for workflows that require reasoning across large codebases, long documents, multiple transcripts, or multi-step operational tasks without constantly truncating or reloading context.
For AI Product Managers, Opus 4.6 matters because it shows how frontier models are moving from chat-style assistance into systematic execution: growth experimentation, coding, agent orchestration, and long-horizon task handling. Newsletter mentions tie it to real operational use cases ranging from autonomous growth loops at Anthropic to multi-agent personal automation setups and model comparisons in coding environments. At the same time, reports of hallucinated actions and inefficient context loading reinforce an important PM lesson: stronger models expand product possibilities, but they also increase the need for guardrails, context management, and validation layers.
Key Developments
- 2026-02-07: Claude Opus 4.6 was compared with GPT-5.3 Codex in a build-off to create a Poly Market-style product, highlighting Opus’s agent-team approach versus Codex’s ability to be steered mid-execution.
- 2026-02-08: Boris Cherny announced a /fast mode for Opus in Claude Code, describing it as an experimental higher-compute, higher-cost mode built and tested on top of Opus 4.6 for urgent incident response and accelerated work.
- 2026-02-09: Mike Krieger said Anthropic Labs had been using a faster version of Opus 4.6 running 2.5× faster, calling it a major unlock and signaling excitement about broader rollout.
- 2026-02-16: An autonomous Claude Code agent used the OpenCode CLI via OpenRouter to run multiple models in parallel—including Opus 4.6, GLM5, Minimax 2.5, and Gemini 3 Pro—for creative code generation and media workflow automation.
- 2026-02-25: Carl Vellotti criticized Opus 4.6 for loading too many files for simple questions and for rarely spawning context-saving agents, prompting practical discussion around context management and better `CLAUDE.md` instructions.
- 2026-03-04: Guillermo Rauch shared a cautionary example in which Opus 4.6 hallucinated a fake GitHub repo ID and triggered an unintended Vercel deployment, underscoring the need for strict validation of model-generated API actions.
- 2026-03-14: Anthropic announced that 1 million-token context was generally available for Opus 4.6 and Sonnet 4.6, with standard pricing across the full window and no added long-context premium.
- 2026-03-22: Peter Yang said the new 1M-token context window felt like a capability jump from Opus 4.6 to Opus 4.7, emphasizing how much larger context can change perceived model performance.
- 2026-03-30: Claire Vo used OpenClaw with role-based agents powered by Opus-4.6, Sonnet-4.6, and GPT-5.4 to automate business outreach and family scheduling across multiple Macs and Telegram bots.
- 2026-04-06: Anthropic’s growth team launched CASH (Claude Accelerates Sustainable Hypergrowth), using Claude with Opus 4.6 to automate growth experimentation from opportunity identification through post-launch analysis; the team reported results comparable to junior PM-level win rates on copy and UI tweaks, and noted that reliability improved materially after upgrading from Opus 4.5 to Opus 4.6.
Relevance to AI PMs
1. Design for long-context products and workflows.
Opus 4.6’s 1M-token window makes it viable for features that span large specs, long customer transcripts, analytics exports, code repositories, and historical decisions. AI PMs can use this to prototype assistants for roadmap synthesis, research summarization, implementation planning, and cross-document reasoning.
2. Validate agentic execution, not just answer quality.
The CASH and OpenClaw examples show Opus 4.6 being used in multi-step workflows where the model identifies opportunities, builds, analyzes, or takes action across tools. PMs should evaluate success on task completion, reliability, handoff behavior, and error recovery—not only on benchmark scores or chat responses.
3. Invest in guardrails and context hygiene early.
Reports of hallucinated deployments and wasteful file loading show that more capable models can still behave unsafely or inefficiently. Tactically, PMs should require action validation, constrain tool permissions, define context-loading rules, and use instruction files like `CLAUDE.md` to shape model behavior in production systems.
Related
- Anthropic: Creator of Opus 4.6 and the broader Claude model family.
- Claude: The product ecosystem where Opus 4.6 is deployed, including chat and coding workflows.
- Claude Code / Anthropic CLI / ClaudeMD: Developer-facing environments and instruction patterns frequently mentioned alongside Opus 4.6 for coding and agent execution.
- CASH: Anthropic’s internal growth automation system that reportedly became reliable after moving from Opus 4.5 to Opus 4.6.
- OpenClaw: Multi-agent setup used with Opus-4.6, Sonnet-4.6, and GPT-5.4 for role-based automation.
- Sonnet-46: Sibling Anthropic model that also received the 1M-token context upgrade.
- Opus-47: Referenced implicitly through commentary that the 1M context upgrade felt like a jump to a new version class.
- 1M-token-context-window / context-management / agentic-task-handling: Core concepts that define why Opus 4.6 stands out operationally.
- OpenRouter / OpenCode / GLM5 / Minimax-25 / Gemini-3-Pro / GPT-54 / GPT-53-Codex: Related model-routing and comparison ecosystem showing how Opus 4.6 is evaluated in multi-model workflows.
- Vercel / Guillermo Rauch / Simon Willison / Peter Yang / Boris Cherny / Mike Krieger / Greg Isenberg: People and companies connected through product commentary, comparisons, launch notes, and cautionary examples involving Opus 4.6.
Newsletter Mentions (11)
“Anthropic’s growth team launches CASH (Claude Accelerates Sustainable Hypergrowth) using Claude with Opus 4.6 to fully automate growth experimentation—from opportunity identification to post-launch analysis—achieving junior PM-level win rates on copy and UI tweaks.”
#12 ▶️ Head of Growth (Anthropic): “Claude is growing itself at this point” Lennys Podcast Anthropic’s growth team launches CASH (Claude Accelerates Sustainable Hypergrowth) using Claude with Opus 4.6 to fully automate growth experimentation—from opportunity identification to post-launch analysis—achieving junior PM-level win rates on copy and UI tweaks. Anthropic’s ARR jumped from $1 billion at the start of 2025 to $19 billion by February 2026 (10× YoY growth), hitting $4 billion mid-2025 and $9 billion end-2025—a $18 billion increase in 14 months. CASH was initiated a few months ago but only began delivering reliable results after upgrading from Opus 4.5 to Opus 4.6, automating four stages of growth work (opportunity ID, build, QA/brand compliance, and analysis). Co-work’s desktop app runs a scheduled task each morning on ~20–25 Hex chart links and Slack MCP transcripts, then uses Claude to summarize top concerns and insights in Slack.
“Claire Vo installed OpenClaw via a one-line Homebrew script on separate macOS machines (three Mac minis and one MacBook Air), configured nine role-based agents (Polly, Finn, Sam, etc.) using Opus-4.6, Sonnet-4.6 and GPT-5.4 models, and linked them to Telegram bots for automating her business outreach and family scheduling.”
#1 ▶️ How OpenClaw’s AI agents run this founder’s business, family and life | Claire Vo Lennys Podcast Claire Vo installed OpenClaw via a one-line Homebrew script on separate macOS machines (three Mac minis and one MacBook Air), configured nine role-based agents (Polly, Finn, Sam, etc.) using Opus-4.6, Sonnet-4.6 and GPT-5.4 models, and linked them to Telegram bots for automating her business outreach and family scheduling. She ran “brew install openclaw” in iTerm, chose personal use, selected Opus-4.6, Sonnet-4.6 and GPT-5.4, then registered each agent as a Telegram bot via BotFather. Agent “Sam” performs a daily sweep of her CRM for product-led growth signups, enriches leads with Exa People Search, drafts and sends outreach emails via Telegram, replacing a human assistant who worked 10 hours/week. She enabled macOS Screen Sharing and Remote Login on her Mac minis to SSH into and view the agent GUIs from her laptop over Wi-Fi, removing the need for dedicated monitors, keyboards or mice.
“#12 𝕏 Peter Yang says the new 1M-token context window feels like a version bump from Opus 4.6 to 4.7, delivering a noticeable performance and capacity boost.”
A model capability note highlights the impact of longer context windows. #12 𝕏 Peter Yang says the new 1M-token context window feels like a version bump from Opus 4.6 to 4.7, delivering a noticeable performance and capacity boost.
“1M context is now generally available for Opus 4.6 and Sonnet 4.6. Standard pricing now applies”
Claude now offers a 1 million-token context window in its Opus 4.6 and Sonnet 4.6 models, and this upgrade is generally available to all users. Also covered by: @Claude #2 📝 Simon Willison 1M context is now generally available for Opus 4.6 and Sonnet 4.6 - Anthropic announced 1M token context availability for Opus 4.6 and Sonnet 4.6; standard pricing now applies across the full 1M window with no long-context premium.
“Guillermo Rauch recounts how an AI model (Opus 4.6) hallucinated a fake GitHub repo ID and inadvertently used Vercel’s API to deploy random code, underscoring the need for strict validation of AI-generated requests.”
Opus 4.6 is discussed in the context of an unsafe deployment action caused by hallucination.
“#23 in 🥞 Carl Vellotti calls out Opus 4.6 for needlessly loading eight files to answer a two-sentence question and rarely spawning context-saving agents.”
#23 in 🥞 Carl Vellotti calls out Opus 4.6 for needlessly loading eight files to answer a two-sentence question and rarely spawning context-saving agents. He shares a “Context Management” snippet to drop into your CLAUDE.md to fix it.
“All About AI Uses an autonomous Claude Code agent on a Mac Mini to invoke the OpenCode CLI via OpenRouter on four models (GLM5, Minimax 2.5, Gemini 3 Pro, Opus 4.6) in parallel to generate HTML demos of a retro space game, convert them with Remotion into a grid-style MP4 video, and draft a post on X.”
#2 ▶️ How to Run OpenCode Inside an Autonomous Claude Code AI Agent All About AI Uses an autonomous Claude Code agent on a Mac Mini to invoke the OpenCode CLI via OpenRouter on four models (GLM5, Minimax 2.5, Gemini 3 Pro, Opus 4.6) in parallel to generate HTML demos of a retro space game, convert them with Remotion into a grid-style MP4 video, and draft a post on X. Executed “open code run --model openrouter GLM5 'Should I walk or drive to the car wash? It’s 50 m away'” via Cloud Code CLI, receiving “you should walk to the car wash,” and then ran “open code run --model openrouter Gemini-3-Pro …” obtaining “drive. You can’t wash the car if you leave it behind.” Created a Cloud Code skill file open code test skill.md to launch four OpenRouter models (GLM5, Minimax-2.5, Gemini-3-Pro, Opus-4.6) in parallel on the prompt “create a full screen animated retro arcade space battle scene,” saving outputs as llm-test/game- .html.
“Mike Krieger has been building with Labs’ fast Opus—Claude Opus 4.6 running 2.5× faster—and calls it a “crazy unlock.””
#5 𝕏 Mike Krieger has been building with Labs’ fast Opus—Claude Opus 4.6 running 2.5× faster—and calls it a “crazy unlock.” He’s now excited to roll it out beyond Anthropic. Also covered by: @Guillermo Rauch
“Boris Cherny launched the /fast mode in Opus, using significantly more compute than Opus 4.6 and incurring higher costs for incident response and accelerated work on critical projects, and announced his team built and tested this experimental fast mode for Opus 4.6 with Claude over the past few weeks ( tweet ).”
GenAI PM Daily February 08, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 20 insights for PM Builders, ranked by relevance from X, Blogs, YouTube, and LinkedIn. Anthropic Launches Fast Mode for Claude Code #4 𝕏 Boris Cherny launched the /fast mode in Opus, using significantly more compute than Opus 4.6 and incurring higher costs for incident response and accelerated work on critical projects, and announced his team built and tested this experimental fast mode for Opus 4.6 with Claude over the past few weeks ( tweet ). #15 ▶️ The Two Models that will Dominate AI Discussions Just Got Released (Claude Opus 4.6 + GPT 5.3 Codex) AI Explained Benchmark comparison shows Claude Opus 4.6 outperforms GPT 5.2 by about 140 ELO points on the GDP val white-collar work benchmark, while GPT 5.3 Codex achieves 77.3% on TerminalBench 2.0 extra-high settings versus 65.4% for Opus 4.6 Max.
“Comparison of Claude Opus 4.6 (Anthropic CLI) and GPT-5.3 Codex (OpenAI Mac desktop app) by building a Poly Market competitor to showcase Opus’s agent teams and Codex’s mid-execution steering.”
#7 ▶️ Claude Opus 4.6 vs GPT-5.3 Codex Greg Isenberg Comparison of Claude Opus 4.6 (Anthropic CLI) and GPT-5.3 Codex (OpenAI Mac desktop app) by building a Poly Market competitor to showcase Opus’s agent teams and Codex’s mid-execution steering. GPT-5.3 Codex built a Poly Market competitor in 3 minutes and 47 seconds, scaffolding a core LMSR market-maker engine, REST API router, responsive front end, and passing 10/10 unit and integration tests.
Related
Anthropic’s coding-focused assistant/tool used for building and automating engineering workflows. The newsletter references it in both security and product-usage contexts.
AI company behind Claude and related developer tools. In this newsletter it is highlighted for internal use of Claude Code and for product expansion into legal workflows.
Anthropic’s assistant/model family, referenced in enterprise deployment, managed agents, and coding workflows. For AI PMs, it is central to agentic product design and enterprise integration.
A creator and commentator who shares practical workflows for Claude Code and personal operating systems for agents. He appears here as a curator of implementation advice for AI builders.
Developer and writer known for his AI tooling commentary and the `llm` project. He is credited here with the 0.32a2 release note.
CEO of Vercel and a prominent builder in the AI developer tooling space. He is mentioned releasing npx deepsec and using a Claude agent team to remediate issues quickly.
A software project/company referenced as the codebase Garry Tan worked in while fixing a Dockerfile PATH issue with AI-generated code.
A developer platform referenced for environment secret handling in preview and production settings. Relevant for AI PMs concerned with secure deployment workflows.
A startup and product operator known for sharing AI-driven business and acquisition ideas. Relevant to PMs for workflow mining and product arbitrage ideas.
A developer or product leader associated with Claude Code. He launched a `/usage` command and changed run limits to help users self-serve token and plan debugging.
A newer OpenAI model release with improved natural dialogue, longer context, and stronger tool use. It is discussed as a model now available in Cursor and chatprd.
OpenAI’s coding-focused model/release highlighted for benchmark performance, steerability, and speed improvements. The newsletter frames it as a strong coding agent option with multiple benchmark scores.
A model-routing platform used to call multiple LLMs through a common interface. Here it is used to run four models in parallel for comparison and generation tasks.
A Claude model version referenced for more intelligent outputs with higher token usage. It is discussed alongside Opus 4.6 and effort settings for economical runs.
An AI agent framework referenced with Claude Code and Codex in a browser automation setup. It is part of the broader tooling stack for agentic development workflows.
A Gemini model variant used in a real workflow library project. The newsletter mentions it as one of the tools used to build the ChatPRD index.
A model used in the clip-creation pipeline to select moments from long-form audio or video. Relevant for PMs exploring automated content repurposing and editorial workflows.
A project context file format referenced as something agents can import to understand a codebase or workspace. It is described as enabling immediate context ingestion without manual setup.
Stay updated on Opus 4.6
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free