GenAI PM
person15 mentions· Updated Feb 14, 2026

Boris Cherny

A commentator associated here with Spotify’s use of Claude Code. Relevant to PMs for illustrating AI-driven software delivery narratives.

Key Highlights

  • Boris Cherny is repeatedly cited as a key public voice on Claude Code, AI code review, and developer-agent product design.
  • His commentary offers practical PM lessons on pricing, effort controls, capacity management, and workflow-centric adoption.
  • He argued for agentic search over RAG in cases where privacy, reliability, and stale indexes create operational drag.
  • He framed model effort as a budget-setting mechanism, with Medium positioned as the best default trade-off for many users.
  • His remarks on subscriptions and third-party tool usage highlight how infrastructure constraints directly shape AI product packaging.

Boris Cherny

Overview

Boris Cherny appears in these newsletters as a prominent public commentator on Anthropic’s developer products, especially Claude Code, Claude Code Review, and the operational realities of deploying frontier coding agents at scale. Across multiple mentions, he is associated with practical guidance on model effort settings, code review automation, agentic search, token management, OSS infrastructure, and subscription-capacity trade-offs.

For AI Product Managers, Boris Cherny matters because his comments consistently translate advanced AI tooling into concrete product and engineering operating lessons. His mentions surface recurring PM themes: how to package usage, how to communicate model-performance trade-offs, when to replace RAG with agentic retrieval, how to design AI-assisted developer workflows, and how infrastructure constraints shape customer experience.

Key Developments

  • 2026-02-22: Boris Cherny described using subagents with Git worktrees to parallelize large codebase migrations, assigning each agent a subset of folders while a main agent handled merge conflicts. He also noted that Opus 4.6 and Sonnet 4.6 can produce more intelligent outputs at higher token cost, with lighter runs achievable via lower effort settings.
  • 2026-02-27: He highlighted that Claude Code’s OSS infrastructure had been under development for some time, with pieces such as the Agent sandbox already open-source, and discussed trade-offs around openness and standards.
  • 2026-03-03: He shared a tactical usability tip for Claude Code: running `/setup-terminal` in Apple Terminal enables native paste support.
  • 2026-03-08: He reframed model “effort” as a maximum budget, recommending Medium as the default balance between speed, cost, and capability, versus Low and High.
  • 2026-03-10: He launched Claude Code’s Code Review feature, describing AI agents that deeply review every pull request and claiming major productivity gains for Anthropic engineers, including catching bugs that would otherwise be missed.
  • 2026-03-15: He clarified the distinction between “Ultrathink” and `/effort max`: Ultrathink applies maximum effort to a single turn, while `/effort max` sustains peak effort across the full conversation.
  • 2026-03-22: He said Anthropic had unshipped its RAG setup because of privacy, security, reliability, and stale-index issues, concluding that agentic search produced better outcomes with fewer trade-offs.
  • 2026-03-26: He stated that Claude Code Review automatically catches over 99% of bugs, reducing human review to a fast sanity check in many cases.
  • 2026-04-04: He explained that rising Claude usage from third-party tools had exceeded the assumptions behind existing subscriptions, leading Anthropic to carefully manage capacity and prioritize product and API customers. He also argued that subscriptions are optimized for intended usage patterns, not raw token volume alone.
  • 2026-04-06: He described token-counting implementation differences across platforms: Anthropic offers a native token-counting endpoint, while Bedrock, Vertex, and Azure required custom token-counting workarounds.

Relevance to AI PMs

1. Pricing and packaging strategy: Boris Cherny’s comments on subscriptions, token budgets, and effort settings illustrate how PMs should design plans around usage patterns, latency, and capacity constraints rather than simplistic per-token assumptions. 2. Developer workflow product design: His examples around Claude Code Review, subagents, worktrees, and terminal setup show how AI products win through workflow integration, not just model quality. PMs can use these patterns to prioritize UX improvements that compound adoption. 3. Retrieval and architecture decisions: His move away from RAG toward agentic search is a useful signal for PMs evaluating retrieval architectures. It highlights when operational burdens like privacy, stale indexes, and reliability can outweigh the theoretical benefits of classic RAG systems.

Related

  • Anthropic: The company most directly associated with Boris Cherny in these mentions, especially through Claude and Claude Code.
  • Claude / Claude Code: Core products he is most often connected to, including effort controls, coding workflows, and code review automation.
  • Claude Code Review: A key launch tied to his commentary on automated PR review and engineering productivity.
  • Bedrock, Vertex, Azure: Mentioned in connection with token counting, where these platforms lacked native support and required custom implementation.
  • RAG: An approach he reportedly moved away from due to privacy, security, reliability, and staleness issues.
  • Agentic search: Presented as a better alternative to RAG in some contexts, with fewer operational trade-offs.
  • Opus 4.6 / Sonnet 4.6 / Opus / GPT-4 Turbo: Relevant model comparisons in discussions of intelligence, effort, and token cost.
  • Spotify: Included in the entity graph as part of the broader narrative around enterprise AI-driven software delivery, though the newsletter mentions here focus more directly on Anthropic and Claude Code.

Newsletter Mentions (15)

2026-04-06
Boris Cherny built custom token-counting for Bedrock, Vertex, and Azure (which lack a native API) while using Anthropic’s built-in token-counting endpoint directly.

#4 𝕏 Boris Cherny built custom token-counting for Bedrock, Vertex, and Azure (which lack a native API) while using Anthropic’s built-in token-counting endpoint directly.

2026-04-04
#4 𝕏 Boris Cherny explains that a surge in Claude usage from third-party tools has outstripped the design of existing subscriptions, so capacity is being managed carefully and prioritized for product and API customers.

GenAI PM Daily April 04, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 17 insights for PM Builders, ranked by relevance from X, Blogs, and LinkedIn. Claude subscriptions will no longer cover usage on third-party tools like OpenClaw. #4 𝕏 Boris Cherny explains that a surge in Claude usage from third-party tools has outstripped the design of existing subscriptions, so capacity is being managed carefully and prioritized for product and API customers. #6 𝕏 Boris Cherny argues that subscription plans are optimized for specific usage patterns rather than raw token counts. He notes that building at scale involves trade-offs that prioritize certain use cases over others.

2026-03-26
#23 𝕏 Boris Cherny says Claude Code Review automatically catches over 99% of bugs, leaving engineers to only perform a quick sanity check.

#23 𝕏 Boris Cherny says Claude Code Review automatically catches over 99% of bugs, leaving engineers to only perform a quick sanity check. #24 𝕏 Anthropic introduced Claude Code auto mode, a safer middle ground that uses trained classifiers to automatically approve or reject code-generation requests instead of relying on manual permission prompts.

2026-03-22
#3 𝕏 Boris Cherny unshipped their RAG setup due to privacy, security, reliability, and index-staleness issues, finding agentic search delivered better results with fewer trade-offs.

Several operational and product insights discuss search, agent skills, and agentic workflows. #3 𝕏 Boris Cherny unshipped their RAG setup due to privacy, security, reliability, and index-staleness issues, finding agentic search delivered better results with fewer trade-offs.

2026-03-15
#10 𝕏 Boris Cherny explains that “Ultrathink” in Claude delivers maximum effort for a single turn, while “/effort max” (or `claude --effort=max`) sustains peak effort for the entire conversation.

Today's top 12 insights for PM Builders, ranked by relevance from X, LinkedIn, and Blogs. Ramp Ships 500+ Features Using Claude Code #10 𝕏 Boris Cherny explains that “Ultrathink” in Claude delivers maximum effort for a single turn, while “/effort max” (or `claude --effort=max`) sustains peak effort for the entire conversation.

2026-03-10
#7 𝕏 Boris Cherny launched Claude Code’s new Code Review feature, where AI agents deep-review every PR, boosting Anthropic engineers’ code output by 200% this year and catching bugs he’d otherwise miss.

Boris Cherny appears in a newsletter item about PR review automation and engineering productivity. He is positioned as a voice explaining the benefits of AI agents in code review.

2026-03-08
𝕏 Boris Cherny reframes “effort” as a max budget for AI models and recommends Medium as the default—offering the best balance between Low (fast but less capable) and High (very smart but slow and token-heavy).

𝕏 Boris Cherny reframes “effort” as a max budget for AI models and recommends Medium as the default—offering the best balance between Low (fast but less capable) and High (very smart but slow and token-heavy).

2026-03-03
#16 𝕏 Boris Cherny shares that running `/setup-terminal` in Apple Terminal enables native paste support—see code.claude.com/docs/en/terminal-config for setup details.

#16 𝕏 Boris Cherny shares that running `/setup-terminal` in Apple Terminal enables native paste support—see code.claude.com/docs/en/terminal-config for setup details. #17 𝕏 Santiago warns that Cowork’s multi-step file workflow—granting folder access, copying files, then generating a plan—slows you down when dealing with many or changing files, and suggests native chat-based file support for a smoother experience.

2026-02-27
Boris Cherny highlights that they’ve been building Claude Code’s OSS infrastructure for a while, with core components like Bun and the Agent sandbox already open-source, and discusses the trade-offs around standards and openness.

#15 𝕏 Boris Cherny highlights that they’ve been building Claude Code’s OSS infrastructure for a while, with core components like Bun and the Agent sandbox already open-source, and discusses the trade-offs around standards and openness.

2026-02-22
Boris Cherny uses subagents with Git worktrees to parallelize large codebase migrations by assigning each agent a few folders, greatly speeding up the process while a main agent resolves any merge conflicts.

#2 𝕏 Boris Cherny uses subagents with Git worktrees to parallelize large codebase migrations by assigning each agent a few folders, greatly speeding up the process while a main agent resolves any merge conflicts. #9 𝕏 Boris Cherny says Opus 4.6 and Sonnet 4.6 deliver more intelligent outputs at the cost of higher token usage, and you can use `/model` to set effort to low or medium for lighter, more economical runs.

Related

Claude Codetool

Anthropic's coding-focused agentic tool for building and automating software workflows. In this newsletter it is discussed as being integrated with Vercel AI Gateway and as a Chrome extension for browser automation.

Anthropiccompany

Anthropic is mentioned as a comparison point in the AI chess game and as the focus of a successful enterprise coding strategy. For PMs, it is framed as a company benefiting from sharp product focus.

Claudetool

Anthropic's general-purpose AI assistant and model family. It appears here as a comparison point for strategy work and in discussions around browser automation and coding.

Opus 4.6tool

Anthropic’s latest Opus-class model release with a 1 million-token context window. It is positioned for long-context planning, coding, and agentic task execution.

RAGconcept

A common pattern for grounding model responses in retrieved documents. The newsletter contrasts LlamaIndex's newer agentic document processing approach against RAG.

Opustool

An Anthropic model family referenced in a comparison against Sonnet. The newsletter frames the trade-off as task- and workflow-dependent rather than absolute.

Sonnet-4.6tool

A Claude model version referenced for more intelligent outputs with higher token usage. It is discussed alongside Opus 4.6 and effort settings for economical runs.

Claude Code Reviewtool

An AI-powered code review feature from Claude Code designed to provide deep PR feedback, catch bugs, and improve development workflows. It is presented as a research-preview beta for Team and Enterprise.

Stay updated on Boris Cherny

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free