person33 mentions· Updated Jul 12, 2026

Santiago

A creator/commentator predicting the future of AI video experiences. The newsletter cites him on interactive livestream-style video and personalized ads.

Key Highlights

Santiago is cited as both a predictor of future AI interfaces and a builder of practical agent infrastructure.
He argues AI PMs should avoid single-model dependency by adopting multi-model gateways and routing layers.
He introduced the "value per token dollar" concept as a simple ROI metric for agent-based products.
His open-source work spans browser automation, RAG assistants, and local-model workflows for private data.
He predicts AI video will shift toward real-time, interactive, personalized livestream-style experiences.

Santiago

Overview

Santiago is a creator, commentator, and builder whose ideas show up repeatedly across AI product, agent infrastructure, and generative media discussions. In the newsletter, he is cited both for forward-looking predictions—especially around AI video becoming real-time, interactive, and personalized—and for practical launches spanning model routing, browser automation, RAG assistants, local model usage, and agent evaluation. That combination makes him notable not just as a commentator, but as an operator testing product patterns in public.

For AI Product Managers, Santiago matters because his posts consistently surface emerging product primitives before they become mainstream: multi-model routing instead of single-vendor dependence, reusable browser skills instead of brittle one-off automations, continuous learning from user corrections, and new unit economics frameworks such as "value per token dollar." He is especially relevant to PMs building agentic products, developer tools, AI infra layers, and next-generation media experiences.

Key Developments

2026-05-13: Warned that depending on a single LLM provider can break an app overnight and launched an API offering access to 400+ models through one key, emphasizing flexibility and supplier diversification.
2026-05-14: Said Cline is among the best agentic coding harnesses, citing benchmarks where it outperformed Claude Code on Opus 4.6 and 4.7.
2026-05-14: Introduced a newly released Computer Use agent that can build and deploy apps inside existing applications like a human operator, positioning it as a more stable and cost-effective approach than more fragile agents.
2026-06-02: Launched The Grid, an OpenAI-compatible API marketplace that routes requests to the cheapest model within a selected quality tier (standard, prime, max) with a one-line code change and supplier quality audits.
2026-06-20: Shared that he had been running gemma-4:26b locally on a Mac Studio since April to process private documents, with local inference already handling roughly 60% of his queries.
2026-06-25: Shared a fully open-sourced RAG assistant for navigating airline policies, including code and a walkthrough, highlighting a concrete vertical use case for retrieval-based assistants.
2026-06-26: Framed agent improvement across three layers—model, harness, and context—but argued that the biggest unlock is continuous learning from every user correction.
2026-06-29: Open-sourced a browser automation system that records user web actions, removes retries and dead ends, and abstracts workflows into reusable skills.
2026-06-30: Proposed the value per token dollar metric—value created divided by token cost—as a practical way to measure agent ROI; below 1 loses money, 1 breaks even, and above 1 is profitable.
2026-07-07: Proposed giving AI agents their own email addresses that users can CC, enabling autonomous task handling without cluttering the primary inbox.
2026-07-12: Predicted AI video will evolve from static clips into real-time, interactive livestream-style experiences, including highly personalized ad formats reminiscent of Minority Report.

Relevance to AI PMs

1. Model strategy and vendor resilience: Santiago repeatedly emphasizes avoiding single-provider lock-in. PMs can apply this by designing model gateways, fallback routing, and quality tiers early, rather than retrofitting them after outages or pricing shifts.

2. Agent product design and evaluation: His work highlights practical building blocks for agents: harnesses, reusable browser skills, continuous correction loops, and clear ROI metrics like value per token dollar. PMs can use these ideas to prioritize instrumentation, memory, learning loops, and measurable success criteria.

3. New interface and experience patterns: From CC-able email agents to interactive AI video, Santiago points toward product formats that feel more ambient, personalized, and action-oriented. PMs can use these signals to prototype beyond chat—into inbox workflows, live media, and embedded computer-use experiences.

The Grid, merge-gateway, internal-ai-model-gateways, 400-models: These connect to Santiago's focus on multi-model routing, cost optimization, and reducing dependence on any single AI provider.
Cline, Claude Code, agentic-coding-toolkit, ai-agents, llm-agents: These relate to his interest in agent harnesses, coding performance, and the infrastructure needed to make agents reliable in production.
browser-automation-system, skill-graph, computer-use-agent, agent-working-protocol, awp-coreawp-skill: These tie into his browser automation and reusable skills approach for turning human workflows into agent capabilities.
RAG, rag-assistant, context, memory-layer, continuous-learning: These reflect his thinking on retrieval, context handling, private knowledge workflows, and learning from user corrections.
gemma-426b, Mac Studio, open-source: These connect to his use of local open models for privacy-sensitive workloads and practical deployment tradeoffs.
PixVerse, pixverse-ai-v6, cinematic-realism-engine, generative-interfaces, UI: These are adjacent to his prediction that AI video will become interactive, real-time, and personalized rather than remaining a static generation medium.
value-per-token-dollar, matrix_build: These link directly to his proposed agent economics metric and its early adoption framing.

Newsletter Mentions (32)

2026-07-12

“Santiago predicts AI video will shift from static clips to real-time, interactive livestream-style experiences (think Minority Report–style personalized ads) and shares a demo link showcasing this early potential.”

#13 𝕏 Santiago predicts AI video will shift from static clips to real-time, interactive livestream-style experiences (think Minority Report–style personalized ads) and shares a demo link showcasing this early potential.

2026-07-07

“Santiago proposes giving AI agents their own email addresses you CC on messages, so they can autonomously handle tasks while keeping your inbox separate.”

GenAI PM Daily July 07, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 20 insights for PM Builders, ranked by relevance from Blogs, X, YouTube, and LinkedIn. #19 𝕏 Santiago proposes giving AI agents their own email addresses you CC on messages, so they can autonomously handle tasks while keeping your inbox separate.

2026-06-30

“#12 𝕏 Santiago proposes the “value per token dollar” ratio—value created ÷ token cost—to measure agent ROI, where below 1 loses money, at 1 breaks even, and above 1 is profitable.”

#12 𝕏 Santiago proposes the “value per token dollar” ratio—value created ÷ token cost—to measure agent ROI, where below 1 loses money, at 1 breaks even, and above 1 is profitable. He credits @matrix_build as the first team to adopt this metric.

2026-06-29

“#7 𝕏 Santiago open-sourced a browser automation system that records your web actions, prunes retries and dead ends, and abstracts the core logic into reusable skills.”

Two X post summaries attributed to Santiago discuss open-source model strategy and an open-sourced browser automation system.

2026-06-26

“Santiago breaks down agent improvement into three areas—model (for code/math), harness (tools and safety checks), and context (plain-text logs)—but argues that nothing beats continuous learning from every user correction.”

#14 𝕏 Santiago breaks down agent improvement into three areas—model (for code/math), harness (tools and safety checks), and context (plain-text logs)—but argues that nothing beats continuous learning from every user correction.

2026-06-25

“Santiago shared a fully open-sourced RAG assistant for navigating airline policies, complete with code and a @lenadroid walkthrough.”

Santiago is mentioned in two separate items: one about an airline-policy RAG assistant and one about Tripo AI's Project Eden. The newsletter treats him as a source of useful AI demos and releases.

2026-06-20

“Santiago has been running the gemma-4:26b model locally on his Mac Studio since April to process private documents, now handling about 60% of his queries.”

#15 𝕏 Santiago has been running the gemma-4:26b model locally on his Mac Studio since April to process private documents, now handling about 60% of his queries. #16 𝕏 Shreyas Doshi suggests measuring candidates against Claude—ask “are you better than Claude?”—because Claude already outperforms humans at many tasks once considered uniquely human.

2026-06-02

“Santiago launched “The Grid,” an OpenAI-compatible API marketplace that routes requests to the cheapest model in your chosen quality tier (standard, prime, max) via a one-line code change and audits suppliers to ensure quality.”

#10 𝕏 Santiago launched “The Grid,” an OpenAI-compatible API marketplace that routes requests to the cheapest model in your chosen quality tier (standard, prime, max) via a one-line code change and audits suppliers to ensure quality.

2026-05-14

“#17 𝕏 Santiago says Cline is one of the best agentic coding harnesses, with new benchmarks showing it outperforms Claude Code running on Opus 4.6 and 4.7.”

#17 𝕏 Santiago says Cline is one of the best agentic coding harnesses, with new benchmarks showing it outperforms Claude Code running on Opus 4.6 and 4.7. #18 𝕏 Santiago introduces a newly released Computer Use agent that can build and deploy apps within your own applications like a human user, offering a more stable and cost-effective alternative to finicky, expensive agents.

2026-05-13

“#9 𝕏 Santiago warns that relying on a single LLM provider can break your app overnight and launches an API giving you access to 400+ models with one key, so you can stay flexible.”

#9 𝕏 Santiago warns that relying on a single LLM provider can break your app overnight and launches an API giving you access to 400+ models with one key, so you can stay flexible.

Claude Codetool

Anthropic’s coding product/blog referenced in a customer story about Cognition’s use of Claude Fable 5. For AI PMs, it highlights enterprise coding adoption narratives.

Claudetool

Anthropic’s assistant and coding tool, discussed here in both the Reflection dashboard and a physical-AI deployment at UST. The newsletter highlights its usage analytics, workflow suggestions, and enterprise integration.

Cursortool

A code editor and AI agent workspace that introduced Side Chats and cloud agent hooks in this newsletter. For AI PMs, it shows how copilots are evolving into persistent, context-aware agent threads.

Codextool

A ChatGPT-related coding/product mode discussed as a voice-and-tone setting rather than a separate product. For PMs, it highlights how users mentally bucket product experiences.

AI agentsconcept

Systems that use models plus tools, memory, and planning to perform multi-step tasks autonomously or semi-autonomously. The newsletter references both agent architectures and agentic coding/workflows.

vibe-codingconcept

An AI-native development approach where builders use AI tools to rapidly create software. The newsletter treats it as a growth and product-building methodology.

RAGconcept

A pattern for grounding model outputs in retrieved context rather than relying solely on model weights. The newsletter frames it as often outperforming fine-tuning for practical product work.

Claude Fable 5tool

A Claude model used by Cognition for overnight work and production workflows. For AI PMs, it signals trust, reliability, and enterprise readiness for coding tasks.

Xcompany

Social platform referenced as a source of examples, discussion, and scraping/monetization concerns. In this newsletter it is part of the agent workflow stack and content source.

Gemini CLItool

Google’s command-line interface for working with Gemini in developer workflows. It is mentioned as a compatible tool alongside agent skills in antigravity.

Cinematic Realism Enginetool

A headless prompt-to-video engine focused on realism, multi-shot sequencing, and dynamic camera motion. It is framed as the core capability behind PixVerse AI v6's CLI workflow.

Large Memory Modelsconcept

A memory architecture that mimics human memory instead of relying on RAG or vector search. For PMs, it suggests alternative approaches to long-context recall and personalization.

PixVerse AI v6tool

A versioned PixVerse release focused on headless prompt-to-video automation. The newsletter highlights its cinematic realism engine and CLI-based workflow for generating videos programmatically.

PixVersetool

A video creation platform with CLI and API access. The newsletter highlights PixVerse's command-line workflow for generating video from prompts and its newer v6 headless engine.

Stay updated on Santiago

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free

Santiago

Key Highlights

Santiago

Overview

Key Developments

Relevance to AI PMs

Related

Newsletter Mentions (32)

Related

Stay updated on Santiago