Santiago
A named individual cited for commentary on Cline and a Computer Use agent. He is presented as a source of hands-on evaluation of agentic coding tools.
Key Highlights
- Santiago is a practitioner-oriented source on coding agents, model infrastructure, and agent-native product design.
- He is repeatedly cited for hands-on evaluations of Cline, Claude Code workflows, and a new Computer Use agent.
- His commentary emphasizes reducing single-model-provider risk through gateways and broad model access.
- He also contributes product-thinking around local-first memory systems and the split between agent APIs and generative interfaces.
Santiago
Overview
Santiago is a recurring commentator and builder cited across multiple newsletter mentions for hands-on evaluations of agentic coding tools, AI infrastructure, and emerging product patterns. He appears most often as a practical operator sharing benchmark results, workflow tips, open-source launches, and product theses spanning coding agents, model access layers, memory systems, and agent-oriented interfaces.For AI Product Managers, Santiago matters because his commentary is consistently grounded in implementation details rather than abstract hype. His mentions connect day-to-day practitioner workflows—such as improving Claude Code usage, evaluating coding harnesses like Cline, and adding portable memory layers for agents—with broader product strategy questions around model routing, interface design, and the shift toward agent-native software.
Key Developments
- 2026-04-07: Santiago highlighted Linkup’s web index, describing a system that extracts, timestamps, and sources individual facts for semantic retrieval, effectively enabling web-scale RAG for AI agents.
- 2026-04-08: Santiago shared a fully open-source CLI that maps database schemas to context files, enabling teams to create a local AI data analyst workflow without SaaS dependencies or API keys.
- 2026-04-08: He also showcased Modulate AI’s Velma deepfake-detection model, emphasizing high accuracy and much lower cost for real-time voice-call screening.
- 2026-04-10: Santiago pointed to a new Large Memory Models architecture designed to mimic human memory instead of relying on traditional RAG or vector search approaches.
- 2026-04-21: He unveiled Kimi K2.6, an open-source coding model positioned as state-of-the-art on several coding benchmarks, with strengths in long-horizon and cross-language development tasks.
- 2026-04-23: Santiago released an open-source, local-first memory layer for Mac-based LLM agents that records projects, prompts, tools, and device actions as Markdown, making persistent memory shareable across agents.
- 2026-04-26: He shared a practical Claude Code tip: using `Ctrl+R` to search prompt history instantly instead of cycling through prior prompts manually.
- 2026-05-01: Santiago argued that UI is splitting into two layers: headless APIs for AI agents and generative, runtime-composed interfaces for humans.
- 2026-05-04: He described a self-coaching workflow using Claude Code’s `/insights` report in a fresh Claude session to identify bad habits and generate specific improvements.
- 2026-05-13: Santiago warned that depending on a single LLM provider can break products overnight and launched an API layer providing access to 400+ models through one key to improve resilience and flexibility.
- 2026-05-14: He said Cline is among the best agentic coding harnesses and cited new benchmarks showing it outperforming Claude Code running on Opus 4.6 and 4.7.
- 2026-05-14: Santiago also introduced a newly released Computer Use agent that can build and deploy apps inside existing applications like a human user, positioning it as a more stable and cost-effective alternative to more brittle computer-use agents.
Relevance to AI PMs
1. He offers practical evaluation signals for agentic coding products. Santiago’s comparisons across Cline, Claude Code, coding models, and computer-use agents give AI PMs concrete inputs for tool selection, internal dogfooding, and benchmark design.2. He surfaces infrastructure patterns that reduce platform risk. His emphasis on multi-model access, local-first memory, and alternatives to single-provider architectures is directly useful for PMs designing resilient AI products and roadmap priorities.
3. He points toward emerging agent-native UX patterns. Santiago’s view that software is dividing into headless APIs for agents and generative interfaces for humans helps PMs think tactically about where to invest in orchestration, composability, and interface generation.
Related
- Cline / Cline Kanban: Santiago explicitly praised Cline as a top agentic coding harness and referenced benchmark-based performance comparisons.
- Claude Code / Claude / `/insights`: He frequently shares tactical Claude Code workflows, including prompt-history search and retrospection using usage reports.
- Computer Use agent: A major recent mention tied Santiago to a new agent capable of operating software like a human user inside applications.
- internal-ai-model-gateways / merge-gateway / 400-models: His warning about single-provider dependence connects to model routing and gateway strategies for product resilience.
- memory-layer / large-memory-models / RAG: Santiago is associated with both local-first memory tooling and commentary on alternatives to conventional retrieval-based memory systems.
- Linkup: He highlighted Linkup’s fact-level indexed retrieval approach as infrastructure for agent-ready search and web-scale RAG.
- Kimi K2.6, Codex, Cursor, Gemini CLI, agentic-coding-toolkit: These adjacent coding-agent and coding-model entities frame the broader competitive landscape in which Santiago’s evaluations are relevant.
- UI / headless-apis / generative-interfaces: His product thesis directly connects to how AI PMs may redesign software around agents and dynamically generated human interfaces.
Newsletter Mentions (24)
“#17 𝕏 Santiago says Cline is one of the best agentic coding harnesses, with new benchmarks showing it outperforms Claude Code running on Opus 4.6 and 4.7.”
#17 𝕏 Santiago says Cline is one of the best agentic coding harnesses, with new benchmarks showing it outperforms Claude Code running on Opus 4.6 and 4.7. #18 𝕏 Santiago introduces a newly released Computer Use agent that can build and deploy apps within your own applications like a human user, offering a more stable and cost-effective alternative to finicky, expensive agents.
“#9 𝕏 Santiago warns that relying on a single LLM provider can break your app overnight and launches an API giving you access to 400+ models with one key, so you can stay flexible.”
#9 𝕏 Santiago warns that relying on a single LLM provider can break your app overnight and launches an API giving you access to 400+ models with one key, so you can stay flexible.
“Santiago runs Claude Code’s `/insights` every three weeks to generate a usage report, then uploads it in a fresh Claude session asking for his top five bad habits and concrete fixes.”
#5 𝕏 Santiago runs Claude Code’s `/insights` every three weeks to generate a usage report, then uploads it in a fresh Claude session asking for his top five bad habits and concrete fixes. This self-coaching loop reveals new optimizations each time. #6 in Peter Yang shares how to keep AI agents running with your MacBook lid closed by installing the Amphetamine app and unchecking “Allow display sleep” and “Allow system sleep when display is closed” in Session Defaults.
“Santiago argues UI is splitting into headless APIs for AI agents and on-demand generative interfaces for humans, composed at runtime by agents rather than baked into apps.”
#10 𝕏 Santiago argues UI is splitting into headless APIs for AI agents and on-demand generative interfaces for humans, composed at runtime by agents rather than baked into apps.
“Santiago points out that in Claude Code you can press Ctrl+R to instantly search your prompt history instead of toggling through prompts with the arrow keys, speeding up prompt retrieval.”
#7 𝕏 Santiago points out that in Claude Code you can press Ctrl+R to instantly search your prompt history instead of toggling through prompts with the arrow keys, speeding up prompt retrieval. #8 𝕏 Cognition sits down with Grant Sanderson (@3blue1brown) to unpack what it’s really like to build frontier AI—covering daily workflows, collaboration patterns, and a team culture likened to Olympic athletes training together.
“#14 𝕏 Santiago released an open-source, local-first memory layer for Mac-based LLM agents that logs your projects, prompts, tools and on-device actions as Markdown files.”
#14 𝕏 Santiago released an open-source, local-first memory layer for Mac-based LLM agents that logs your projects, prompts, tools and on-device actions as Markdown files. You can then share this persistent memory layer with any agent.
“Santiago unveiled Kimi K2.6, an open-source coding model hitting SOTA on HLE with tools (54.0), SWE-Bench Multilingual (76.7), and Pro (58.6).”
#8 𝕏 Santiago unveiled Kimi K2.6, an open-source coding model hitting SOTA on HLE with tools (54.0), SWE-Bench Multilingual (76.7), and Pro (58.6). It also boosts front-end development, DevOps, performance optimizations, long-horizon coding, and cross-language generalization.
“They’ve built a completely new Large Memory Models architecture that mimics human memory instead of using RAG or vector search. The founders—authors of 160+ Nature and ICLR papers—even closed their Harvard lab to focus on it.”
#16 𝕏 Santiago : They’ve built a completely new Large Memory Models architecture that mimics human memory instead of using RAG or vector search. The founders—authors of 160+ Nature and ICLR papers—even closed their Harvard lab to focus on it.
“Santiago built a 100% open-source CLI that maps your database schema to context files, letting you spin up a full AI data analyst in under two hours—entirely local, no SaaS or API keys required.”
#10 𝕏 Santiago built a 100% open-source CLI that maps your database schema to context files, letting you spin up a full AI data analyst in under two hours—entirely local, no SaaS or API keys required. #15 𝕏 Santiago showcases @modulate_ai’s Velma deepfake detection model hitting 98.9% accuracy on Hugging Face’s Arena at 120× lower cost, making continuous, real-time voice-call screening possible to combat deepfakes.
“#23 𝕏 Santiago : Linkup built a web index from scratch that extracts, timestamps, and sources individual facts for semantic retrieval.”
#23 𝕏 Santiago : Linkup built a web index from scratch that extracts, timestamps, and sources individual facts for semantic retrieval. Instead of returning full pages, it delivers the exact information AI agents need, essentially enabling RAG across the entire web.
Related
Anthropic's coding assistant used for programming and automation tasks. The newsletter references it for building a custom approval device and for writing and research workflows inside AI agents.
Anthropic's model family used for agent orchestration and developer workflows. In this newsletter it is highlighted as powering CodeRabbit's agent orchestration system.
An AI coding editor and automation platform. The newsletter highlights multi-repository support for automations across codebases.
OpenAI's coding agent/tool used here for self-improving tax workflows and long-running autonomous loops. It is presented as capable of iterative task execution with plugins and goal-based runs.
Autonomous or semi-autonomous software systems that can take actions, manage workflows, and assist with operational work. The newsletter references them in multiple founder and startup productivity contexts.
A rapid, intuition-driven way of building software with AI assistance. For PMs, it represents low-friction prototyping and UI iteration.
A pattern for answering questions by retrieving relevant context and generating responses from it. The newsletter highlights multimodal RAG for searching across audio, image, and video data.
Social platform referenced as a source of examples, discussion, and scraping/monetization concerns. In this newsletter it is part of the agent workflow stack and content source.
Google’s command-line interface for working with Gemini in developer workflows. It is mentioned as a compatible tool alongside agent skills in antigravity.
A video creation platform with CLI and API access. The newsletter highlights PixVerse's command-line workflow for generating video from prompts and its newer v6 headless engine.
A memory architecture that mimics human memory instead of relying on RAG or vector search. For PMs, it suggests alternative approaches to long-context recall and personalization.
A headless prompt-to-video engine focused on realism, multi-shot sequencing, and dynamic camera motion. It is framed as the core capability behind PixVerse AI v6's CLI workflow.
A versioned PixVerse release focused on headless prompt-to-video automation. The newsletter highlights its cinematic realism engine and CLI-based workflow for generating videos programmatically.
Stay updated on Santiago
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free