GenAI PM
person50 mentions· Updated May 23, 2026

Philipp Schmid

A Google AI/Developer Relations figure mentioned for demonstrating Gemini Managed Agents and the Interactions API. He appears here as a presenter explaining hosted sandboxed agent execution.

Key Highlights

  • Philipp Schmid is a prominent Google AI developer-facing voice associated with practical demos and guides for Gemini agents.
  • He is especially relevant for explaining Gemini Managed Agents and the Interactions API, including hosted sandboxed execution.
  • His updates consistently focus on lowering developer friction through quickstarts, better errors, streaming support, and simple agent setup.
  • For AI PMs, his work is a strong signal of Google’s near-term product direction around agent orchestration, tool use, memory, and multimodal workflows.

Philipp Schmid

Overview

Philipp Schmid is a Google AI and developer-relations figure who appears repeatedly in AI product discourse as a hands-on presenter, explainer, and launcher of practical Gemini platform capabilities. In recent coverage, he is most closely associated with demonstrating Gemini Managed Agents and the Gemini Interactions API, especially the idea that developers can spin up agents with secure, hosted Linux sandboxes, memory, tool execution, and streaming behavior with minimal setup.

For AI Product Managers, Schmid matters because his work sits at the translation layer between cutting-edge model/platform capabilities and real developer adoption. His demos, guides, and launch explanations surface how Google wants teams to build agentic products: stateful interactions, structured execution steps, built-in tool orchestration, hosted sandboxes, multimodal retrieval, and on-device app integration. In practice, his updates provide a useful signal for where the Gemini ecosystem is becoming easier to productize.

Key Developments

  • 2026-05-05 — Highlighted quality-of-life improvements for the Gemini Interactions API, including clearer error messages with exact field names, invalid values, expected formats, supported enums, and precise field paths such as `input[0].name`.
  • 2026-05-06 — Announced Multi-Token Prediction for Gemma 4, describing a roughly 3x inference speed improvement with no quality loss for supported variants.
  • 2026-05-06 — Shared four subagent coordination patterns—tool calls, spawns, pools, and teams—providing a practical framework for multi-agent workflow design.
  • 2026-05-07 — Showcased the Gemini API File Search tool’s multimodal PDF and image retrieval workflow using `gemini-embedding-2`, with chunking, embedding, indexing, and grounding handled in one flow.
  • 2026-05-07 — Highlighted Gemma 4 performance on Code Arena, positioning open models as increasingly practical for local and cost-sensitive deployments.
  • 2026-05-08 — Shared updates to the Gemini Interactions API that replaced rigid `user`/`model` roles with structured interaction steps such as `user_input`, `thought`, `function_call`, `tool_call`, and `model_output`.
  • 2026-05-12 — Promoted the Gemini API interactions quickstart guide, helping developers get started quickly with the newer interaction model.
  • 2026-05-13 — Published a guide covering Interactions API `thought` steps and encrypted signatures, clarifying stateful vs. stateless operation, model switching, and context management.
  • 2026-05-14 — Announced that Android 16+ includes built-in on-device MCP support via the `@AppFunction` annotation, enabling agents such as Gemini to call app functions locally across apps without server/network hops.
  • 2026-05-19 — Published a guide for streaming in the Gemini Interactions API, aimed at making streaming application development easier.
  • 2026-05-22 — Built a GitHub Issue Triage Agent using a single `curl` call to the Gemini API, emphasizing low-friction agent construction.
  • 2026-05-23 — At Google I/O, demoed how to build an AI agent with its own secure, hosted Linux sandbox in a single API call using Gemini Managed Agents and the Interactions API.
  • 2026-05-23 — Clarified implementation details for managed sandboxes, including persistent container IDs per sandbox session, shareability across agents, and the tradeoff of persistence versus slower cold starts.

Relevance to AI PMs

1. A roadmap for production-grade agent UX Schmid’s demos make concrete what modern agent products increasingly require: structured steps, memory, tool invocation, streaming, and secure execution environments. AI PMs can use these patterns to shape agent product requirements beyond simple chat.

2. A practical guide to reducing developer friction
Many of his updates focus on making complex capabilities easier to adopt—single-call setup, better errors, quickstarts, and built-in retrieval/execution. For PMs, this is a reminder that platform adoption often hinges on onboarding, debuggability, and defaults as much as raw model quality.

3. Signal on Google’s agent platform direction
His posts consistently preview where the Gemini stack is maturing: managed agents, Interactions API abstractions, multimodal search, Android app-function interoperability, and open-model efficiency improvements. PMs evaluating vendor strategy can treat these as useful indicators of near-term platform capabilities.

Related

  • Google — Schmid is presented in the context of Google’s AI and developer ecosystem, especially launches tied to product adoption.
  • Google DeepMind — Relevant as the broader research and model organization behind many Gemini and Gemma capabilities discussed in adjacent coverage.
  • Gemini — The central model family connected to Schmid’s demos, guides, and platform updates.
  • Gemini API — Core API surface for many of the examples he shares, including quickstarts and single-call agent setups.
  • Gemini Interactions API / Interactions API — One of the strongest recurring associations; Schmid frequently explains its step-based interaction model, streaming, thought steps, and context handling.
  • Gemini Managed Agents — Closely tied to his Google I/O demo around hosted, sandboxed agent execution.
  • Managed Agents Quickstart — Connects to his efforts to lower setup friction for developers building hosted agents.
  • GitHub Issue Triage Agent — A concrete example he built to demonstrate lightweight agent construction on Gemini.
  • Gemma / Gemma 4 / FunctionGemma — Related through his posts on open-model performance, inference speedups, and practical deployment patterns.
  • gemini-embedding-2 — Connected via his promotion of multimodal File Search and retrieval workflows.
  • Android 16 / AppFunction / Android AI App Functions — Important adjacent area where Schmid highlighted on-device agent-to-app interoperability.
  • Google I/O — The event context for his high-visibility managed-agents sandbox demo.
  • Sundar Pichai — Mentioned as also covering related Interactions API developments, indicating executive-level amplification of the same direction.
  • Jeff Dean, Demis Hassabis, Sebastian Raschka, Simon Willison, Addy Osmani — Related ecosystem figures often adjacent in AI product and developer conversations, though Schmid’s role here is notably more implementation- and developer-guide-oriented.

Newsletter Mentions (50)

2026-05-23
Philipp Schmid at Google I/O demoed how to build an AI agent with its own secure, hosted Linux sandbox in a single API call using Gemini Managed Agents and the new Interactions API to execute code and manage its memory.

#8 𝕏 Philipp Schmid at Google I/O demoed how to build an AI agent with its own secure, hosted Linux sandbox in a single API call using Gemini Managed Agents and the new Interactions API to execute code and manage its memory. #9 𝕏 Philipp Schmid clarifies that each sandbox session gets its own persistent container ID—shareable across agents—and that they’re currently trading persistence for slower cold starts.

2026-05-22
Philipp Schmid built a GitHub Issue Triage Agent using a single curl to the Gemini API.

#9 𝕏 Philipp Schmid built a GitHub Issue Triage Agent using a single curl to the Gemini API.

2026-05-19
Philipp Schmid published a new guide for streaming in the Gemini Interactions API to make building streaming applications super easy.

#12 𝕏 Philipp Schmid published a new guide for streaming in the Gemini Interactions API to make building streaming applications super easy. Just point your agent to it and let it handle the rest.

2026-05-14
#2 𝕏 Philipp Schmid announced that Android 16+ now includes a built-in on-device MCP, letting developers tag app functions with a new `@AppFunction` annotation so agents like Gemini can call and chain them across apps without any server or network calls.

#2 𝕏 Philipp Schmid announced that Android 16+ now includes a built-in on-device MCP, letting developers tag app functions with a new `@AppFunction` annotation so agents like Gemini can call and chain them across apps without any server or network calls. #3 📝 OpenAI News Building a safe, effective sandbox to enable Codex on Windows - OpenAI built a custom, unelevated sandbox for Codex on Windows after finding AppContainer too narrowly scoped, Windows Sandbox incompatible with acting on the user's real checkout (and unavailable on Home), and MIC unsafe because relabeling workspaces lowers their integrity; the prototype gives the sandbox a distinct identity via synthetic SIDs and uses write-restricted tokens to limit where Codex can modify files. The default Codex sandbox runs commands with reduced permissions—allowing reads broadly, writes only inside the workspace, and no internet access unless explicitly enabled.

2026-05-13
#7 𝕏 Philipp Schmid published a guide for Gemini Interactions API’s new `thought` steps and encrypted signatures, detailing stateful vs. stateless modes, seamless model switching, and effortless context management to supercharge agent development.

#7 𝕏 Philipp Schmid published a guide for Gemini Interactions API’s new `thought` steps and encrypted signatures, detailing stateful vs. stateless modes, seamless model switching, and effortless context management to supercharge agent development. Also covered by: @Sundar Pichai

2026-05-12
Philipp Schmid shares Google’s Gemini API interactions quickstart guide, helping PM builders quickly set up and test the new Gemini AI model.

#20 𝕏 Philipp Schmid shares Google’s Gemini API interactions quickstart guide, helping PM builders quickly set up and test the new Gemini AI model. #21 𝕏 Lenny Rachitsky shares eight actionable insights from Eric Ries—spanning financial gravity, CEO retention post-IPO, public-benefit corp structures like AnthropicAI, mission protection, and principled decision-making exemplified by Cloudflare.

2026-05-08
#4 𝕏 Philipp Schmid updated the Gemini Interactions API to replace rigid `user`/`model` roles with discrete “steps” (user_input, thought, function_call, tool_call, model_output, etc.), consolidated response_format controls, and added a toggle in the docs.

The item describes API changes and new interaction structure for Gemini.

2026-05-07
Philipp Schmid : The Gemini API File Search tool now offers true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing and grounding in one call.

#4 𝕏 Philipp Schmid : The Gemini API File Search tool now offers true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing and grounding in one call. #15 𝕏 Philipp Schmid shows Gemma 4 pushing the Pareto frontier on Code Arena, with Gemma-4-31b at #13 and Gemma-4-26b-a4b at #17 among open models you can run on a MacBook Pro.

2026-05-06
Philipp Schmid launched Multi-Token Prediction for Gemma 4, tripling inference speed with zero quality loss—now available E2B/E4B under Apache 2.0.

#13 𝕏 Philipp Schmid launched Multi-Token Prediction for Gemma 4, tripling inference speed with zero quality loss—now available E2B/E4B under Apache 2.0. #14 𝕏 Philipp Schmid outlines four subagent coordination patterns—tool calls, spawns, pools, and teams—to structure multi-agent workflows.

2026-05-05
#9 𝕏 Philipp Schmid launched a QoL upgrade for the Interactions API: errors now name the exact field and bad value, list supported enum options, show expected vs. actual formats, and pinpoint field paths like `input[0].name`.

#8 𝕏 Logan Kilpatrick rolled out extensive error message improvements for the Interactions API, making its feedback far more human- and agent-readable. #9 𝕏 Philipp Schmid launched a QoL upgrade for the Interactions API: errors now name the exact field and bad value, list supported enum options, show expected vs. actual formats, and pinpoint field paths like `input[0].name`. #10 𝕏 Guillermo Rauch launched npx deepspec, an open-source agent orchestrator that leverages thousands of parallel coding agents in Vercel Sandbox to uncover critical security vulnerabilities in minutes. #11 ▶️ AI Agents run my business and life Greg Isenberg Andrew Wilkinson demonstrates running an autonomous SaaS business using Claude-based OpenClaw agents orchestrated in Harbor to handle support ticket triage (including auto-fixing P0 issues and merging PRs) and marketing campaigns via Post Hog and Meta/Reddit ads.

Related

Cursortool

An AI coding editor and automation platform. The newsletter highlights multi-repository support for automations across codebases.

Simon Willisonperson

Independent AI commentator and developer known for practical analysis of LLM products. Here he argues Anthropic and OpenAI have found product-market fit.

Google DeepMindcompany

Google's frontier AI lab. The newsletter references a Google Research privacy approach and Google I/O 2026 announcements, which are adjacent to DeepMind's broader ecosystem.

Vercelcompany

Vercel is the hosting platform used for the rapid prototype demo. It remains a common deployment choice for AI-built web apps and landing pages.

Geminitool

Google's AI assistant/model family mentioned as one of the systems that can answer category-level brand questions. It is presented alongside ChatGPT and Perplexity in the context of AI-driven visibility.

Googlecompany

A major AI platform and product company shipping Gemini models, Search AI features, and developer tools. Important for AI PMs because many of the newsletter’s launches reflect Google’s evolving AI ecosystem.

Sebastian Raschkaperson

An ML researcher and writer mentioned for highlighting Gated DeltaNet-2 and sharing a primer on Gated DeltaNet. Relevant for technical AI architecture discussion.

MCPconcept

A protocol used to connect AI agents to tools and data sources. The newsletter contrasts MCP with APIs as foundational plumbing for agent actions and prompt-evaluation workflows.

Google AI Studiotool

Google’s app-building and experimentation environment for Gemini. For AI PMs, it is a product surface for rapid prototyping, app creation, and workspace-integrated AI experiences.

Google Researchcompany

Google's research organization working on privacy-preserving analytics and other AI systems. The newsletter mentions a private analytics approach and NotebookLM features.

Demis Hassabisperson

Co-founder and CEO of Google DeepMind. He is mentioned in connection with Gemini 3.5 Flash and Google’s model launch.

Jeff Deanperson

Google AI leader and notable voice in model launches and research updates. Mentioned here in connection with Gemini 3.5 Flash and Google’s AI releases.

Sundar Pichaiperson

CEO of Google and Alphabet mentioned in the context of Google I/O and Gemini strategy. The newsletter cites him in a discussion about AI roadmap and product direction.

Gemini APItool

Google's API for building on Gemini models. Here it is used to power a GitHub issue triage agent and custom managed agents.

LangChaincompany

An AI application framework for building agents and chains. The newsletter highlights its Managed Deep Agents private preview for long-horizon agents.

AI agentsconcept

Autonomous or semi-autonomous software systems that can take actions, manage workflows, and assist with operational work. The newsletter references them in multiple founder and startup productivity contexts.

Gemma 4tool

A model name referenced as part of a survey of recent LLM architectures. It is notable here as an example of the current pace of model iteration and architecture experimentation.

Gemini Interactions APItool

The Gemini Interactions API is a Google Gemini interface for building streaming applications. The newsletter highlights a guide focused on making streaming easier for agents and developers.

AI Studiotool

Google’s developer-focused AI product, positioned for higher-level thinking and developer workflows. It is contrasted with the Gemini app’s consumer UX constraints.

Gemini 3tool

A Gemini model variant used here to power agentic workflow examples and multi-agent systems. It is relevant to AI PMs as an example of frontier model capability enabling more complex automated workflows.

Google Searchtool

Google’s search product, mentioned as another interface for detecting SynthID watermarks. It illustrates how AI safety features can be embedded into mainstream consumer search.

Gemini 3.1 Flash-Litetool

A Gemini model variant that was noted as moving out of preview status.

Deep Researchconcept

A workflow/mode for using AI systems to search the web, synthesize information, and produce detailed reports. The newsletter frames it as a practical capability for research-heavy PM work.

WebMCPtool

A W3C-backed browser extension that exposes website functionality to MCP-capable agents. It lets developers register site functions as structured tools in the browser.

Antigravitytool

A Google AI product or feature mentioned as part of the Google AI Pro bundle. The newsletter gives no deeper detail, but it is notable as a bundled AI offering.

Gemini Embedding 2tool

An embedding model powering multimodal file search in the Gemini API. Relevant for PMs designing retrieval, citation, and metadata-aware workflows.

Agent Skillsconcept

A framework for defining, managing, and retiring capabilities that AI agents can use. The newsletter frames it as an operational way to keep agent behavior current and useful.

Veo 3.1tool

Google’s video generation model with updates to portrait mode, visual consistency, and higher-resolution upscaling.

Gemini 3.1 Flash TTStool

A Google AI text-to-speech model with native multi-speaker dialogue support across many languages. It is positioned as part of the Gemini product family.

Google AI Edge Gallerytool

Google AI Edge Gallery is a Google tool for showcasing and running on-device AI experiences at the edge, including offline use cases.

Interactions APItool

A new API for executing code and managing agent memory in Google’s hosted sandbox workflow. It matters to AI PMs as part of the control plane for agent execution.

Google Mapstool

Google’s mapping product used as a grounding source in AI Studio. It is mentioned as part of building location-aware, citation-backed apps.

MCP CLItool

An open-source command-line tool for dynamic discovery of Model Context Protocol servers. It is described as reducing MCP token usage and improving AI agent tool interactions.

Stay updated on Philipp Schmid

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free