Gemini API
Google’s developer API for Gemini, mentioned via an interactions quickstart guide. It is relevant for PM builders who need to prototype and test model capabilities quickly.
Key Highlights
- Gemini API is Google’s developer API for Gemini models, multimodal workflows, retrieval, and tool orchestration.
- Recent updates focused on Deep Research, File Search, webhooks, and built-in chaining of Google tools with custom functions.
- It is especially relevant for AI PMs who need to prototype grounded, agentic, and multimodal product experiences quickly.
- Billing caps and project-level spend controls make it easier to run experiments with clearer budget guardrails.
- Quickstart guides and interactive tooling lower the barrier to testing new Gemini capabilities in real product scenarios.
Gemini API
Overview
Gemini API is Google’s developer-facing API for accessing Gemini models and related capabilities such as multimodal generation, tool use, retrieval, deep research workflows, and long-running task orchestration. In the newsletter, it shows up repeatedly as a practical builder tool for quickly testing new Gemini model features through quickstarts, interactive docs, and API primitives that reduce the amount of infrastructure teams need to assemble themselves.For AI Product Managers, Gemini API matters because it sits at the intersection of prototyping speed and production-enabling features. It is not just a text-generation endpoint: recent updates highlight built-in tool orchestration, file search, webhooks, multimodal retrieval, Deep Research, billing controls, and support for workflows that would otherwise require stitching together several vendors. That makes it useful for PMs validating copilots, research agents, search experiences, and multimodal product concepts.
Key Developments
- 2026-03-19: Gemini API added orchestration across built-in tools and custom functions in a single API call, including Google Search, Google Maps on Gemini 3, File Search, and URL Context. Coverage also noted a revamped Python client and interactive docs with live examples.
- 2026-03-25: A tool-combination feature was highlighted that chains Google Search and custom functions in one request, with Gemini selecting tools, ordering execution, and passing context across steps automatically.
- 2026-03-28: Google introduced monthly spending caps for Gemini API billing tiers, pausing usage once limits are reached until the next month or a tier upgrade. Per-project spend limits in AI Studio were also mentioned.
- 2026-04-22: Deep Research in the Gemini API received upgrades for better quality, MCP support, and native chart/infographic generation.
- 2026-04-25: Collaborative planning launched for Deep Research, allowing developers to request and iteratively refine a draft research outline using a `collaborative_planning` flag.
- 2026-04-30: A getting-started guide showed how to build and run Deep Research workflows with the Gemini API, covering setup, workflow design, and execution.
- 2026-05-05: Webhooks shipped in the Gemini API for long-running tasks such as batch jobs, agents, and GenMedia workflows, improving asynchronous developer workflows.
- 2026-05-06: A multimodal File Search tool launched in the Gemini API powered by Gemini Embedding 2, adding custom metadata, inline citations, free storage, and on-demand embedding generation.
- 2026-05-07: File Search expanded to true multimodal PDF and image retrieval with `gemini-embedding-2`, handling chunking, embedding, indexing, and grounding in a single call.
- 2026-05-12: Philipp Schmid shared a Gemini API interactions quickstart guide, emphasizing fast setup for PM builders who want to test new Gemini model capabilities quickly.
Relevance to AI PMs
- Prototype agentic workflows faster: PMs can validate search, retrieval, and action-taking product concepts without building a full orchestration layer from scratch, thanks to built-in tool chaining across Google Search, Maps, File Search, and custom functions.
- Test multimodal and research-heavy UX: The API’s Deep Research and File Search features make it easier to prototype workflows involving PDFs, images, citations, structured outputs, charts, and research summaries.
- Manage risk during experimentation: Billing caps, per-project spend controls, and webhook support help PMs run pilots with clearer cost guardrails and more production-ready handling of long-running jobs.
Related
- Google AI Studio / AI Studio: Closely connected as the developer environment used to configure projects, manage spend limits, and explore Gemini API capabilities.
- Google: Parent platform behind Gemini API and the broader Gemini ecosystem.
- Gemini 3 Pro Preview / Gemini 3.1 Flash Lite: Related Gemini model variants that PMs may access through the API depending on latency, cost, and capability needs.
- Gemini Embedding 2: Powers newer File Search and multimodal retrieval workflows in the API.
- File Search: A major Gemini API capability for retrieval over documents and images, with built-in chunking, embedding, indexing, and grounding.
- Google Search and Google Maps: Built-in tools that can be orchestrated inside Gemini API requests for more grounded, action-oriented experiences.
- Deep Research: A higher-level Gemini API workflow for synthesis, planning, and research generation, with features like collaborative planning and native charts.
- Philipp Schmid, Logan Kilpatrick, Sundar Pichai: Key individuals who surfaced product updates, tutorials, and launches related to Gemini API in the newsletter.
Newsletter Mentions (16)
“Philipp Schmid shares Google’s Gemini API interactions quickstart guide, helping PM builders quickly set up and test the new Gemini AI model.”
#20 𝕏 Philipp Schmid shares Google’s Gemini API interactions quickstart guide, helping PM builders quickly set up and test the new Gemini AI model. #21 𝕏 Lenny Rachitsky shares eight actionable insights from Eric Ries—spanning financial gravity, CEO retention post-IPO, public-benefit corp structures like AnthropicAI, mission protection, and principled decision-making exemplified by Cloudflare.
“The Gemini API File Search tool now offers true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing and grounding in one call.”
#4 𝕏 Philipp Schmid : The Gemini API File Search tool now offers true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing and grounding in one call. #5 𝕏 Google DeepMind partners with EVE Online’s developers to use the game’s complex, player-driven universe as a sandbox for AI agents focused on memory, continual learning, and long-term planning.
“Logan Kilpatrick launched a multi-modal File Search tool in the Gemini API powered by Gemini Embedding 2, now with custom metadata, inline citations, and free storage plus on-demand embedding generation.”
#4 𝕏 Logan Kilpatrick launched a multi-modal File Search tool in the Gemini API powered by Gemini Embedding 2, now with custom metadata, inline citations, and free storage plus on-demand embedding generation.
“Google ships webhooks in Gemini API for long-running tasks #1 𝕏 xAI launched emotion-rich voice cloning on its Grok Voice API, now live for developers to generate AI voices nearly indistinguishable from human speech.”
Google ships webhooks in Gemini API for long-running tasks #1 𝕏 xAI launched emotion-rich voice cloning on its Grok Voice API, now live for developers to generate AI voices nearly indistinguishable from human speech. #2 𝕏 Logan Kilpatrick shipped Webhooks in the Gemini API to streamline developer workflows for long-running tasks like batch jobs, agents, and GenMedia. #3 𝕏 NVIDIA AI launched cuOpt Agent Skills, delivering GPU-accelerated decision optimization for supply-chain planning.
“#10 𝕏 Philipp Schmid published a developer getting-started guide on building and running Deep Research workflows with the Gemini API, covering API setup, workflow construction, and executing deep research queries.”
#10 𝕏 Philipp Schmid published a developer getting-started guide on building and running Deep Research workflows with the Gemini API, covering API setup, workflow construction, and executing deep research queries. #11 𝕏 Cursor launched the Cursor SDK, letting PM Builders spin up agents with the same runtime, harness, and models that power Cursor.
“Philipp Schmid launched collaborative planning in the Gemini API’s Deep Research, letting you use a `collaborative_planning` flag to request and iterate on a draft research outline (e.g., “add a section on power efficiency”).”
#6 𝕏 Philipp Schmid launched collaborative planning in the Gemini API’s Deep Research, letting you use a `collaborative_planning` flag to request and iterate on a draft research outline (e.g., “add a section on power efficiency”).
“Sundar Pichai launched two upgrades to Deep Research in the Gemini API—improved quality, MCP support, and native chart/infographic generation.”
#3 𝕏 Sundar Pichai launched two upgrades to Deep Research in the Gemini API—improved quality, MCP support, and native chart/infographic generation. Deep Research now delivers speed and efficiency, while a new Max mode offers top-tier context synthesis, hitting 93.
“#6 𝕏 Philipp Schmid : Starting April 1, the Gemini API billing tiers get monthly spending caps that pause the API once reached (resuming next month or upon upgrade), with faster automated tier upgrades.”
#6 𝕏 Philipp Schmid : Starting April 1, the Gemini API billing tiers get monthly spending caps that pause the API once reached (resuming next month or upon upgrade), with faster automated tier upgrades. You can also set per-project spend limits directly in AI Studio.
“#13 𝕏 Philipp Schmid released a Gemini API tool-combination feature that chains Google Search and custom functions in a single request—Gemini automatically picks the tools, orders them, and circulates context.”
#13 𝕏 Philipp Schmid released a Gemini API tool-combination feature that chains Google Search and custom functions in a single request—Gemini automatically picks the tools, orders them, and circulates context. #14 𝕏 Santiago shows how to deploy a Claw autonomous agent on Blink’s platform in four steps—describe the task, deploy, add tools and channels—so you can bypass 10 hours of setup and be running in minutes.
“Philipp Schmid announced that the Gemini API now combines built-in tools (Google Search, Google Maps on Gemini 3, File Search, URL Context) with custom functions in a single orchestrated API call, complete with automated tool chaining and signature-based context.”
#3 𝕏 Philipp Schmid shared that Gemini API now lets you combine built-in tools like browser and calculator with custom function calls, backed by a revamped Python client and interactive docs with live examples. #4 𝕏 Philipp Schmid announced that the Gemini API now combines built-in tools (Google Search, Google Maps on Gemini 3, File Search, URL Context) with custom functions in a single orchestrated API call, complete with automated tool chaining and signature-based context.
Related
An AI developer advocate/researcher mentioned for announcing Android 16’s on-device MCP and Android AI App Functions. He is presented as a voice on developer platform capabilities for agents.
Google's AI assistant/model family mentioned as one of the systems that can answer category-level brand questions. It is presented alongside ChatGPT and Perplexity in the context of AI-driven visibility.
A product lead associated here with Gemini API and AI Studio announcements. Known for shipping developer-facing AI product features.
The company behind Gemini, referenced through a Gemini API quickstart guide. It is relevant for model access and developer onboarding.
Google’s environment for building and experimenting with Gemini-powered apps and prototypes. It appears here as the venue for interactive UI experiments and an intelligent mouse pointer prototype.
Google’s research organization, mentioned for partnering with CGIAR on an AI crop-breeding model.
CEO of Google and Alphabet. He is cited here as the announcer of Gemini Intelligence at Android Show I/O.
Google’s app-building environment for experimenting with model-powered workflows and UI editing. PMs may use it for rapid prototyping and vibe coding.
A Gemini model variant that was noted as moving out of preview status.
An embedding model powering multimodal file search in the Gemini API. Relevant for PMs designing retrieval, citation, and metadata-aware workflows.
Google’s search product, mentioned here in the context of translation improvements powered by Gemini LLMs. The newsletter frames this as an example of AI being embedded into core search infrastructure.
A workflow/mode for using AI systems to search the web, synthesize information, and produce detailed reports. The newsletter frames it as a practical capability for research-heavy PM work.
Google’s mapping product used as a grounding source in AI Studio. It is mentioned as part of building location-aware, citation-backed apps.
Stay updated on Gemini API
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free