Gemini API
Google's API for building on Gemini models. Here it is used to power a GitHub issue triage agent and custom managed agents.
Key Highlights
- Gemini API evolved from model access into a broader product platform with retrieval, research, webhooks, and managed-agent capabilities.
- Recent launches emphasize practical shipping features for PMs, including File Search, Deep Research upgrades, and asynchronous workflow support.
- The API is already being used for concrete internal automations like GitHub issue triage agents and custom managed agents.
- Cost governance features such as billing caps and per-project spend limits make Gemini API more viable for production planning.
- Its tight connection to Google AI Studio lowers the barrier from prototype to operational deployment.
Gemini API
Overview
Gemini API is Google’s developer platform for building applications on Gemini models, including text, multimodal, search-grounded, retrieval, and agentic workflows. In this corpus, it appears as the underlying tool for practical builder use cases such as a GitHub issue triage agent, custom managed agents, Deep Research workflows, and multimodal file retrieval. It is closely tied to Google AI Studio, where teams can prototype, manage usage, and operationalize Gemini-powered experiences.For AI Product Managers, Gemini API matters because it combines frontier model access with increasingly product-ready primitives: managed agents, File Search, webhooks for long-running tasks, Deep Research, embeddings, and workspace-oriented integrations. That makes it useful not just for experimentation, but for shipping production workflows with clearer controls over cost, orchestration, retrieval quality, and user-facing experiences.
Key Developments
- 2026-03-28: Google introduced monthly spending caps for Gemini API billing tiers, with usage pausing once limits are reached until the next month or a tier upgrade. Faster automated tier upgrades and per-project spend limits in AI Studio were also highlighted.
- 2026-04-22: Deep Research in the Gemini API received upgrades including improved quality, MCP support, and native chart and infographic generation, expanding its usefulness for structured research workflows.
- 2026-04-25: Collaborative planning launched for Deep Research via a `collaborative_planning` flag, enabling users to request and iteratively refine draft research outlines.
- 2026-04-30: A developer guide showed how to build and run Deep Research workflows with the Gemini API, covering setup, workflow construction, and execution of research queries.
- 2026-05-05: Webhooks shipped in the Gemini API for long-running tasks such as batch jobs, agents, and GenMedia workflows, improving asynchronous workflow design.
- 2026-05-06: Google launched a multimodal File Search tool in the Gemini API powered by Gemini Embedding 2, adding custom metadata, inline citations, free storage, and on-demand embedding generation.
- 2026-05-07: File Search expanded to true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing, and grounding in a single call.
- 2026-05-12: A Gemini API interactions quickstart was shared to help builders rapidly set up and test the model through simple developer workflows.
- 2026-05-20: Major updates landed across Google AI Studio and the Gemini API, including Gemini 3.5 Flash, managed agents with the antigravity harness, native Android app creation in AI Studio, workspace integrations, and one-click antigravity export.
- 2026-05-22: Philipp Schmid demonstrated a GitHub Issue Triage Agent built with a single curl call to the Gemini API, showcasing how quickly teams can prototype useful internal automation.
Relevance to AI PMs
1. Prototype internal AI workflows quickly: Gemini API supports fast experimentation for practical automations like issue triage, research assistants, and managed agents. PMs can validate a use case with lightweight implementation before committing engineering resources.2. Build retrieval and research features without assembling every layer manually: With File Search, embeddings, grounding, and Deep Research capabilities, teams can ship document Q&A, multimodal retrieval, and structured research experiences faster, with fewer separate infrastructure decisions.
3. Manage production concerns earlier: Billing caps, per-project spend limits, webhooks for long-running jobs, and managed-agent tooling help PMs think beyond demos. These features support planning around cost control, reliability, asynchronous UX, and scaling from prototype to production.
Related
- Google AI Studio / AI Studio: Closely connected as the primary environment for testing, configuring, and managing Gemini API projects and spend controls.
- Google: Parent platform behind Gemini models and the broader ecosystem around the API.
- Gemini 3.5 Flash / Gemini 3.1 Flash Lite / Gemini 3 Pro Preview: Model variants surfaced through the API for different speed, quality, and cost tradeoffs.
- Gemini Embedding 2: The embedding model powering File Search and multimodal retrieval workflows.
- File Search: A major Gemini API capability for retrieval over PDFs, images, and other content with indexing and grounding built in.
- Deep Research: A higher-level research workflow capability in the Gemini API, enhanced with collaborative planning, MCP support, and visual outputs.
- Managed Agents / Managed Agents Quickstart: Related orchestration layer for building and deploying agent workflows on top of Gemini API.
- GitHub Issue Triage Agent: A concrete example of Gemini API being used for internal automation and developer tooling.
- Philipp Schmid and Logan Kilpatrick: Frequent sources of product updates, examples, and quickstarts related to Gemini API launches.
- Sundar Pichai: Mentioned in connection with major Deep Research upgrades in the Gemini API.
Newsletter Mentions (18)
“Philipp Schmid built a GitHub Issue Triage Agent using a single curl to the Gemini API.”
#9 𝕏 Philipp Schmid built a GitHub Issue Triage Agent using a single curl to the Gemini API.
“Logan Kilpatrick launched major updates to Google AI Studio and the Gemini API—Gemini 3.5 Flash, managed agents with the antigravity harness, native Android app creation in AI Studio, workspace integrations, and one-click antigravity export.”
#17 𝕏 Logan Kilpatrick launched major updates to Google AI Studio and the Gemini API—Gemini 3.5 Flash, managed agents with the antigravity harness, native Android app creation in AI Studio, workspace integrations, and one-click antigravity export.
“Philipp Schmid shares Google’s Gemini API interactions quickstart guide, helping PM builders quickly set up and test the new Gemini AI model.”
#20 𝕏 Philipp Schmid shares Google’s Gemini API interactions quickstart guide, helping PM builders quickly set up and test the new Gemini AI model. #21 𝕏 Lenny Rachitsky shares eight actionable insights from Eric Ries—spanning financial gravity, CEO retention post-IPO, public-benefit corp structures like AnthropicAI, mission protection, and principled decision-making exemplified by Cloudflare.
“The Gemini API File Search tool now offers true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing and grounding in one call.”
#4 𝕏 Philipp Schmid : The Gemini API File Search tool now offers true multimodal PDF and image retrieval using `gemini-embedding-2`, handling chunking, embedding, indexing and grounding in one call. #5 𝕏 Google DeepMind partners with EVE Online’s developers to use the game’s complex, player-driven universe as a sandbox for AI agents focused on memory, continual learning, and long-term planning.
“Logan Kilpatrick launched a multi-modal File Search tool in the Gemini API powered by Gemini Embedding 2, now with custom metadata, inline citations, and free storage plus on-demand embedding generation.”
#4 𝕏 Logan Kilpatrick launched a multi-modal File Search tool in the Gemini API powered by Gemini Embedding 2, now with custom metadata, inline citations, and free storage plus on-demand embedding generation.
“Google ships webhooks in Gemini API for long-running tasks #1 𝕏 xAI launched emotion-rich voice cloning on its Grok Voice API, now live for developers to generate AI voices nearly indistinguishable from human speech.”
Google ships webhooks in Gemini API for long-running tasks #1 𝕏 xAI launched emotion-rich voice cloning on its Grok Voice API, now live for developers to generate AI voices nearly indistinguishable from human speech. #2 𝕏 Logan Kilpatrick shipped Webhooks in the Gemini API to streamline developer workflows for long-running tasks like batch jobs, agents, and GenMedia. #3 𝕏 NVIDIA AI launched cuOpt Agent Skills, delivering GPU-accelerated decision optimization for supply-chain planning.
“#10 𝕏 Philipp Schmid published a developer getting-started guide on building and running Deep Research workflows with the Gemini API, covering API setup, workflow construction, and executing deep research queries.”
#10 𝕏 Philipp Schmid published a developer getting-started guide on building and running Deep Research workflows with the Gemini API, covering API setup, workflow construction, and executing deep research queries. #11 𝕏 Cursor launched the Cursor SDK, letting PM Builders spin up agents with the same runtime, harness, and models that power Cursor.
“Philipp Schmid launched collaborative planning in the Gemini API’s Deep Research, letting you use a `collaborative_planning` flag to request and iterate on a draft research outline (e.g., “add a section on power efficiency”).”
#6 𝕏 Philipp Schmid launched collaborative planning in the Gemini API’s Deep Research, letting you use a `collaborative_planning` flag to request and iterate on a draft research outline (e.g., “add a section on power efficiency”).
“Sundar Pichai launched two upgrades to Deep Research in the Gemini API—improved quality, MCP support, and native chart/infographic generation.”
#3 𝕏 Sundar Pichai launched two upgrades to Deep Research in the Gemini API—improved quality, MCP support, and native chart/infographic generation. Deep Research now delivers speed and efficiency, while a new Max mode offers top-tier context synthesis, hitting 93.
“#6 𝕏 Philipp Schmid : Starting April 1, the Gemini API billing tiers get monthly spending caps that pause the API once reached (resuming next month or upon upgrade), with faster automated tier upgrades.”
#6 𝕏 Philipp Schmid : Starting April 1, the Gemini API billing tiers get monthly spending caps that pause the API once reached (resuming next month or upon upgrade), with faster automated tier upgrades. You can also set per-project spend limits directly in AI Studio.
Related
A Google AI/Developer Relations figure mentioned for demonstrating Gemini Managed Agents and the Interactions API. He appears here as a presenter explaining hosted sandboxed agent execution.
A Google AI product leader mentioned for announcing Lyria 3 availability via API. The newsletter credits him with a distribution update relevant to developers.
Google's AI assistant/model family mentioned as one of the systems that can answer category-level brand questions. It is presented alongside ChatGPT and Perplexity in the context of AI-driven visibility.
A major AI platform and product company shipping Gemini models, Search AI features, and developer tools. Important for AI PMs because many of the newsletter’s launches reflect Google’s evolving AI ecosystem.
Google's research organization working on privacy-preserving analytics and other AI systems. The newsletter mentions a private analytics approach and NotebookLM features.
Google’s app-building and experimentation environment for Gemini. For AI PMs, it is a product surface for rapid prototyping, app creation, and workspace-integrated AI experiences.
CEO of Google and Alphabet mentioned in the context of Google I/O and Gemini strategy. The newsletter cites him in a discussion about AI roadmap and product direction.
Google’s developer-focused AI product, positioned for higher-level thinking and developer workflows. It is contrasted with the Gemini app’s consumer UX constraints.
A Gemini model variant highlighted for strong cost-per-intelligence performance. The newsletter frames it as especially efficient for simulated store operations on Vending Bench.
Google’s search product, mentioned as another interface for detecting SynthID watermarks. It illustrates how AI safety features can be embedded into mainstream consumer search.
A Gemini model variant that was noted as moving out of preview status.
A workflow/mode for using AI systems to search the web, synthesize information, and produce detailed reports. The newsletter frames it as a practical capability for research-heavy PM work.
An embedding model powering multimodal file search in the Gemini API. Relevant for PMs designing retrieval, citation, and metadata-aware workflows.
Google’s mapping product used as a grounding source in AI Studio. It is mentioned as part of building location-aware, citation-backed apps.
Stay updated on Gemini API
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free