company41 mentions· Updated Jul 11, 2026

Cognition

A customer company cited using Claude Fable 5 for around-the-clock work. For PMs, it provides a production example of enterprise adoption of frontier coding models.

Key Highlights

Cognition is best known here as the company behind Devin and a growing stack of agentic software-engineering products.
Its recent launches span coding evals, cloud agent handoff, repo-scale reasoning, security remediation, and lower-cost model orchestration.
SWE-1.7 positions Cognition as not just an app layer company, but also a builder of specialized coding models.
A July 2026 customer story shows Cognition using Claude Fable 5 for around-the-clock work, making it a notable enterprise adoption example.
For AI PMs, Cognition is a practical reference for designing trustworthy, reviewable, production-ready coding agent workflows.

Cognition

Overview

Cognition is an AI company best known in this corpus for building Devin and related agentic software-engineering systems. Across recent mentions, the company appears as both a product builder and an applied AI lab focused on autonomous coding workflows, large-repo reasoning, security analysis, evaluation infrastructure, and cost/performance optimization for coding agents. It is also cited as a real-world enterprise user of frontier models, including a customer story about using Claude Fable 5 for around-the-clock work.

For AI Product Managers, Cognition matters because it offers a concrete view into what production-grade AI software agents look like beyond demos. The company’s launches span the full stack of agentic product design: model development (SWE-1.7), orchestration (Devin Fusion), persistence and cloud execution (/handoff), repo-scale reasoning (Agentic MapReduce), security workflows (Security Swarm), and evaluation standards (FrontierCode). That makes Cognition a useful reference point for PMs thinking about reliability, UX, trust, benchmarks, human review loops, and enterprise adoption of coding agents.

Key Developments

2026-06-09: Cognition launched FrontierCode, a coding evaluation platform positioned as a higher-difficulty, higher-quality benchmark, with each task reportedly crafted over 40+ hours by top open-source maintainers.
2026-06-12: Cognition open sourced /handoff, a Devin CLI feature that allows agents to continue running in the cloud after a user closes their laptop.
2026-06-19: Cognition highlighted Devin’s ability to reason across an entire codebase to uncover deep business-logic security flaws that pattern-matching scanners can miss.
2026-06-25: Cognition described an automated QA workflow in Devin where users approve a test plan before PR review, then receive a screen recording and step-by-step QA checklist as evidence.
2026-06-27: Cognition shared how MetaviewAI used Devin to compress a typically weeks-long SOC2 audit workflow into two days.
2026-06-30: Cognition launched Devin Fusion, a hybrid-model harness for agentic coding designed to reduce routing issues and cut the cost of Fable-level intelligence by 35% while still producing merge-ready code.
2026-07-02: Cognition introduced Agentic MapReduce, an architecture for whole-codebase reasoning that maps relevant signals across large repositories and dispatches focused agents over bounded shards.
2026-07-02: Cognition also launched Security Swarm, a Devin security capability that combines vulnerability discovery, runtime validation, and automatic remediation PRs.
2026-07-09: Cognition launched SWE-1.7, described as its most capable model yet, scoring within a few points of top frontier models at a fraction of the cost and running at roughly 1000 tokens/second.
2026-07-10: Cognition said SWE-1.7 was built on open-source Kimi K2.7 and fine-tuned for trustworthiness, aiming to match US models on eval benchmarks while handling tasks other models often refuse.
2026-07-11: A customer story described how Cognition uses Claude Fable 5 for around-the-clock work, serving as a production example of enterprise adoption and trust in frontier coding models.

Relevance to AI PMs

1. A blueprint for agentic coding product design. Cognition shows how to package AI coding into a usable product system, not just a model. PMs can study features like PR review flows, QA evidence, cloud handoff, and security remediation to understand what drives trust and repeat usage.

2. A case study in reliability and orchestration. Launches like Devin Fusion, Agentic MapReduce, and SWE-1.7 illustrate that performance depends on routing, architecture, and workflow design as much as raw model quality. PMs can use this framing when prioritizing infra, evals, and fallback behavior.

3. A practical example of enterprise adoption. The Claude Fable 5 customer story and security/QA workflows show how frontier models get deployed in real production settings. PMs can use these examples when thinking about governance, review gates, auditability, and ROI narratives for enterprise buyers.

Devin: Cognition’s flagship AI software engineer product and the main surface through which many of these capabilities are delivered.
Claude Fable 5 / Fable: Connected through a customer story about around-the-clock work, showing Cognition as a production user of frontier coding models.
SWE-1.7, Kimi K2.7, FrontierCode: Represent Cognition’s work on model development and evaluation for coding tasks.
Devin Fusion, Agentic MapReduce, /handoff: Infrastructure and orchestration layers that improve cost, persistence, and repo-scale reasoning.
Security Swarm, bug-catcher, swe-check: Related to security and QA automation workflows around code analysis and remediation.
GitHub, pull requests, git diffs, Slack, Linear: Key workflow surfaces where Cognition-style agents fit into modern software team operations.
MetaviewAI, Fortune 500 companies, cobol modernization, Infosys, Rivian, Volkswagen, AstraZeneca, Evinova, ITA: Broader enterprise and transformation contexts connected to agentic coding adoption and modernization use cases.

Newsletter Mentions (41)

2026-07-11

“A customer story describing how Cognition uses Claude Fable 5 for around-the-clock work, highlighting enterprise AI and coding use cases.”

#20 📝 Claude Code Blog Working at the frontier: How Cognition trusts Claude Fable 5 to work through the night - A customer story describing how Cognition uses Claude Fable 5 for around-the-clock work, highlighting enterprise AI and coding use cases. The piece illustrates trust and reliability of Claude Fable 5 in production workflows. #21 ▶️ Grok 4.5 is a bigger deal than Fable 5 Greg Isenberg Uses Grok 4.5 inside a Hermes agent on Orgo—with connectors like Agent Mail, Agent Phone, Agent Card, Composio, Idea Browser MCP, X MCP, and vidIQ—to autonomously provision cloud VMs, craft a startup landing page in ~40 seconds, and generate startup ideas, video thumbnails, market insights, and a cold-email sequence in one session.

2026-07-10

“Cognition launched SWE-1.7 built on open-source Kimi K2.7—which handles 87% of tasks other models refuse over human-rights concerns—and fine-tuned it for trustworthiness to match US models on eval benchmarks.”

The note presents Cognition’s model launch as an attempt to improve reliability and benchmark performance.

2026-07-09

“Cognition launched SWE-1.7, their most capable model yet, scoring within a few points of top frontier models at a fraction of the cost and running at 1000 tok/s.”

Today's top 25 insights for PM Builders, ranked by relevance from X, Blogs, and YouTube. OpenAI launches GPT-Live full-duplex voice API #1 𝕏 Sam Altman announced that GPT-5.6 Sol launches Thursday, urging builders to start integrating and experimenting with the new model. #2 📝 OpenAI News Introducing GPT-Live - OpenAI is launching GPT‑Live, a full‑duplex voice model that can listen and speak simultaneously, use conversational cues like “mhmm,” and delegate deeper searches or reasoning to GPT‑5.5 in the background; two versions (GPT‑Live‑1 and GPT‑Live‑1 mini) are rolling out to ChatGPT users globally today with an API sign‑up available. OpenAI says over 150 million people use ChatGPT voice weekly, reports users strongly prefer GPT‑Live to Advanced Voice Mode (GPT‑Live‑1 preferred ~75.7%), and shows large evaluation gains — GPQA rising from 45.3% (AVM) to up to 84.2% and BrowseComp from 0.7% to up to 75.2%. Also covered by: @Sam Altman #3 𝕏 OpenAI rolled out GPT-Live voice models in ChatGPT on iOS, Android, and web starting today (full rollout over the next few days), with API access coming soon—just tap the Voice button to talk with ChatGPT. Also covered by: @Sam Altman #4 𝕏 Mistral AI launched Robostral Navigate, its first embodied navigation model with 8B parameters that guides robots to perform natural-language specified tasks using a single RGB camera. It achieves state-of-the-art results on the R2R-CE benchmark. #5 𝕏 Logan Kilpatrick rolled out “import from GitHub” in Google AI Studio Build, automagically converting your repo into a runtime-compatible format. Now you can seamlessly iterate on it in AI Studio, deploy it, and more. #6 📝 OpenAI News Separating signal from noise in coding evaluations - A detailed audit of SWE-Bench Pro estimates roughly 30% of tasks are broken—an automated pipeline flagged 200 (27.4%) and human annotators found 249 (34.1%)—primarily due to overly strict tests, underspecified prompts, low-coverage tests, and misleading prompts.

2026-07-02

“Cognition built Agentic MapReduce, a new architecture for whole-codebase reasoning that maps relevant signals across large repos and fans out focused agents over bounded shards.”

#7 𝕏 Cognition built Agentic MapReduce, a new architecture for whole-codebase reasoning that maps relevant signals across large repos and fans out focused agents over bounded shards. #8 𝕏 Cognition launched Security Swarm—a new pillar of Devin for Security that bundles tools to find vulnerabilities, validate them at runtime, and automatically ship remediation PRs.

2026-06-30

“#7 𝕏 Cognition built Devin Fusion, a hybrid-model harness for agentic coding that overcomes conventional routing issues and cuts the cost of Fable-level intelligence by 35%.”

#7 𝕏 Cognition built Devin Fusion, a hybrid-model harness for agentic coding that overcomes conventional routing issues and cuts the cost of Fable-level intelligence by 35%. It still delivers merge-ready code that feels good to use.

2026-06-27

“Cognition shows how @s16h_ and the @MetaviewAI team leveraged Devin to complete a typically weeks-long SOC2 audit in just two days.”

#9 𝕏 Cognition shows how @s16h_ and the @MetaviewAI team leveraged Devin to complete a typically weeks-long SOC2 audit in just two days.

2026-06-25

“Cognition : Devin built an automated QA workflow where you review and approve a test plan before PR review, then receive a screen recording with a visual, step-by-step QA checklist.”

Cognition is mentioned in connection with Devin and an automated QA workflow. The workflow is focused on review gates and evidence-rich testing.

2026-06-19

“Cognition ’s Devin tool reasons across your entire codebase to uncover deep business-logic flaws—like an unauthenticated password-reset endpoint—by tracing request flows through auth layers that pattern-matching scanners miss.”

📝 𝕏 Cognition ’s Devin tool reasons across your entire codebase to uncover deep business-logic flaws—like an unauthenticated password-reset endpoint—by tracing request flows through auth layers that pattern-matching scanners miss.

2026-06-12

“#15 𝕏 Cognition open sourced /handoff, the Devin CLI feature that lets your agents keep running in the cloud even after you close your laptop.”

#15 𝕏 Cognition open sourced /handoff, the Devin CLI feature that lets your agents keep running in the cloud even after you close your laptop.

2026-06-09

“𝕏 Cognition launched FrontierCode, a coding evaluation platform setting a new standard in difficulty and quality with each task crafted over 40+ hours by top open-source maintainers.”

GenAI PM Daily June 09, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 25 insights for PM Builders, ranked by relevance from X, Blogs, and YouTube. NotebookLM update adds PDF, DOCX, XLSX, PPTX exports and chart support for better research #1 𝕏 Philipp Schmid released new QAT Gemma 4 checkpoints that match original performance while using ~4× less memory, plus a mobile quantization format shrinking Gemma 4 E2B’s footprint to just 1 GB. They’re now available on Hugging Face and ready to run. #2 𝕏 NVIDIA AI shows how to train models faster with JAX and MaxText using NVFP4 precision on NVIDIA Blackwell GPUs, sharing detailed benchmarks, a full recipe breakdown, and a MaxText example. #3 𝕏 Cognition launched FrontierCode, a coding evaluation platform setting a new standard in difficulty and quality with each task crafted over 40+ hours by top open-source maintainers. #4 𝕏 Josh Woodward unveiled a new NotebookLM feature that lets you expand searches beyond your own source files. Today’s update adds export options—PDF, DOCX, XLSX, PPTX and charts—to help you do better research.

Devintool

An AI software engineering product from Cognition. The newsletter references its security-focused extension, indicating product expansion into vulnerability detection and remediation.

AI agentsconcept

Systems that use models plus tools, memory, and planning to perform multi-step tasks autonomously or semi-autonomously. The newsletter references both agent architectures and agentic coding/workflows.

Linearcompany

Work management product used here as the task backbone for autonomous coding agents. Relevant to AI PMs for agent-state management and human-in-the-loop reviews.

Slacktool

A workplace messaging platform used as a source of context, feedback, and automated triggers inside agent workflows. In this newsletter it is a key integration for product operations.

GitHubcompany

The software development platform where ClawSweeper is hosted. In this issue it appears as the project home for an open-source triage tool.

Claude Fable 5tool

A Claude model used by Cognition for overnight work and production workflows. For AI PMs, it signals trust, reliability, and enterprise readiness for coding tasks.

Devin Reviewtool

A reimagined code review interface from Cognition that groups related changes and flags issues by confidence and severity. Useful as an example of AI-native developer workflow design.

GLM-5tool

A model released on Windsurf with a limited-time launch discount. It is relevant as another model option available to developers.

Computertool

A product access offering mentioned in the context of pricing tiers and credits. It appears to be part of a broader AI product subscription structure.

COBOL modernizationconcept

The process of updating legacy COBOL systems, often for enterprise migration and maintenance. AI agents are increasingly positioned as tools to accelerate this high-friction modernization work.

Rusttool

A systems programming language used here as the implementation target for an AI-assisted rewrite of Bun.

COBOLconcept

A legacy programming language often targeted for modernization and migration efforts. For PMs, it represents enterprise technical debt and transformation risk.

Stay updated on Cognition

Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.

Subscribe Free

Cognition

Key Highlights

Cognition

Overview

Key Developments

Relevance to AI PMs

Related

Newsletter Mentions (41)

Related

Stay updated on Cognition