Dan Shipper
A founder and writer cited for doing writing, research, and email inside AI agents. The newsletter uses him as an example of agent-native knowledge work.
Key Highlights
- Dan Shipper is cited as a strong example of agent-native knowledge work across writing, research, and email.
- He launched Plus Ones, a Slack-hosted agent environment preloaded with Every’s apps for email, writing, and docs.
- His idea of compound engineering emphasizes capturing prompt and workflow learnings so AI systems improve over time.
- He described a senior engineer benchmark that compares model output with human engineers on realistic software rewrite tasks.
- His workflow shows how agents can use tools like Google Docs and PostHog directly through in-app browsing.
Overview
Dan Shipper is a founder, writer, and operator associated with Every who appears in the newsletter as a leading example of agent-native knowledge work. Across multiple mentions, he is cited for using AI agents not just for isolated prompts, but as a primary working environment for writing, research, email, and software-related workflows. That makes him notable for AI Product Managers because he represents a practical model for how high-context knowledge work changes when agents become the interface rather than a side tool.His importance to AI PMs is twofold. First, he provides concrete examples of AI-native workflows: agent-assisted writing in tools like Codex and Claude Code, in-app use of products like Google Docs and PostHog, and bundled agent systems such as Plus Ones with apps for email, writing, and docs. Second, his ideas about AI-native organizations, “compound engineering,” and benchmarking model performance against senior human engineers offer useful frameworks for PMs designing products, teams, and evaluation systems in an agent-first world.
Key Developments
- 2026-01-11 — In a conversation with Jason Shuman, Dan Shipper is cited for principles behind AI-native organizations, including the shift from a knowledge economy to an “allocation economy,” the growing importance of generalists with strong taste, and the idea of compound engineering—capturing prompt lessons so agents improve over time.
- 2026-03-27 — Dan Shipper launched Plus Ones, described as a Slack-hosted OpenClaw preloaded with Every’s agent apps: Cora for email, Spiral for writing, and Proof for docs, plus custom skills and workflows configurable with a ChatGPT or API key.
- 2026-05-25 — On Lenny’s Podcast, Dan Shipper described Every’s custom senior engineer benchmark, which asks both models and human engineers to rewrite the vibe-coded Proof application from first principles. In the cited results, GPT-5.5 on an Opus 4.7 plan scored 62/100, while human senior engineers scored in the high 80s to low 90s.
- 2026-05-26 — Dan Shipper is referenced as now doing all his writing, research, and email inside AI agents such as Codex and Claude Code, using tools like Google Docs and PostHog through the agent’s in-app browser for more seamless, context-rich collaboration.
Relevance to AI PMs
1. Design workflows around agents, not just chat boxes. Dan Shipper’s usage pattern suggests users may prefer doing real work inside agents that can navigate tools, docs, and analytics products directly. AI PMs should prioritize multi-step task execution, browser/tool use, and persistent context over one-off prompt UX.2. Benchmark against real human job performance. The senior engineer benchmark example is a useful reminder that AI evaluation should map to actual workflows and quality bars, not vanity metrics. PMs can create role-specific benchmarks that compare model output to experienced practitioners on production-like tasks.
3. Capture learning as reusable system knowledge. The concept of compound engineering is especially practical for PMs building internal AI products. Teams should turn successful prompts, workflows, and failure recoveries into repeatable assets—playbooks, agent skills, templates, and memory systems that improve performance over time.
Related
- Every — Dan Shipper is closely tied to Every, the organization behind several of the agent-oriented products and experiments mentioned.
- Plus Ones — A Dan Shipper-launched system for deploying a preconfigured agent environment inside Slack.
- Jason Shuman — Interviewer/conversation partner who surfaced Shipper’s ideas about AI-native organizations and compound engineering.
- Compound engineering — A core concept associated with Shipper’s thinking: preserving prompt and workflow learnings so agent systems improve cumulatively.
- Codex and Claude Code — Agent environments cited in connection with how Shipper performs writing, research, and email.
- Google Docs and PostHog — Examples of standard work tools being accessed through agent in-app browsing, showing how existing SaaS products can become part of agent workflows.
- GPT-5.5 and Opus 4.7 — Models/plans mentioned in connection with benchmark testing discussed by Shipper.
- Lenny Rachitsky — The podcast/newsletter context in which some of Shipper’s workflow and benchmarking ideas were highlighted.
Newsletter Mentions (4)
“#15 𝕏 Lenny Rachitsky : Dan Shipper now does all his writing, research and email inside AI agents like Codex or Claude Code—using Google Docs, PostHog and other tools in the agent’s in-app browser for seamless, context-rich collaboration.”
#15 𝕏 Lenny Rachitsky : Dan Shipper now does all his writing, research and email inside AI agents like Codex or Claude Code—using Google Docs, PostHog and other tools in the agent’s in-app browser for seamless, context-rich collaboration.
“#8 🟣 The AI paradox: More automation, more humans, more work | Dan Shipper Lennys Podcast Dan Shipper describes Every’s custom “senior engineer benchmark” that asks models and engineers to rewrite their vibe-coded Proof application from first principles, showing GPT 5.5 (Opus 4.7 plan) scored 62/100 versus human engineers in the high 80s to low 90s.”
#8 🟣 The AI paradox: More automation, more humans, more work | Dan Shipper Lennys Podcast Dan Shipper describes Every’s custom “senior engineer benchmark” that asks models and engineers to rewrite their vibe-coded Proof application from first principles, showing GPT 5.5 (Opus 4.7 plan) scored 62/100 versus human engineers in the high 80s to low 90s. All coding models prior to GPT 5.5 scored 30/100 on the senior engineer benchmark. GPT 5.5 running on the Opus 4.7 plan achieved 62/100 on the benchmark rewrite. Human senior engineers each scored in the high 80s to low 90s out of 100 on the same benchmark.
“in Dan Shipper launched Plus Ones—a Slack-hosted OpenClaw preloaded with Every’s agent apps (Cora for email, Spiral for writing, Proof for docs) plus custom skills and workflows, all set up in one click using your ChatGPT or any API key.”
#23 𝕏 in Dan Shipper launched Plus Ones—a Slack-hosted OpenClaw preloaded with Every’s agent apps (Cora for email, Spiral for writing, Proof for docs) plus custom skills and workflows, all set up in one click using your ChatGPT or any API key.
“Jason Shuman’s conversation with Dan Shipper surfaces key principles for AI-native organizations: the shift from a knowledge economy to an “allocation economy” where orchestration of human and machine intelligence is paramount; the resurgence of generalists with strong taste and direction; and “compound engineering,” capturing prompt lessons to improve AI agents over time.”
Product Management Insights & Strategies Marc Baselga outlines three investor-selection filters for first-time founders: diversify checks among angels to build a supportive network; choose early backers who create positive signals for later rounds; and avoid detractors by backchanneling with founders of failed ventures—ensuring investors add strategic value beyond capital. Jason Shuman’s conversation with Dan Shipper surfaces key principles for AI-native organizations: the shift from a knowledge economy to an “allocation economy” where orchestration of human and machine intelligence is paramount; the resurgence of generalists with strong taste and direction; and “compound engineering,” capturing prompt lessons to improve AI agents over time. AI Industry Developments & News Guillermo Rauch spotlights OpenAI’s GPT-5.2 Pro working with Harmonic to near-autonomously generate a proof for an Erdős mathematical problem—demonstrating how advanced language models are tackling complex reasoning tasks once reserved for human experts.
Related
Anthropic's coding assistant used for programming and automation tasks. The newsletter references it for building a custom approval device and for writing and research workflows inside AI agents.
OpenAI's coding agent/tool used here for self-improving tax workflows and long-running autonomous loops. It is presented as capable of iterative task execution with plugins and goal-based runs.
A newsletter/podcast operator cited for summarizing Dan Shipper’s view on AI, work, and value creation. He connects the discussion to skill commoditization and recombination.
A frontier coding-capable model referenced in a benchmark comparison. The newsletter says it outperformed earlier coding models but still lagged behind human senior engineers in Every’s test.
A plan or configuration associated with GPT 5.5 in the benchmark discussion. It is mentioned as the mode under which GPT 5.5 achieved its score.
A practice of capturing learnings from prompts and agent interactions to steadily improve system behavior over time. For PMs, it is a feedback-loop mindset for iterative AI product improvement.
A product analytics company/platform mentioned as one of the services Nebula integrates with. It appears in the context of automating analytics workflows.
Stay updated on Dan Shipper
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free