Harrison Chase
A founder or leader associated with LangSmith and AI agent development. He emphasizes platform use, collaboration, and process-oriented measurement of agents.
Key Highlights
- Harrison Chase consistently argues that AI agents should be managed as measurable systems with observability, feedback, and iterative improvement loops.
- He is closely associated with LangSmith and LangChain, especially around production deployment, testing, and agent platform workflows.
- His commentary is especially useful for PMs evaluating vendor lock-in risks in agent harnesses, memory systems, and integrations.
- He emphasizes that agent quality is not only a technical problem but also a cross-functional process and collaboration challenge.
- Recent mentions tie him to DeepAgents, open harnesses, enterprise agent architecture, and practical deployment guidance.
Harrison Chase
Overview
Harrison Chase is a prominent builder and leader in the AI agent ecosystem, most closely associated with LangChain, LangSmith, and the broader tooling stack for developing, deploying, and improving AI agents. Across recent mentions, he appears as a public advocate for treating agents not as one-off demos, but as measurable systems that require observability, testing, feedback loops, and structured team processes.For AI Product Managers, Harrison Chase matters because his work sits at the intersection of agent platform strategy and day-to-day product execution. His commentary consistently emphasizes practical concerns that PMs must operationalize: cross-functional collaboration, evaluation frameworks, deployment readiness, vendor lock-in risks, harness design, and continuous improvement loops. In short, he represents a product-and-platform view of AI agents that is highly relevant for teams moving from prototype to production.
Key Developments
- 2026-04-15: Highlighted LangChain’s DeepAgents 0.5 release, including async subagents for long-running tasks, multimodal support, and related enhancements. He also stressed that local agent builds are not enough for production and pointed teams toward LangSmith deployments for secure, scalable launches.
- 2026-04-21: Broke down the end-to-end infrastructure needed for long-horizon AI agents, including task scheduling, state persistence, vector retrieval, prompt templating, and monitoring.
- 2026-04-23: Announced an upcoming LangChain feature debuting at Interrupt that would not only provide testing tools, but also guide teams on what to test, in what order, and when they are done.
- 2026-04-30: Walked through building and deploying a simple agent with DeepAgents deploy. Also predicted that closed-model costs could become prohibitively high by 2026, and noted optimization work aimed at strong performance on open-source models.
- 2026-05-02: Shared enthusiasm for autonomous web-browsing agents, pointing to a DeepAgents and Browserbase integration example as evidence that LLM capabilities are reaching useful autonomy thresholds.
- 2026-05-03: Warned that swapping model providers is relatively easy compared with changing inference and training harnesses, arguing that harnesses create deeper vendor lock-in. He advocated for open harnesses and also noted that memory and integrations remain tightly coupled to the agent harness, with only early signs of open standards such as agents.md and skills.
- 2026-05-04: Publicly supported Shashikant’s PyFlue, a Python-native agent harness framework ported from Flue, and encouraged more developers to work on harness engineering.
- 2026-05-06: Argued that observability alone is insufficient: teams need feedback data collection and even automated feedback generation embedded directly into platforms like LangSmith to create a continuous improvement loop for agents.
- 2026-05-08: Discussed how Ramp Labs built Ramp Sheets and its internal AI agent, Inspect, covering architecture, features, and practical enterprise use cases on the Max Agency podcast.
- 2026-05-10: Described LangSmith as an organization-wide platform for building AI agents that improves cross-functional collaboration and tightens feedback loops. On the same day, he also argued that measuring and iteratively improving agents is as much a human-process challenge as a technical one.
Relevance to AI PMs
1. He frames agents as product systems, not model demos. PMs can use this lens to prioritize instrumentation, evaluation, feedback capture, and deployment workflows early, rather than treating them as post-launch cleanup. 2. He highlights where platform decisions create lock-in. His focus on harnesses, memory, and integrations helps PMs ask better architectural questions before committing to a vendor or internal framework. 3. He emphasizes collaboration and process design. For PMs running agent products, this means success depends not only on model quality, but on how product, engineering, operations, and subject-matter experts review traces, collect feedback, and iterate on failures.Related
- LangSmith: A core platform associated with Harrison Chase’s thinking on observability, evaluation, deployments, and feedback loops for AI agents.
- LangChain: The broader application framework ecosystem he is closely linked to, including launches and testing-related product direction.
- DeepAgents / deep-agents / deepagents-cli / deepagents-deploy / deepagents-05: Connected to his work on agent building, deployment, async subagents, and open-model performance.
- LangGraph: Relevant as part of the broader LangChain ecosystem for agent orchestration and stateful workflows.
- Agent harnesses / open-harnesses / agent-middleware: Central to his warnings about vendor lock-in and the need for more open infrastructure standards.
- LangSmith deployments / LangSmith insights / traces / agent observability / agent reliability: Related to his repeated emphasis on measuring, monitoring, and improving agent behavior over time.
- Browserbase: Mentioned in connection with web-browsing agents enabled through DeepAgents integrations.
- Ramp Labs / Inspect: Referenced through his discussion of real enterprise AI-agent architectures and internal tooling.
- PyFlue / Flue / Shashikant: Connected through his support for harness engineering and Python-native experimentation in the agent tooling ecosystem.
- agents.md / skills / langchain-oss-skills / memory: Relevant to his comments on emerging standards and the still-coupled nature of agent memory and integrations.
Newsletter Mentions (37)
“#7 𝕏 Harrison Chase calls LangSmith an org-wide platform for building AI agents that speeds up cross-functional collaboration and tightens feedback loops. #10 𝕏 Harrison Chase argues that treating AI agents as systems to measure and iteratively improve isn’t just a technical challenge—it demands intentional human collaboration and team processes.”
GenAI PM Daily May 10, 2026 GenAI PM Daily 🎧 Listen to this brief 3 min listen Today's top 11 insights for PM Builders, ranked by relevance from X, Blogs, and LinkedIn. PromptLayer’s multi-step agent evaluation framework #1 𝕏 Jason Zhou launched `/goal` support in CodeX and Hermes agents for one-step autonomous coding, advising use of interview mode, clear stop conditions, and a goal-buddy to manage state and goal files. #2 📝 PromptLayer Blog What Is Agent Evaluation? A Practical Guide for AI Teams - Agent evaluation tests whether an AI agent reliably completes tasks across real inputs, edge cases, and new versions by scoring not just final outputs but multi-step behavior via black-box, trajectory, and component-level evaluations, using metrics like task completion rate, tool selection accuracy, unsupported-claim rate, latency/cost per step, and regression pass rate. PromptLayer offers tracing with span-level context, reusable datasets, batch evaluations, backtesting, regression testing, automated evaluation triggers on new prompt versions, and flexible pipelines including code execution, human input, conversation simulation, regex checks, and LLM assertions. #3 in Udi Menkes built his new product’s entire data flow in a single interactive HTML file—complete with diagrams, in-page navigation, and color-coded complexity—letting his team understand it in minutes instead of hours. #4 𝕏 Garry Tan suggests diagramming your AI agent codebases and architecture in plain ASCII, then relentlessly questioning each component to clarify design and accelerate product development. #5 𝕏 Boris Cherny says Claude Code’s switch to a native installer means npm-only stats undercount its real usage. On Thursday it hit its second-highest signup day ever with 15× growth since Jan 1—now you can ask Claude to debug your SQL. #6 𝕏 Boris Cherny is enhancing Claude Code’s UX for snappier performance and adding debug logs so users can self-serve hang diagnostics. #7 𝕏 Harrison Chase calls LangSmith an org-wide platform for building AI agents that speeds up cross-functional collaboration and tightens feedback loops. #8 𝕏 Santiago showcases a step-by-step guide for constructing Python-powered multi-agent systems from scratch, leveraging MCP and A2A patterns to incrementally add complexity and enable collaborative AI agents. #9 𝕏 Garry Tan spends $2K/mo on Openclaw AI tokens to turbocharge product development and startup insights. He’s “tokenmaxxing” now with a goal to make these capabilities affordable for everyone in 18 months. #10 𝕏 Harrison Chase argues that treating AI agents as systems to measure and iteratively improve isn’t just a technical challenge—it demands intentional human collaboration and team processes. #11 in Peter Yang warns that unedited AI-generated markdown can compound small errors over time—what starts as 5% “slop” quickly balloons into an overwhelming pile of confusing, unverified content. Found this valuable? Share it with another PM - they can subscribe at genaipm.com Unsubscribe • Switch to Weekly
“#16 𝕏 Harrison Chase dives into how Ramp Labs built Ramp Sheets and their internal AI agent Inspect—unpacking architecture, features, and real-world use cases—on the Max Agency podcast.”
Harrison Chase is mentioned in the context of a podcast discussion about Ramp’s internal AI products.
“Harrison Chase argues that agent observability in LangSmith is only half the battle—you must embed feedback data collection (and even automated feedback generation) directly into your observability platform to power a continuous AI-agent improvement loop.”
#10 𝕏 Harrison Chase argues that agent observability in LangSmith is only half the battle—you must embed feedback data collection (and even automated feedback generation) directly into your observability platform to power a continuous AI-agent improvement loop.
“𝕏 Harrison Chase cheers on Shashikant’s PyFlue—a Python-native agent harness framework ported from Flue—and invites more developers to dive into harness engineering.”
#10 𝕏 Harrison Chase cheers on Shashikant’s PyFlue—a Python-native agent harness framework ported from Flue—and invites more developers to dive into harness engineering.
“#6 𝕏 Harrison Chase warns that while swapping model providers is straightforward, changing inference/training harnesses is much harder due to vendor lock-in—and urges the development of open harnesses to keep users flexible.”
#6 𝕏 Harrison Chase warns that while swapping model providers is straightforward, changing inference/training harnesses is much harder due to vendor lock-in—and urges the development of open harnesses to keep users flexible. #13 𝕏 Harrison Chase warns that memory and integrations are still tightly coupled to the agent harness—only agents.md and skills hint at any open standard.
“Harrison Chase is excited that LLMs are becoming capable enough to power autonomous web-browsing agents, showcased by the deepagents + @browserbase LangChain integration example on GitHub.”
Harrison Chase is excited that LLMs are becoming capable enough to power autonomous web-browsing agents, showcased by the deepagents + @browserbase LangChain integration example on GitHub.
“#12 𝕏 Harrison Chase walks through building and deploying a simple agent using DeepAgents deploy, inviting PM Builders to try it out (full docs at https://docs.langchain.com/oss/python/deepagents/deploy).”
#12 𝕏 Harrison Chase walks through building and deploying a simple agent using DeepAgents deploy, inviting PM Builders to try it out (full docs at https://docs.langchain.com/oss/python/deepagents/deploy). #19 𝕏 Harrison Chase predicts that by 2026 closed-model costs will be prohibitively high and he’s optimizing deepagents for peak performance on OSS models.
“#21 𝕏 Harrison Chase is launching on May 13th at Interrupt a new feature for LangChain that not only provides testing tools but also guides you on what to test, in what order, and when you’re done.”
#21 𝕏 Harrison Chase is launching on May 13th at Interrupt a new feature for LangChain that not only provides testing tools but also guides you on what to test, in what order, and when you’re done.
“Harrison Chase breaks down the end-to-end infrastructure for deploying long-horizon AI agents—covering task scheduling, state persistence, vector retrieval, prompt templating, and monitoring.”
#11 𝕏 Harrison Chase breaks down the end-to-end infrastructure for deploying long-horizon AI agents—covering task scheduling, state persistence, vector retrieval, prompt templating, and monitoring. #12 ▶️ Hermes Agent: The New OpenClaw? Greg Isenberg Shows how to install and configure Hermes Agent with built-in SQLite memory and 40+ tools via a one-line command, connect to Open Router for transparent token pricing, and deploy it on Android devices using Termox and Termox API.
“#10 𝕏 Harrison Chase highlights LangChain’s DeepAgents 0.5 release, which adds async subagents to handle longer-running tasks without blocking the event loop, plus multimodal support and other enhancements.”
#10 𝕏 Harrison Chase highlights LangChain’s DeepAgents 0.5 release, which adds async subagents to handle longer-running tasks without blocking the event loop, plus multimodal support and other enhancements. #13 𝕏 Harrison Chase warns that building agents locally isn’t enough for production—he recommends using LangSmith deployments for secure, scalable launches, with a full walkthrough and docs available.
Related
A coding environment for Claude mentioned for its keyboard shortcut that opens a full-featured editor for prompt writing. It is highlighted as making long prompts far easier to manage.
An agent referenced as benefiting from GBrain’s memory layers. It serves as an example of agent systems becoming more personalized and context-aware.
An LLM application framework mentioned in the context of autonomous web-browsing agents and integrations.
Autonomous or semi-autonomous software systems that can act across tools and workflows. The newsletter frames agents as buyers, tool consumers, and the primary audience for protocols like MCP.
LangChain’s platform for observability, evaluation, and collaboration around AI agents. Here it is described as an org-wide platform that improves cross-functional workflows and feedback loops.
A Claude offering for legal organizations and enterprise AI teams, mentioned as part of deploying Claude in legal workflows.
A concept for modular agent capabilities or instructions, mentioned as an emerging hint toward open standards. It is discussed alongside agents.md in the context of agent harness interoperability.
An open-source agent framework associated with Harrison Chase. In the newsletter it is being optimized for open-source models as closed-model costs rise.
A file-based convention that hints at emerging open standards for agent behavior and configuration. The newsletter references it as one of the few signs of openness in the agent harness stack.
A modular layer that adds tools, guardrails, and custom instructions to AI agents. It is described as a composable harness for production agent systems.
LangChain’s deployment offering for launching agents securely and at scale. It is important for PMs evaluating production readiness, observability, and managed infrastructure for agents.
Community middleware example for customizing agent behavior and steering tasks in agent frameworks.
Stay updated on Harrison Chase
Get curated AI PM insights delivered daily — covering this and 1,000+ other sources.
Subscribe Free