Back to Videos

Building AI-Native: Inside the Stacks Powering Cognition, Gamma, and Harvey

Channel Anthropic
Date May 6, 2026
Duration 28 min
Tags AI-Native, Multi-Agent, MCP, Architecture, Production
TL;DR

The teams behind Cognition (Devin), Gamma (AI-first presentations), and Harvey (AI for legal) compare notes on what it actually takes to build AI-native products: how they architect multi-agent systems, where MCP fits in production, how they handle failure modes, and the architectural decisions they'd make differently with hindsight.

Key Takeaways

Summary

Cognition's Architecture

Cognition runs a hierarchical agent system for Devin where a planning agent breaks down tasks, dispatches to execution agents, and a verification agent checks outputs before they're accepted. The key design decision was context isolation — execution agents don't share context with each other, only with the planner via structured messages. This prevents task bleedthrough and makes failures attributable.

Gamma's Creative Stack

Gamma uses Claude for end-to-end presentation generation — from brief to finished deck. Their architecture is less hierarchical and more pipeline-oriented: a planning pass produces a structured outline, a content pass fills it in, a design pass applies visual logic. MCP integrates their design token system so Claude has access to brand constraints during generation rather than as a post-processing step.

Harvey's Legal Stack

Harvey operates in legal — high-stakes, output-verified, every claim needs attribution. Their multi-agent system uses a research agent (retrieval and synthesis), a drafting agent (structured legal writing), and a review agent (checking for unsupported claims, jurisdictional accuracy, formatting compliance). The trust model is strict: no output ships without the review agent's sign-off.

MCP in Production

All three teams have MCP in production, and all three have opinions about where it shines and where it struggles. The consensus: MCP is excellent for giving agents access to structured external systems (databases, APIs, design tokens) but requires careful tool scope design. An agent with 50 tools doesn't perform better than one with 10 well-chosen tools — it performs worse, because the model spends context deciding which tool to use.

The Lessons

In the final segment, each team names their biggest architectural regret and the decision they'd repeat. Cognition's regret: not building context isolation into the base layer from day one. Harvey's regret: building custom orchestration before trying the Managed Agents layer. Gamma's repeat: designing their eval pipeline before their production stack, not after.

Notable Quotes

"The hardest part of multi-agent isn't the routing. It's the failure handling. What does your product do when one agent in a five-agent chain returns something that doesn't parse?"

"MCP doesn't make tools free. Every tool you give an agent is context you're spending. Scope tightly."

"We built our eval suite before we had a product. Seemed wasteful at the time. Best decision we made."

Companies Featured

CompanyDomainKey Architecture Pattern
Cognition (Devin)AI software engineeringHierarchical agents with context isolation
GammaAI presentation generationSequential pipeline with MCP design tokens
HarveyAI legalResearch + draft + review agent chain