A layered map of the AI agent application stack, from user experience and agent runtimes through sandboxes, gateways, models, and compute.

The AI Agent Stack: Architecture Layers and Market Map

Research date: June 15, 2026
Scope: Active or influential open-source projects and commercial products with a public official website or repository.

How to read this map

There is no single “AI agent market.” There are several markets stacked on top of one another:

Agent products and hosts
User experience and agent interaction
Application and model SDKs
Agent frameworks and orchestration
Managed agent infrastructure and durable execution
Tools, integrations, protocols, retrieval, and memory
Sandboxed execution and computer use
Model gateways and routing
Model providers and inference runtimes
Compute, storage, and network

Security, identity, governance, observability, evaluation, and cost management cut across the entire stack.

Products often span several layers. This directory places each player where an architect would primarily evaluate or buy it. “Comprehensive” here means a broad, useful working set, not every GitHub repository or newly launched startup.

Layer 0: Agent products and hosts

This is the layer users operate directly. These products contain or connect to an agent runtime, but they are evaluated as finished coding agents, desktop agents, IDEs, assistants, or automation products.

Open-source and model-flexible coding agents

Player	Primary surface	What it is trying to solve
goose	Desktop, CLI, API	Block's open-source, general-purpose local agent for coding, automation, research, workflows, MCP extensions, and multi-agent delegation
OpenCode	Terminal, desktop, IDE	Provider-neutral open-source coding agent with a client/server architecture, LSP awareness, tools, permissions, agents, and MCP
Cline	VS Code, CLI, SDK	Open coding-agent runtime with plan/action workflows, model choice, MCP, and embeddability
Roo Code	VS Code	Open-source coding agent focused on configurable modes, tools, model choice, and extension workflows
Kilo Code	IDE, CLI, cloud	Open-source coding-agent platform spanning local development, terminal use, cloud agents, and team workflows
Continue	VS Code, JetBrains, CLI	Open-source coding-assistant platform for custom models, rules, context providers, autocomplete, and agents
Aider	Terminal	Git-native pair-programming agent with broad model support and repository map/context management
OpenHands	Web, CLI, SDK, cloud	Open platform for software-development agents, remote execution, evaluation, and delegation
SWE-agent	CLI, research framework	Open software-engineering agent and benchmark-oriented environment from Princeton
Qwen Code	Terminal	Open-source terminal coding agent optimized for Qwen but usable with compatible model endpoints
Gemini CLI	Terminal	Google's open-source agentic terminal interface for Gemini and developer workflows
OpenAI Codex	CLI, IDE, desktop, cloud	OpenAI's coding agent for local interactive work and delegated cloud tasks
Tabby	IDE and self-hosted server	Self-hosted open-source coding assistant and completion platform
Zed	Native editor	High-performance editor with first-party and Agent Client Protocol integrations
Open Interpreter	Terminal and local computer	Open agent that lets language models operate code and the user's computer
Agent Zero	Web/terminal agent	Open general-purpose agent with tools, subagents, memory, and local control
AutoGPT	Agent platform	Open agent platform and visual system for creating and running autonomous workflows
Browser Use	Library, CLI, cloud agent	Browser-operating agents and cloud browser automation

Commercial and vendor-native coding agents

Player	Primary surface	Center of gravity
Claude Code	Terminal, IDE, web	Anthropic-native coding agent and agent SDK ecosystem
GitHub Copilot	IDE, CLI, GitHub, cloud	Coding assistance, repository agents, reviews, issue-to-PR work, and multi-agent hosting
Cursor	AI-native editor and cloud agents	Integrated coding, codebase reasoning, background agents, review, and team workflows
Windsurf	AI-native editor	Agentic coding IDE with codebase context and workflow automation
Devin	Cloud software engineer	Delegated autonomous software-development tasks in managed environments
Amazon Q Developer	IDE, CLI, AWS console	AWS-oriented development, transformation, operations, and cloud assistance
JetBrains Junie	JetBrains IDEs	Coding agent integrated with JetBrains project intelligence
Warp	Agentic terminal	Terminal-native coding and DevOps agents with shared workflows
Amp	Terminal and editor	Sourcegraph's agentic coding product for large codebases and delegated work
Replit Agent	Browser IDE and hosted runtime	Prompt-to-application building, execution, and deployment
Bolt	Browser app builder	Prompt-to-full-stack web application generation and deployment
Lovable	Browser app builder	Conversational application generation with visual editing and deployment
v0	Browser and Vercel ecosystem	Generative UI and full-stack web application creation
Firebase Studio	Browser development environment	Google/Firebase-oriented agentic application building
Mistral Vibe	Coding agent	Mistral-native agentic software-development experience

Business and knowledge-work agent products

Player	Primary use
Glean Agents	Enterprise search, knowledge, and workplace agents
Salesforce Agentforce	CRM, customer service, sales, and enterprise workflow agents
Microsoft Copilot Studio	Enterprise copilots and low-code agents across Microsoft systems
Google Agentspace	Enterprise search and agent experiences over organizational data
ServiceNow AI Agents	IT, employee, customer-service, and workflow agents
UiPath Agentic Automation	Agents combined with robotic process automation and enterprise workflows
Dust	Custom enterprise assistants grounded in company systems
Lindy	No-code business and personal workflow agents
Relevance AI	No-code agent teams and business automation
Gumloop	Visual AI workflow and automation agents
Relay.app	Human-in-the-loop workflow automation
Harvey	Legal and professional-services agents
Sierra	Customer-experience agents
Intercom Fin	Customer-support agent

Layer 1: User experience and agent interaction

This layer turns backend agents into usable products. It handles chat, multimodal input, streamed events, generative UI, frontend tools, shared application state, approvals, and human intervention.

Agentic frontend frameworks and component systems

Player	Primary fit
CopilotKit	Embedded copilots, AG-UI, generative UI, frontend tools, shared state, and human-in-the-loop interaction
assistant-ui	Composable React chat and agent interfaces with runtime adapters
AI Elements	Vercel/shadcn-style components for AI application interfaces
Chainlit	Python-first conversational and agent application UI
Gradio	Rapid Python interfaces for models, multimodal apps, and agents
Streamlit	Python data and AI application interfaces
NLUX	Framework-neutral conversational AI UI library
Stream Chat	Production chat infrastructure and AI-agent messaging interfaces
Botpress Webchat	Embeddable conversational-agent UI
Voiceflow	Conversational and voice-agent design and prototyping

Self-hosted assistant shells

Player	Primary fit
Open WebUI	Self-hosted multi-model and local-model interface
LibreChat	Self-hosted multi-provider assistant platform
Chatbot UI	Open-source conversational UI
LobeChat	Open-source multi-provider chat and agent workspace
AnythingLLM	Self-hosted chat, RAG, workspaces, and agents

Voice and realtime-agent interfaces

Player	Primary fit
LiveKit Agents	Realtime voice, video, and multimodal agent framework
Vapi	Voice-agent development and telephony platform
Retell AI	Production phone and voice agents
Bland AI	Phone-call automation agents
Pipecat	Open-source realtime voice and multimodal agent framework
Daily Bots	Realtime voice/video agent infrastructure

Layer 2: Interaction and interoperability protocols

Protocols are not agent frameworks. They standardize boundaries between hosts, tools, agents, user interfaces, commerce systems, and external services.

Protocol	Boundary it standardizes
Model Context Protocol (MCP)	Agent/host access to tools, prompts, resources, and external context
Agent2Agent Protocol (A2A)	Discovery, delegation, messaging, and task lifecycle between agents
AG-UI	Event stream between agent backends and user-facing applications
Agent Client Protocol (ACP)	Communication between coding agents and editors such as Zed
A2UI	Declarative agent-generated user interfaces
MCP Apps	Interactive UI resources delivered through MCP hosts
Agent Payments Protocol (AP2)	Agent-initiated payments and commerce authorization
Universal Commerce Protocol (UCP)	Interoperable commerce flows for agents and businesses
Agent Network Protocol (ANP)	Decentralized agent discovery, identity, and communication
OpenAPI	Machine-readable HTTP APIs commonly converted into agent tools
JSON Schema	Structured tool inputs, outputs, and generated data
OpenTelemetry	Vendor-neutral traces, metrics, and logs, including emerging GenAI conventions

Layer 3: AI application and model SDKs

These libraries normalize model access and application primitives such as streaming, structured output, multimodal input, tools, prompt management, and frontend state. Some overlap with the agent-runtime layer.

Player	Language/ecosystem	Center of gravity
Vercel AI SDK	TypeScript	Full-stack AI applications, provider abstraction, streaming, tools, agents, and UI
TanStack AI	TypeScript	Modular, type-safe provider adapters, tools, streaming, and application clients
Genkit	JavaScript/TypeScript, Go	Full-stack AI flows, tools, evaluation, local development, and Google integrations
LangChain	Python, JavaScript	Broad component ecosystem for models, tools, retrieval, agents, and chains
LlamaIndex	Python, TypeScript	Data-centric AI applications, retrieval, workflows, and agents
Haystack	Python	Production RAG pipelines, components, tools, and agents
Spring AI	Java	Spring-native model, vector-store, tool, RAG, and agent application APIs
Semantic Kernel	.NET, Python, Java	Enterprise application SDK for models, plugins, planning, and agent integration
Instructor	Python and ports	Reliable schema-validated structured extraction from model outputs
BAML	Multi-language	Typed prompt functions and structured model outputs
Outlines	Python	Constrained and structured generation
Guidance	Python	Controlled generation and prompt programming
LMQL	Python/query language	Declarative constraints and programming over language models
DSPy	Python	Declarative LM programs and prompt/module optimization

Layer 4: Agent frameworks, runtimes, and orchestration

This is the decision-loop layer. It coordinates models, tools, state, delegation, retries, approvals, and multi-agent workflows.

Code-first agent frameworks

Player	Primary design
Strands Agents SDK	Model-driven agents, MCP/A2A, multi-agent Graph/Swarm, Python and TypeScript
OpenAI Agents SDK	Agents, handoffs, guardrails, sessions, tracing, MCP, voice, and OpenAI-centric workflows
Claude Agent SDK	Programmable access to Claude Code's agent harness and tools
Google Agent Development Kit	Model-flexible agent development, multi-agent composition, sessions, and deployment
Microsoft Agent Framework	Microsoft's successor path combining agent orchestration concepts from AutoGen and Semantic Kernel
LangGraph	Stateful graphs, durable execution, interrupts, replay, and human-in-the-loop agents
CrewAI	Role-based agent teams, crews, flows, and managed deployment
PydanticAI	Type-safe Python agents with Pydantic validation and dependency injection
Mastra	TypeScript agents, workflows, memory, RAG, evaluation, and development tooling
Agno	Python agent framework with tools, teams, memory, knowledge, and runtime
smolagents	Lightweight Hugging Face agents and code agents
LlamaIndex Workflows	Event-driven workflows and data-aware agents
AutoGen	Microsoft's established multi-agent framework; increasingly complemented by Microsoft Agent Framework
AG2	Community-led continuation of the AutoGen multi-agent ecosystem
BeeAI Framework	IBM-backed open-source agents and multi-agent workflows
Letta	Stateful agents with explicit memory architecture and long-running identity
Langroid	Multi-agent Python programming framework
CAMEL-AI	Multi-agent research and framework ecosystem
Atomic Agents	Small, composable, schema-driven Python agents
VoltAgent	Open-source TypeScript agent framework and observability ecosystem
mcp-agent	Lightweight agent workflows centered on MCP
Rasa	Enterprise conversational AI and controlled agent workflows

Visual and low-code agent builders

Player	Primary fit
Dify	Open-source visual AI application, RAG, workflow, and agent platform
Flowise	Open-source visual agent and LLM workflow builder
Langflow	Python-based visual builder for agents, MCP, and RAG
n8n	Workflow automation with AI agents and broad integrations
Botpress	Visual conversational and customer-facing agent platform
Microsoft Copilot Studio	Enterprise low-code agents in the Microsoft ecosystem
Vertex AI Agent Builder	Google's managed agent and enterprise-search builder
Amazon Bedrock Agents	AWS-managed agent building, action groups, knowledge bases, and multi-agent collaboration
Vellum	Visual AI workflow, prompt, evaluation, and deployment platform
Stack AI	Enterprise no-code agent and workflow platform
Retool Agents	Agents connected to internal tools, databases, and applications
Zapier Agents	Agents over Zapier's application and automation ecosystem

Layer 5: Managed agent infrastructure and durable execution

This layer runs and operates agents in production. It provides hosting, session isolation, identity, scaling, durable state, scheduling, deployment, and operational controls.

Agent-native managed platforms

Player	Primary fit
Amazon Bedrock AgentCore	Framework-neutral AWS runtime, memory, gateway, identity, policy, observability, browser, code interpreter, registry, and evaluations
Microsoft Foundry Agent Service	Managed build, deployment, tools, tracing, identity, and scaling for agents on Azure
Vertex AI Agent Engine	Managed runtime, sessions, memory, evaluation, and deployment on Google Cloud
Cloudflare Agents	Stateful edge agents using Workers, Durable Objects, realtime connections, and workflows
LangSmith Deployment	Managed deployment and operations for LangGraph applications
CrewAI AMP	Managed enterprise deployment and operation of CrewAI agents
Mastra Cloud	Managed deployment for Mastra applications and agents
Agentuity	Agent-native cloud, development, deployment, routing, and observability
Blaxel	Serverless agent infrastructure, sandboxes, functions, and model endpoints
xpander.ai	Full-lifecycle, framework-neutral agent platform and control plane
Inkeep	Agent builder, platform, and customer-facing agent deployment

Durable workflow and background execution platforms

Player	Primary fit
Temporal	Highly durable, replayable, long-running workflows
Inngest	Event-driven durable functions and agent workflows
Trigger.dev	Long-running background jobs and AI tasks
Hatchet	Open-source durable task and workflow orchestration
Restate	Durable execution, virtual objects, and workflows
DBOS	Database-backed durable workflows and application execution
Prefect	Python workflow orchestration and data/AI pipelines
Dagster	Data and AI asset orchestration
Apache Airflow	Scheduled batch workflow orchestration

Layer 6: Tools, actions, integrations, and web intelligence

Agents need authenticated, governed access to business applications, APIs, browsers, search, and the public web.

Tool and integration platforms

Player	Primary fit
Composio	Agent toolkits, managed authentication, and broad SaaS integrations
Arcade	Secure agent tools, user authorization, MCP, and tool evaluation
Pipedream	Developer integration platform, workflows, and MCP-connected APIs
Nango	OAuth, credentials, API integrations, and agent connectivity
Klavis AI	Hosted MCP infrastructure, tool discovery, and enterprise integration
Zapier MCP	Agent access to Zapier's application ecosystem
Apify	Web scraping, browser automation, Actors, and agent tools
Activepieces	Open-source automation and MCP-enabled integrations
Paragon	Embedded integrations and managed authentication
Merge	Unified APIs and agent integrations for business software
Workato	Enterprise integration, automation, and agent orchestration
Tray.ai	Enterprise integration and agent connectivity
Make	Visual automation and application integrations
Toolhouse	Tool infrastructure, memory, and agent capabilities
Smithery	MCP server registry, hosting, and discovery
Glama	MCP server directory and hosting ecosystem

Search, research, crawling, and browser data

Player	Primary fit
Tavily	Search and research API designed for agents
Exa	Neural web search and research APIs
Firecrawl	Website crawling, extraction, search, and agent-ready content
Jina AI	Search, readers, rerankers, embeddings, and web data APIs
Parallel Web Systems	Web research and data retrieval infrastructure for agents
Perplexity Sonar	Search-grounded answer and research APIs
SerpAPI	Structured search-engine result APIs
Bright Data	Web data, scraping, proxies, and agent infrastructure
Oxylabs AI Studio	Web intelligence, crawling, extraction, search, and browser-agent infrastructure
Diffbot	Knowledge graph and structured web extraction

Layer 7: Sandboxed execution and agent computers

This layer gives agents isolated machines, filesystems, processes, browsers, desktops, code interpreters, and preview environments.

General code and computer sandboxes

Player	Primary fit
Daytona	Framework-neutral programmable computers, snapshots, code interpreter, Git, previews, GPUs, Windows, and computer use
E2B	Hosted secure cloud sandboxes and code interpreters for agents
Runloop	Development environments and infrastructure for coding agents
Modal Sandboxes	On-demand containers and sandboxes within a serverless compute platform
Vercel Sandbox	Firecracker-based isolated execution integrated with Vercel
Cloudflare Sandbox	Sandboxed code execution integrated with Workers and Containers
Fly.io Sprites	Persistent, fast-starting remote computers for agents
Blaxel Sandboxes	Serverless agent sandboxes and execution environments
Northflank	Container infrastructure and isolated workloads for agents
Beam	Serverless CPU/GPU execution and sandboxes
Freestyle	Stateful agent computers and code execution
CodeSandbox SDK	Programmatic development sandboxes and previews
Replit	Hosted development environments and application execution
Azure Container Apps dynamic sessions	Azure-managed isolated code-interpreter and custom-container sessions
AgentCore Code Interpreter	AWS-managed isolated code execution

Browser and desktop computer-use infrastructure

Player	Primary fit
Browserbase	Managed browsers, sessions, observability, and agent browser infrastructure
Steel	Open-source browser API and managed browser infrastructure
Hyperbrowser	Cloud browsers and web agents
Kernel	Browser and computer infrastructure for agents
Browserless	Hosted browser automation and Chrome infrastructure
Browser Use Cloud	Managed browser-use agents and browser sessions
Anchor Browser	Cloud browser infrastructure for agents
AgentCore Browser	AWS-managed browser automation

Layer 8: Memory, retrieval, context, and knowledge

This layer turns stateless model calls into systems that can remember users, retrieve organizational knowledge, and maintain durable context.

Agent memory and context engines

Player	Primary fit
Mem0	User, session, agent, and graph memory as a framework or managed service
Zep	Context graphs and enterprise agent memory
Letta	Stateful agents with explicit working and archival memory
Cognee	Knowledge graphs and memory for agents
Graphiti	Temporal knowledge graphs for agent memory
Supermemory	Universal memory and context API
LangMem	Long-term memory tools for LangGraph agents
Hindsight	Agent memory and retrieval system
EverMind	Persistent agent memory platform
Memvid	Portable file-oriented memory for AI applications

Retrieval and document processing frameworks

Player	Primary fit
LlamaIndex	Data connectors, indexing, retrieval, workflows, and agents
Haystack	RAG pipelines, document stores, retrievers, and agents
LangChain	Retrieval components, loaders, splitters, vector stores, and agents
Unstructured	Document ingestion, parsing, partitioning, and enrichment
LlamaParse	Agentic document parsing
Docling	Open document conversion and structured extraction
Vectorize	Managed RAG pipelines and context engineering
Graphlit	Multimodal knowledge ingestion, extraction, search, and agent memory

Vector and hybrid-search databases

Player	Primary fit
Pinecone	Managed vector database and retrieval
Weaviate	Open-source and managed vector/hybrid database
Qdrant	Open-source and managed vector database
Milvus / Zilliz	Distributed open-source vector database and managed cloud
Chroma	Developer-focused open-source and hosted retrieval database
LanceDB	Embedded and cloud multimodal/vector database
pgvector	Vector search extension for PostgreSQL
Redis	In-memory data platform with vector and hybrid search
Elasticsearch	Search, hybrid retrieval, and vector database capabilities
MongoDB Atlas Vector Search	Vector search within MongoDB
Vespa	Large-scale search, ranking, recommendations, and vectors
Turbopuffer	Serverless vector and full-text search
Supabase Vector	Postgres/pgvector retrieval in the Supabase platform
OpenSearch	Open-source search and vector retrieval

Layer 9: Observability, evaluation, testing, and prompt operations

Traditional logs tell you whether code crashed. Agent observability must also explain model decisions, tool calls, trajectories, quality, cost, regressions, and user outcomes.

End-to-end observability and evaluation platforms

Player	Primary fit
LangSmith	Agent tracing, datasets, evaluation, prompt management, and deployment
Langfuse	Open-source traces, evaluation, prompts, metrics, and self-hosting
Braintrust	Evals, experiments, datasets, tracing, and production monitoring
Arize Phoenix	Open-source OpenTelemetry-native tracing and evaluation
Weights & Biases Weave	Tracing, evaluation, datasets, and model/application analytics
AgentOps	Agent monitoring, replay, cost tracking, and evaluation
Helicone	Gateway, observability, sessions, prompts, and evaluations
Opik	Open-source tracing, evaluation, prompt optimization, and production monitoring
MLflow	Open-source GenAI tracing, evaluation, prompt/version management, and deployment integration
Traceloop	OpenLLMetry/OpenTelemetry-based LLM and agent observability
Parea	Evaluation, testing, tracing, and experimentation
Galileo	Agent evaluation, observability, guardrails, and quality intelligence
Patronus AI	Evaluation, red teaming, and reliability
HoneyHive	Evaluation, tracing, prompt management, and experiments
Maxim AI	Simulation, evaluation, observability, and agent quality
Laminar	Open-source tracing, evaluation, and data collection
LangWatch	Open-source LLM/agent evaluation and observability
Lunary	Open-source monitoring, prompts, analytics, and evaluation
Portkey	AI gateway plus observability, governance, and prompt management

Evaluation and testing frameworks

Player	Primary fit
DeepEval	Pytest-style evaluation for LLMs, RAG, and agents
Ragas	RAG and agent evaluation
promptfoo	Open-source testing, comparison, red teaming, and CI
Giskard	Open-source LLM testing, evaluation, and security scanning
OpenAI Evals	Open-source evaluation framework and registry
Inspect AI	UK AI Security Institute's open-source evaluation framework
Evidently	Open-source AI quality, testing, and monitoring
Fiddler	Model and GenAI observability and governance
Arthur	AI performance, monitoring, evaluation, and governance

Layer 10: Security, identity, guardrails, and governance

Agent security includes prompt injection, data leakage, unsafe output, excessive permissions, tool misuse, insecure MCP servers, identity delegation, policy enforcement, and auditability.

Runtime guardrails and open frameworks

Player	Primary fit
NVIDIA NeMo Guardrails	Open-source programmable input, output, retrieval, execution, and dialog rails
Guardrails AI	Structured validation, validators, and runtime guardrails
LLM Guard	Open and commercial prompt/output scanning and sanitization
Llama Guard	Meta safety-classification models
OpenAI Moderation	OpenAI content-safety classification API
AWS Guardrails for Bedrock	Managed content, topic, grounding, and policy controls
Azure AI Content Safety	Managed content safety and prompt shields
Google Model Armor	Managed prompt/response security and policy enforcement

Agent and GenAI security platforms

Player	Primary fit
Check Point Lakera	Prompt-injection defense, runtime guardrails, and red teaming
Protect AI	AI supply-chain security, model scanning, red teaming, and LLM protection
Prompt Security	Enterprise GenAI and agent security
Snyk / Invariant Labs	Agentic AI and MCP security guardrails
HiddenLayer	Model and AI runtime threat detection
Cisco AI Defense	Enterprise AI discovery, validation, runtime protection, and governance
Palo Alto Prisma AIRS	AI runtime security, posture management, and red teaming
Noma Security	AI and agent security posture, red teaming, and runtime protection
Zenity	Security and governance for enterprise agents and copilots
Pillar Security	AI application discovery, testing, and runtime protection
Mindgard	AI red teaming and security testing
CalypsoAI	AI security, red teaming, and runtime controls
WitnessAI	Enterprise AI visibility, policy, and security
Credo AI	AI governance, risk, policy, and compliance
Holistic AI	AI governance, risk management, and assurance

Identity and authorization for agents

Player	Primary fit
Auth0 for AI Agents	User identity, token vaulting, and delegated authorization for agents
Stytch Connected Apps	OAuth and machine/application authorization
WorkOS	Enterprise identity and authorization infrastructure
Permit.io	Fine-grained authorization and policy for applications and agents
Oso	Authorization as a service and policy engine
Cerbos	Open-source authorization and policy decision points
Open Policy Agent	General-purpose policy engine

Layer 11: Model gateways, routers, and inference control planes

Gateways normalize providers and add routing, fallbacks, caching, budgets, keys, policy, logging, and usage control. They do not replace an agent runtime.

AI-native gateways and model routers

Player	Primary fit
OpenRouter	Multi-provider model marketplace and routing API
Vercel AI Gateway	Unified model access integrated with Vercel AI SDK and platform
Cloudflare AI Gateway	Edge gateway, caching, logging, resilience, routing, and policy
LiteLLM	Open-source OpenAI-compatible proxy and provider normalization
Portkey AI Gateway	Gateway, routing, governance, observability, and prompt operations
Helicone AI Gateway	Open gateway with routing and observability
Unify	Performance/cost-aware model routing and unified API
Requesty	Multi-provider routing, fallbacks, and observability
Martian	Model routing and optimization
Not Diamond	Model selection and routing
Eden AI	Unified API across AI providers and capabilities
TrueFoundry AI Gateway	Enterprise gateway, governance, routing, and self-hosting

API gateways with AI capabilities

Player	Primary fit
Kong AI Gateway	Enterprise API gateway extended for LLM traffic
Gloo AI Gateway	Envoy-based enterprise AI gateway
Tyk AI Studio / Gateway	API management and AI gateway controls
Gravitee AI Gateway	API management, AI governance, and traffic control
Zuplo	Developer API gateway with AI and MCP features
WSO2 AI Gateway	Enterprise API and AI gateway
Apigee	Google Cloud enterprise API management with AI integrations
Envoy AI Gateway	Open-source Kubernetes/Envoy AI gateway

Layer 12: Model providers, inference clouds, and local runtimes

This is where model inference occurs. Some providers sell proprietary models, some host open models, and some supply inference software for your own compute.

Foundation-model providers and model platforms

Player	Primary offering
OpenAI	GPT, reasoning, realtime, image, audio, embeddings, and agent APIs
Anthropic	Claude models and agent/coding ecosystem
Google Gemini	Gemini models, multimodal APIs, and Google AI Studio
Amazon Bedrock	Managed access to Amazon and third-party foundation models
Microsoft Foundry Models	Azure model catalog and managed inference
Mistral AI	Frontier and open-weight models and APIs
xAI	Grok models and APIs
Cohere	Enterprise language models, embeddings, and reranking
DeepSeek	Reasoning, coding, and general models
Qwen	Alibaba's open and hosted multimodal/model family
AI21 Labs	Enterprise language models and orchestration
IBM watsonx.ai	Enterprise model platform and Granite models
NVIDIA NIM	Optimized model microservices and enterprise inference
Hugging Face	Model hub, datasets, endpoints, and inference services

Hosted inference for open and custom models

Player	Primary fit
Together AI	Hosted open-model inference and fine-tuning
Fireworks AI	Fast hosted inference, fine-tuning, and compound AI
Groq	Low-latency inference on LPU hardware
Cerebras Inference	High-speed model inference
SambaNova Cloud	Enterprise model inference and platform
Replicate	Hosted model execution and deployment
Baseten	Production model inference and deployment
Modal	Serverless model and GPU workloads
RunPod	GPU cloud, serverless inference, and endpoints
CoreWeave	GPU cloud and AI infrastructure
Lambda	GPU cloud and training/inference infrastructure

Self-hosted inference runtimes

Player	Primary fit
Ollama	Simple local model runtime and API
vLLM	High-throughput open-source serving engine
SGLang	High-performance serving and structured generation runtime
Hugging Face Text Generation Inference	Production serving for Hugging Face models
llama.cpp	Efficient local inference across CPUs and GPUs
LM Studio	Desktop local-model discovery, execution, and API server
LocalAI	OpenAI-compatible local inference stack
NVIDIA TensorRT-LLM	NVIDIA-optimized LLM inference
KServe	Kubernetes-native model serving
BentoML	Model packaging, serving, and deployment

Layer 13: Compute, storage, network, and deployment substrate

Every upper layer eventually runs on conventional infrastructure. The agent ecosystem does not remove this layer; it adds more dynamic workloads, secrets, egress, state, and isolation requirements.

Category	Representative players
Hyperscale cloud	AWS, Microsoft Azure, Google Cloud, Oracle Cloud
Edge/serverless	Cloudflare, Vercel, Fastly, Netlify
Application platforms	Fly.io, Railway, Render, Heroku, Northflank
Containers and orchestration	Docker, Kubernetes, Nomad, OpenShift
Databases and object storage	PostgreSQL, Supabase, Neon, MongoDB, S3, Cloudflare R2
Messaging and streaming	Kafka, Redpanda, NATS, RabbitMQ, Confluent
Secrets and key management	HashiCorp Vault, AWS Secrets Manager, Azure Key Vault, Google Secret Manager, Infisical

Where Goose and OpenCode sit

Goose and OpenCode are best understood as agent products/hosts, not merely SDKs:

User
  |
  +-- Goose desktop / CLI / API
  |     +-- agent loop
  |     +-- model-provider adapters
  |     +-- MCP extensions
  |     +-- built-in code and shell tools
  |     +-- recipes, scheduling, subagents, and ACP delegation
  |
  +-- OpenCode terminal / desktop / IDE
        +-- client/server agent architecture
        +-- model-provider adapters
        +-- LSP-aware coding tools
        +-- permissions and custom agents
        +-- MCP and SDK integration

They compete most directly with Claude Code, Codex, Gemini CLI, Cline, Roo Code, Kilo Code, Aider, Continue, and OpenHands. They do not directly replace an agent deployment platform such as AgentCore, a sandbox such as Daytona, a frontend framework such as CopilotKit, or a gateway such as LiteLLM.

Goose is broader and more automation-oriented: it is positioned as a general-purpose local agent with recipes, extensions, scheduling, and the ability to conduct other agents through ACP. OpenCode is more deliberately centered on the terminal/IDE software-development experience and a provider-neutral client/server architecture.

Common stack combinations

Open-source local coding agent

OpenCode or Goose
  -> MCP tools
  -> OpenRouter or LiteLLM
  -> Anthropic, OpenAI, Gemini, Qwen, or local Ollama/vLLM

Embedded enterprise copilot

CopilotKit
  -> LangGraph, Strands, Microsoft Agent Framework, or OpenAI Agents SDK
  -> Composio, Arcade, or internal MCP servers
  -> Mem0, Zep, or a vector database
  -> AgentCore, Foundry Agent Service, or Vertex Agent Engine
  -> model gateway
  -> model provider

Secure coding or data-analysis agent

Product UI
  -> agent framework
  -> Daytona, E2B, Runloop, or another sandbox
  -> repository, data, tests, and preview server
  -> observability and evaluation platform
  -> gateway and models

Fully self-hosted stack

Open WebUI or LibreChat
  -> LangGraph, PydanticAI, Mastra, or Strands
  -> self-hosted MCP servers
  -> PostgreSQL/pgvector, Qdrant, or Weaviate
  -> Langfuse and promptfoo
  -> NeMo Guardrails and OPA
  -> LiteLLM
  -> vLLM, SGLang, Ollama, or llama.cpp
  -> Kubernetes or conventional compute

Selection checklist

Do not choose one winner across the whole landscape. Choose one or two candidates per required layer:

Is the deliverable a finished agent product or a framework for building one?
Is the primary interface an IDE, terminal, web application, voice channel, or API?
Do you need model portability or a vendor-native experience?
Must workflows survive restarts, wait for humans, and run for hours or days?
Will the agent execute arbitrary code or operate a browser?
Which tools require delegated user identity rather than service credentials?
What memory is durable, deletable, attributable, and policy-governed?
How will you evaluate full trajectories, not only final text?
Where are prompt injection and tool authorization enforced?
Do you need self-hosting, air-gapping, regional control, or managed operations?
Which protocol boundaries should remain portable: MCP, A2A, AG-UI, or ACP?
What happens when the model, tool, browser, sandbox, or human approval times out?

Practical shortlist by problem

Problem	Start with
Open-source local coding agent	Goose, OpenCode, Cline, Roo Code, Aider, OpenHands
Vendor-native coding agent	Claude Code, Codex, Gemini CLI, GitHub Copilot
Embedded product copilot	CopilotKit, assistant-ui, Vercel AI SDK
TypeScript AI application	Vercel AI SDK, TanStack AI, Mastra
Python agent	PydanticAI, LangGraph, OpenAI Agents SDK, Strands, Google ADK
Explicit durable state machine	LangGraph plus LangSmith Deployment or a durable workflow platform
Multi-agent teams	Strands, CrewAI, Google ADK, Microsoft Agent Framework, AutoGen/AG2
Managed AWS operation	Bedrock AgentCore
Managed Azure operation	Microsoft Foundry Agent Service
Managed Google Cloud operation	Vertex AI Agent Engine
Edge/stateful realtime agent	Cloudflare Agents
Secure code and computer execution	Daytona, E2B, Runloop, Modal, Vercel Sandbox
Browser automation	Browserbase, Steel, Hyperbrowser, Browser Use, AgentCore Browser
Agent tools and OAuth	Composio, Arcade, Pipedream, Nango, Klavis
Long-term agent memory	Mem0, Zep, Letta, Cognee, Graphiti
RAG and document context	LlamaIndex, Haystack, Unstructured, Qdrant/Weaviate/Pinecone
Open-source observability	Langfuse, Phoenix, Opik, MLflow, Laminar
Evaluation and red teaming	Braintrust, DeepEval, Ragas, promptfoo, Giskard, Inspect AI
Runtime security	Lakera, NeMo Guardrails, LLM Guard, Protect AI, Invariant/Snyk
Self-hosted model gateway	LiteLLM, Portkey, Helicone, Envoy AI Gateway
Broad model marketplace	OpenRouter
Local model inference	Ollama, vLLM, SGLang, llama.cpp, LM Studio

Maintenance notes

This market changes rapidly. Recheck status, licensing, pricing, and product names before procurement.
Open-source client libraries and self-hosted servers from the same company can use different licenses.
A protocol implementation claim does not guarantee full interoperability; test the exact client/server pair.
“Memory” may mean message history, semantic facts, a knowledge graph, checkpoint state, or durable workflow state. These are different products and data-governance obligations.
“Agent platform” is heavily overloaded. Always identify whether it supplies authoring, hosting, execution, tools, UI, models, or only a control plane.
The earlier detailed comparison remains available in Vercel AI SDK vs TanStack AI and adjacent platforms.