Plano — Summary
Plano is an AI-native proxy server and data plane for agentic applications, built on Envoy proxy by its core contributors (Katanemo). It externalizes four concerns from agent code: orchestration (YAML-declared intent routing via a purpose-built 4B-parameter LLM), model agility (unified LLM API with automatic provider fallback), Agentic Signals (zero-code OTEL trace capture), and filter chains (guardrails, moderation, and memory hooks). Agents are plain HTTP servers implementing OpenAI-compatible chat completions — Plano adds the routing, observability, and safety layer without framework dependencies. The planoai CLI (up, down, logs, trace, init) manages a Docker-based deployment of Envoy + brightstaff (native Rust binary).
Compared to seeds: Plano is closest to AgentGateway and ContextForge (both are AI proxies) but distinguishes itself by: (1) using Envoy as the underlying proxy rather than a custom Rust binary, (2) the purpose-built "Plano-Orchestrator" 4B-parameter LLM for intent routing at fraction of GPT-4 cost, and (3) framing agents as plain HTTP services with no framework dependency — the most opinionated "your agent is just an HTTP server" stance in the corpus.