Cross-runtime harness
One source compiled to outputs for Claude Code, Codex, Gemini CLI, Cursor, etc.
69 primary frameworks · 50 lower-confidence entries
Background worker service captures every tool call as an observation, AI-compresses sessions, and auto-injects relevant past context using a 3-layer…
A minimal, hackable, multi-provider terminal coding agent that adapts to your workflows via npm-installable TypeScript Extensions and Markdown Skills — without…
Encodes senior-engineer software development lifecycle as 23 auto-routed skills and 7 slash commands for any AI coding agent.
Single Markdown source for 83 domain-specialized plugins that auto-generates idiomatic artifacts for five AI coding harnesses.
Self-hosted AI coding assistant server (alternative to GitHub Copilot) with admin dashboard, RAG-based completions, and multi-IDE support.
Make each unit of engineering work compound into easier future work via brainstorm→plan→execute→review→learn cycles.
Open-source AI PR reviewer with single-call tool architecture, PR compression for large diffs, self-reflection quality gate, and cross-platform git provider…
macOS desktop app that eliminates context-switching overhead when running 10–100+ parallel AI coding agents, each isolated in its own git worktree, monitored…
Make git worktree lifecycle (create, switch, list, merge, cleanup) as simple as branch operations, designed for managing 5-10+ parallel AI agents.
Routes local process syscalls (network, files, DNS, env) through a live Kubernetes cluster pod so AI agents test against real infrastructure without deploying.
Prove that a ~130-line agent with only a bash tool can achieve >74% on SWE-bench, and serve as a clean research baseline + hackable daily tool.
Generate code that passes tests, nothing else — the smallest possible TDD agent that avoids the Roomba-under-the-table problem of general coding agents.
Mistral AI's first-party open-source CLI coding agent with configurable safety profiles, subagent delegation, SKILL.md custom skills, and ACP programmatic…
TypeScript server framework for building deployable headless agent services, with first-class Cloudflare Workers + Durable Objects support.
Portable Kiro-style SDD harness for 8 AI agents: discovery-routed spec creation with boundary-annotated tasks and autonomous subagent-per-task TDD…
Ultra-thin Python kernel (~2,600 lines) with formal module protocol contracts and a Git-based bundle marketplace, providing a Linux-kernel-style extensible AI…
57 slash commands (15 multi-agent workflows + 42 single-purpose tools) for Claude Code, now superseded by the wshobson/agents plugin marketplace.
Manages complex multi-session software projects by coordinating specialized AI agents (Planner/Manager/Workers) with human-mediated message relay and…
Be the AI pair programmer that watches your tmux screen and helps with any terminal workflow — no shell wrappers, no workflow changes, just observe and assist.
Engineering reasoning governor that enforces First Principles Framing — frame, compare under parity, decide with falsifiable contracts, detect stale evidence —…
Governs AI agent execution through formal decision contracts, parity-enforced comparisons, evidence-decay scoring, and bounded WorkCommissions — ensuring…
Delivers a complete spec-driven agentic workflow across 17 AI agents with worktree isolation, lane-tracked parallel execution, and retrospective-based workflow…
TypeScript port of LangChain's Python Deep Agents harness, with advanced generic type safety for agent configuration.
Python DSL for building multi-agent DAG pipelines with >> operator, Jinja2 prompt chaining, fanout/merge parallelism, iterative LLM-as-judge cycles, and native…
Cross-platform rule template for Cursor + Cline + Roo Code using symbolic links for a shared single source of truth with Agile-inspired SDLC workflow.
One native Rust binary that is a complete sovereign AI agent workspace — GUI, CLI, webapp, scheduler, knowledge base, and multi-agent orchestration — without…
Portable multi-agent harness that models an engineering team as specialized agents and projects them across 27+ AI tools from a single .agents/ source of truth.
Prevent AI context rot by keeping the orchestrator lean (~15% context) and running all implementation in parallel fresh 200k-token subagent contexts.
Orchestrate everything around code-writing — task selection, branch management, review, PR, merge, memory — via structured pipelines with gated phases and…
Provides production-grade OAuth-secured MCP tool aggregation across 14+ external services, eliminating the hardest part of connecting AI agents to real-world…
Verbatim Kiro IDE system prompt + 8 spec-driven development skills packaged as a Claude Code plugin for teams that want Kiro's methodology without the paid IDE.
Enforce spec-first task hygiene with fresh-context workers, source-tagged capture, and cross-model review gates to prevent context bleed and hallucinated…
Universal CLI package manager for AI coding agent configuration files (rules, commands, agents, skills, MCPs) with cross-platform conversion for 40+ tools.
Enterprise multi-agent server platform: always-on, multi-vendor failover, RBAC, approval gates, audit trail, and 8 IM channel adapters in a single Spring Boot…
Unified Python library for interacting with persistent bash sessions in any execution backend (local, Docker, Fargate, Modal).
Same as sandboxed-sh — self-hosted AI agent orchestrator (non-canonical slug).
Self-hosted orchestrator for AI coding agents with isolated workspaces, multi-runtime support, and Library-based configuration management.
Deploys an identical spec-driven workflow across 8 AI platforms using a CLI-first scaffold that the AI then populates.
Catches intent-implementation mismatches in AI-generated code by cross-examining git diffs against the stated goal and conversation history.
Brings GitHub PR-style human inline review to AI agent output (plans, code, live apps, HTML) with persistent per-line comments and round-to-round diffs,…
Python agent harness providing infrastructure (orchestration, resilience, observability, fallback chains) around any AI agent framework.
Python production agent harness with parallel tool batching, structured observations, BM25 tool retrieval, signature loop detection, and modular production…
Reference implementation of harness engineering for Claude Code — hook-enforced dual review, sentinel-driven state machine, and fail-closed safety where the AI…
Ships three selectable SDD methodologies (Simple/FIRE/AI-DLC) in one npm package so teams can graduate from lightweight to full lifecycle orchestration without…
Deterministic YAML-defined multi-agent workflow engine with Jinja2 routing, parallel execution, multi-model support, and a built-in real-time web dashboard.
Production operations runtime for executing frozen, long-horizon agentic programs (hanks) reliably, with single-threaded execution, git checkpointing, and…
Gives any LLM a computer via a runtime-computer isolation protocol — the harness never shares its keys or config with the agent.
Generates complete spec-driven toolkits for any domain from a single command, enabling the creation of domain-specific specification systems rather than…
AGENTS.md-based framework that enforces TDD, Plan Injection gates, and progressive skill loading across Cursor, Codex CLI, and Gemini CLI.
Implements memory as a plain text file managed by the agent's native Read/Write/Edit/Grep tools, with a structured analyze command that produces a 7-section…
Steering tile that enforces spec-before-code methodology via versioned skills, always-on rules, and an evaluation harness with 9 graded scenarios.
Provides a skills+rules+evals tile for spec-driven development with one-question-at-a-time requirement gathering, explicit stakeholder approval gates, and…
Wrap any coding agent in a hardware-isolated microVM with COW workspace snapshot, egress firewall, and interactive change review.
88-agent corporate-hierarchy orchestration system with Intel NPU hardware acceleration and multi-IDE support for enterprise-grade parallel AI task execution.
Compose reusable context/prompt/rule components into named pipelines that activate with one command, enabling instant context switching between development…
Translates a shared set of context-engineering patterns into native formats for 4 different AI coding tools via a Deno CLI installer.
Orchestrates 7 different AI coding agent CLIs from a single Kanban workflow with per-feature worktree isolation, Fleet mode for parallel agent comparison, and…
Give AI agents persistent memory, 34 skills, and personal calibration via an Obsidian vault so every session picks up exactly where the last one left off —…
ACP protocol adapter that exposes Cline's coding capabilities to non-VS-Code editors like Zed.
Pre-assembles deterministic context packages for agents at each lifecycle step, combined with a multi-language code graph and customizable schema that defines…
One command to generate correct, security-researched ignore configurations for all AI coding tools in a project, with documented CVEs and bypass…
Prevent the recurring AI-generated technical debt patterns (duplicate artifacts, convention drift, scope creep) that accumulate regardless of which AI tool is…
Prevent AI coding assistants from generating insecure or broken Next.js 15 + Supabase code by injecting stack-specific constraint rules into every AI…
Zero-friction template for spec-driven development in cloud workspaces: write a spec, run one command, get working code.
Open registry infrastructure for AI assets where agents can autonomously discover, install, and contribute reusable capabilities across sessions and platforms.
Full-platform AI software factory: hooks turn prompts into deterministic enforcement, Missions add structured orchestration, and Droid Computers provide…
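Several entries above share the category's core idea: one source compiled to outputs for multiple tools. The following is a minimal sketch of that pattern, not the method of any listed project. The `Target` type, the `compile_rules` function, the header strings, and the tool-to-filename mapping are all hypothetical and chosen for illustration; the filenames are ones that appear elsewhere in this listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Target:
    """One output format. All fields are illustrative assumptions."""
    tool: str
    filename: str
    header: str


# Hypothetical mapping; real tools may expect different files or layouts.
TARGETS = [
    Target("Claude Code", "CLAUDE.md", "# Project instructions"),
    Target("Codex", "AGENTS.md", "# Agent instructions"),
    Target("Cursor", ".cursorrules", "# Rules"),
]


def compile_rules(source: str, targets: list[Target] = TARGETS) -> dict[str, str]:
    """Render one shared rule source into one file body per target."""
    # Each non-empty line of the source is treated as a single rule.
    rules = [line.strip() for line in source.splitlines() if line.strip()]
    body = "\n".join(f"- {rule}" for rule in rules)
    return {t.filename: f"{t.header}\n\n{body}\n" for t in targets}


SOURCE = """
Write tests before implementation.
Keep changes scoped to the task.
"""

if __name__ == "__main__":
    # Print each generated file instead of writing to disk.
    for name, text in compile_rules(SOURCE).items():
        print(f"--- {name}\n{text}")
```

Keeping the rules in one place and treating each tool's file as a derived artifact is what lets a harness add a new target without touching the source.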
50 lower-confidence entries: tier-b · tier-c · unknown · delta reports
These entries map to § a8 by tag but carry weaker evidence — fewer documented primitives, delta reports of absent skills, or marketing-only sites without a public repo. They're listed for completeness; treat them with appropriate caution.
Open-source managed agents platform with 4 UI surfaces (web/desktop/mobile/CLI), squad routing, autopilots, and skill compounding — turns coding agents into org…
Run Claude agents securely in per-session Docker containers with multi-channel messaging (WhatsApp/Telegram/Discord) and credential vault isolation.
Enforces Manus-style persistent markdown planning on any AI coding agent via hooks that automatically re-inject plan state before every tool call.
Production-grade Python/.NET SDK for building, orchestrating, and hosting multi-agent AI workflows on Azure Foundry.
Lightweight framework for AI-powered offensive and defensive security automation, battle-tested in CTF competitions and real-world vulnerability discovery.
Five-phase PRD-to-shipped-code skill using GitHub Issues as canonical task store with parallel agent execution via git worktrees.
AI-native proxy built on Envoy that externalizes agent orchestration, LLM routing, observability, and safety as out-of-process middleware — agents are just HTTP…
Structured task management for AI coding agents using per-task markdown files, MCP, and a web Kanban board — all inside the git repo.
Enterprise AI platform providing centralized MCP registry, Kubernetes-native orchestration, dual-LLM security, and cost management for organizations adopting AI…
Cross-IDE adaptive software development lifecycle enforcement for AI coding agents, with mandatory audit trails and human approval gates at each phase.
Provides a memory-resilient Claude Code starter with dual git-shared + machine-local memory that survives CLAUDE.md resets, plus multi-provider delegation to Co…
Multi-language SDK for Alibaba Cloud's on-demand sandbox sessions across Browser, Desktop, Mobile, and Code execution surfaces.
Generates and continuously maintains AI context files (CLAUDE.md, Cursor rules, AGENTS.md, Copilot instructions) with deterministic quality scoring and automati…
VS Code extension that orchestrates 12 AI coding agent CLIs through a unified chat UI with 5-strategy Brainstorm Mode, 16 personas, and heuristic convergence de…
Terminal TUI companion panel that provides real-time git diff viewing, unified conversation history from 10+ AI coding agents, task monitoring, and workspace ma…
6-phase cybernetics-inspired pipeline for large-scale AI transformations with S.U.P.E.R architectural health framework, adaptive drift control, and GitHub-nativ…
Autonomous parallel-executing coding agent that reads project context naturally without special setup files.
Multi-provider multi-agent harness for OpenCode with Sisyphus orchestrator routing work across Anthropic/OpenAI/Google/xAI agents based on task domain.
SDLC control plane for coding agents: compounding corpus of decisions, learnings, and planning rules that makes each session smarter than the last.
Ports the full BMAD agile agent workflow (10 named personas, planning+implementation phases) into Claude Code's native hook and subagent infrastructure.
Runs multiple AI coding agents in parallel, each in its own git worktree, handling commits, PRs, and CI monitoring automatically.
3,500+ verified Docker terminal tasks + minimal ReAct BashAgent for evaluating and training terminal-capable AI agents.
Provide a visual web interface for OpenSpec workflows with PTY terminal, OPSX compose, project hooks, and static export.
Runs AI agent skills in hermetically isolated Docker containers against deterministic graders to produce reproducible pass/fail results across a model × case ma…
Conversational skill that interviews the user once, generates requirements/design/tasks, then creates identical Universal Instruction Blocks for every AI tool s…
Converts any MCP server into a production-quality agent skill package following the agentskills.io specification, with real introspection, OAuth support, and up…
Local-first TUI and CI-gate tool for retrospective analysis of AI coding-agent session cost, tokens, health, and latency across 15+ agent runtimes.
Strips AI-isms (sycophancy, stock vocabulary, hedging stacks, em-dash overuse) from LLM responses while preserving technical accuracy via research-backed subtra…
Extends OpenSpec with domain-specific artifact workflows for BDD, event-driven systems, ADRs, and Linear project management.
Self-hostable platform for creating, versioning, and distributing AI coding assistant configurations (CLAUDE.md, AGENTS.md, .cursorrules) across teams with ente…
Applies deterministic pattern-matched checks on AI-generated agent code to catch hallucinated tools, unbounded loops, and missing retry limits before code ships…
Provides a git-native, IANA-registered YAML format for AI project context (project.faf) that bi-syncs with CLAUDE.md/AGENTS.md and scores 0-100% AI-readiness vi…
Compiles OpenAPI specifications into all agent interface formats (MCP server, CLAUDE.md, AGENTS.md, skills, .cursorrules, A2A card, CLI) so API owners don't man…
A Next.js 16 SaaS starter kit with embedded Claude Code skills that prevent AI agents from hallucinating incompatible patterns by encoding exact import paths, A…
Dependency-aware JSONL issue tracker for AI agents with a ready-queue that surfaces only unblocked work.
Bridges OpenSpec's lifecycle control with superpowers execution discipline via a review.md readiness gate and per-change execution mode selection.
Manages MCP server enable/disable state across Claude Code, Cursor, and Cline to prevent context window exhaustion from startup tool-loading.
Combines codebase context packing, security scanning, workflow skills, and a persistent web dashboard into one zero-cloud tool that reduces per-prompt token cos…
Prevents AI agents from creating duplicate artifacts by enforcing a mandatory reuse-before-create decision chain as a precondition for any code-writing action.
SQLite-backed execution harness that gives AI agents task lifecycle management, lease-based concurrency control, workload-profile skill specialization, and Symp…
Gives AI assistants the full team context — PRD, ADRs, guidelines, process — needed to execute a complete SDLC without context loss.
A focused Python execution agent (LangGraph ReAct + persistent IPython kernel) that mandates constraint-independent verification before saving any solution, plu…
Provide full OpenSpec lifecycle orchestration within IntelliJ IDEA with flexible AI routing to any of 28 detected tools or direct API providers.
Packages PR diffs with full post-change file contents so LLMs can review code with context about unchanged lines, not just what changed.
CI linter that validates AI coding context files (CLAUDE.md, .cursorrules, AGENTS.md) against 12 rules covering security, structure, and AI anti-patterns.
Provides annotated fill-in-the-blank starter files for AI coding context (CLAUDE.md, .cursorrules, PRP) so developers don't write useless or harmful context fil…
Rust TUI for browsing, searching, filtering, and copying skill names from .agents/ skill directories used by Claude/Opencode coding assistants.
Prevents context budget exhaustion and hallucination on complex brownfield changes via subagent delegation and mandatory session restarts.
Provides a single shell-pipeable CLI for AI generation, web search, and data enrichment across 25+ providers so agents can delegate external calls without burde…
Git-native requirements management CLI generating dual human+AI documentation from a flat JSON hierarchy with built-in test traceability.
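One entry above describes a dependency-aware JSONL issue tracker with a ready-queue that surfaces only unblocked work. A minimal sketch of that idea follows, under stated assumptions: the field names (`id`, `status`, `deps`), the status values, and the functions `load_issues` and `ready_queue` are hypothetical and do not reflect the listed project's actual schema or API.

```python
import json

# Hypothetical JSONL store: one issue per line.
JSONL = """\
{"id": "A", "status": "closed", "deps": []}
{"id": "B", "status": "open", "deps": ["A"]}
{"id": "C", "status": "open", "deps": ["B"]}
{"id": "D", "status": "open", "deps": []}
"""


def load_issues(text: str) -> list[dict]:
    """Parse one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def ready_queue(issues: list[dict]) -> list[str]:
    """Return ids of open issues whose dependencies are all closed."""
    closed = {issue["id"] for issue in issues if issue["status"] == "closed"}
    return [
        issue["id"]
        for issue in issues
        if issue["status"] == "open" and all(dep in closed for dep in issue["deps"])
    ]


if __name__ == "__main__":
    print(ready_queue(load_issues(JSONL)))  # ['B', 'D']
```

Here `C` stays hidden because it depends on `B`, which is still open; an agent polling the queue sees only work it can start immediately.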