Batch 05 Notes — Major Coding Agents (Terminal/IDE Harnesses)
Agent: Phase B batch agent #05 of 33
Theme: Major coding agents (terminal/IDE harnesses)
Frameworks analyzed: aider, cline, opencode, goose, continue, tabby, gpt-engineer, openhands, swe-agent, sweep
Date completed: 2026-05-26
Roster Table
| Framework | Stars | License | CLI Binary | Local UI | Orchestration | Multi-Model | Tier |
|---|---|---|---|---|---|---|---|
| opencode | 165,457 | MIT | opencode | desktop-app | task-decomposition-tree | yes | A |
| openhands | 74,904 | NOASSERTION | openhands | web-dashboard | sequential | no | A |
| gpt-engineer | 55,211 | MIT | gpte | none | sequential | no | C (archived) |
| cline | 62,338 | Apache-2.0 | cline | vscode-extension | parallel-fan-out | no | A |
| goose | 45,882 | Apache-2.0 | goose | desktop-app | hierarchical | no | A |
| aider | 45,340 | Apache-2.0 | aider | terminal-tui | sequential | yes | A |
| tabby | 33,542 | NOASSERTION | tabby | web-dashboard | none | yes | A |
| continue | 33,391 | Apache-2.0 | cn | vscode-extension | sequential | no | A |
| swe-agent | 19,314 | MIT | sweagent | none | sequential | no | A |
| sweep | 7,712 | NOASSERTION | sweep | none (JetBrains plugin separately) | sequential | no | A (dormant) |
Total stars: ~543,000 across 10 frameworks
Intra-Batch Patterns
Every framework ships a CLI binary: all 10 have a named binary (`aider`, `cline`, `opencode`, `goose`, `cn`, `tabby`, `gpte`, `openhands`, `sweagent`, `sweep`). This contrasts with seed frameworks (BMAD, spec-kit), which are methodology docs without executables.
Isolation bifurcation: 3 frameworks run in Docker containers (Tabby, OpenHands, and OpenCode via Docker support); 7 run in place. The containerized frameworks score better on SWE-bench because they can safely run test suites and roll back file changes. In-place runners avoid the Docker dependency but risk polluting the host environment.
The REPL-vs-pipeline split: SWE-agent forces one command per turn (REPL loop); all other frameworks allow multi-step internal reasoning before producing output. SWE-agent's REPL constraint is a research design choice for trajectory analysis; it would be inconvenient in production.
Multi-model routing is rare but significant: Only aider (Architect/Editor) and opencode (per-agent routing) implement true per-role model assignment. The majority (8 of 10) use a single model throughout.
Pivot stories are common: 2 of 10 frameworks show clear product pivots — gpt-engineer (archived, spawned Lovable.dev), sweep (GitHub bot → JetBrains plugin). The pattern: GitHub bot → IDE plugin as the commercial destination.
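The per-role model assignment described in the multi-model routing pattern can be illustrated with a minimal sketch. The role names, model identifiers, and the `ModelRouter` class below are hypothetical and are not taken from aider's or opencode's code; the sketch only contrasts a role-to-model table with a single model used throughout.

```python
from dataclasses import dataclass, field


@dataclass
class ModelRouter:
    """Maps an agent role to a model name, with a default fallback."""
    default_model: str
    role_models: dict[str, str] = field(default_factory=dict)

    def pick_model(self, role: str) -> str:
        # Roles without an explicit entry fall back to the default model.
        return self.role_models.get(role, self.default_model)


# Hypothetical model names, for illustration only.
single = ModelRouter(default_model="model-a")
per_role = ModelRouter(
    default_model="model-a",
    role_models={"architect": "model-large", "editor": "model-small"},
)

for role in ("architect", "editor", "reviewer"):
    print(role, single.pick_model(role), per_role.pick_model(role))
```

With an empty role table the router degenerates to the single-model case, which is how most frameworks in this batch behave.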
Most Interesting Find
Sweep's `sweep.yaml` blocks its own prompt file: The Sweep repository's own `sweep.yaml` includes `blocked_dirs: ["sweepai/core/prompts.py"]`, meaning Sweep (the GitHub bot) is configured to refuse to modify its own core system prompts when processing issues against the Sweep repo. This self-referential meta-constraint takes `sweep.yaml` behavioral rule injection to its logical extreme. No other framework in this batch has a comparable self-referential constraint in its own config.
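A minimal sketch of how a `blocked_dirs`-style rule could be enforced before an agent writes files. This is not Sweep's implementation: the `is_blocked` helper, its prefix-matching semantics, and the `example_module.py` path are assumptions made for illustration. Only the `blocked_dirs` key and the `sweepai/core/prompts.py` entry come from the notes above.

```python
from pathlib import PurePosixPath

# The single entry reported in the notes; the matching logic is assumed.
BLOCKED_DIRS = ["sweepai/core/prompts.py"]


def is_blocked(path: str, blocked: list[str] = BLOCKED_DIRS) -> bool:
    """Return True if `path` equals a blocked entry or lies beneath it."""
    candidate = PurePosixPath(path)
    for entry in blocked:
        blocked_path = PurePosixPath(entry)
        if candidate == blocked_path or blocked_path in candidate.parents:
            return True
    return False


# Hypothetical list of files an agent proposes to edit.
proposed_edits = [
    "sweepai/core/prompts.py",
    "sweepai/core/example_module.py",
    "README.md",
]
allowed = [p for p in proposed_edits if not is_blocked(p)]
print(allowed)
```

Filtering the proposed edits up front means the prompt file never reaches the write step, which matches the refusal behavior the notes describe.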
Tier C Items
- gpt-engineer (slug: `gpt-engineer`): Archived repository (last push May 2025, ARCHIVED status). Sufficient material for full 11-file analysis: the `preprompts/` directory contains well-documented prompt philosophy. The `gpte` binary still installs, but the project is officially discontinued. Marked `maintainer_status: archived` in METRICS.yaml. Documented as the direct precursor to Lovable.dev (web app generation) and spiritual ancestor of all single-shot code generation frameworks.
Status Notes
- sweep: Marked `maintainer_status: dormant` (last commit 2025-09-18, no active development on the open-source repo). The JetBrains plugin at `plugins.jetbrains.com/plugin/26275-sweep-ai` is active but is a separate product not in this repo. All 11 files cover the open-source GitHub bot/CLI codebase.
- tabby: Contributor count capped at 100 (GitHub API max_per_page); actual count may be higher.
- opencode: Repo moved from `sst/opencode` to `anomalyco/opencode` during analysis. All API calls resolved via the original URL (GitHub redirect), and the canonical reference is included in files.
- openhands: `agenthub/` directory returned 404 (structure changed in main branch). Analysis used the `.openhands/microagents/` and `skills/` directories, which contain the equivalent content.
Cross-References Discovered
- aider SEARCH/REPLACE format is referenced or adopted by: opencode (same format), SWE-agent (`edit_anthropic` tool bundle), cline (partial)
- SWE-bench benchmark is the common evaluation target for: openhands (77.6%), SWE-agent (original benchmark developers), sweep (unmeasured, predates SWE-bench)
- `sweep.yaml` rules pattern is the ancestor of: `.clinerules/` (Cline), `.cursorrules` (Cursor), `CLAUDE.md` (Claude Code); all implement the same "project rules as behavioral constraints" paradigm
- gpt-engineer's `preprompts/philosophy` (coding principles as system prompt constraints) directly anticipates OpenHands' microagent PERSONA pattern and Sweep's aspirational quality claims
- Docker two-container architecture (OpenHands: runtime + sandbox) is a more sophisticated version of the container isolation that tabby uses for server deployment
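The SEARCH/REPLACE edit format named in the first cross-reference can be sketched as follows. This is a simplified illustration, not aider's parser: the `apply_edit` helper is hypothetical, it handles a single block, and it assumes the SEARCH text must match the source exactly once.

```python
import re

# Conflict-marker style block; single-block, exact-match handling is a simplification.
BLOCK_RE = re.compile(
    r"<<<<<<< SEARCH\n(?P<search>.*?)\n=======\n(?P<replace>.*?)\n>>>>>>> REPLACE",
    re.DOTALL,
)


def apply_edit(source: str, edit_block: str) -> str:
    """Apply one SEARCH/REPLACE block to `source`, requiring a unique exact match."""
    match = BLOCK_RE.search(edit_block)
    if match is None:
        raise ValueError("no SEARCH/REPLACE block found")
    search, replace = match.group("search"), match.group("replace")
    if source.count(search) != 1:
        raise ValueError("SEARCH text must match exactly once")
    return source.replace(search, replace)


source = "def greet():\n    print('hello')\n"
edit = (
    "<<<<<<< SEARCH\n"
    "    print('hello')\n"
    "=======\n"
    "    print('hello, world')\n"
    ">>>>>>> REPLACE"
)
print(apply_edit(source, edit))
```

Requiring a unique match is a conservative design choice: an ambiguous or missing SEARCH section raises an error instead of silently editing the wrong location.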