mini-swe-agent — Summary
mini-swe-agent is a deliberately minimal Python coding agent from the Princeton/Stanford SWE-bench team that answers the question: "What if our agent was 100x simpler, and still worked nearly as well?" The core DefaultAgent class is ~130 lines of Python, uses only bash as a tool (no file-specific tools), executes each action with subprocess.run (stateless — no persistent shell session), maintains a completely linear message history, and supports all LLMs via litellm. Despite this radical simplicity, it scores >74% on SWE-bench verified with Gemini 3 Pro. It ships a CLI (mini), multiple sandboxed execution environments (Docker, Singularity, bubblewrap, contree), a trajectory browser UI (Textual TUI), and batch evaluation mode. The intentional minimalism serves two purposes: (1) a clean research baseline that isolates LLM capability from agent scaffold; (2) a hackable daily-use tool.
Compared to seeds: no seed framework approaches this level of intentional minimalism. The closest is agent-os (zero-primitive scaffold), but mini-swe-agent is a fully functional agent with 74% SWE-bench performance in ~100 lines. The architectural insight — single bash tool + stateless subprocess execution + linear history — is unique in the entire corpus.