Skip to content
/
Phase B Batch 29

Batch 29 Notes — Harness Frameworks (Desktop / Durable / .NET / JVM / Slack-integrated)

10 frameworks in this batch

Batch 29 Notes — Harness Frameworks (Desktop / Durable / .NET / JVM / Slack-integrated)

Roster

# Slug Display Name Tier Stars Distinctive Feature
1 hexagent HexAgent A ~200 Computer Protocol isolation; VM-per-agent (WSL2/Lima); SKILL.md progressive loading
2 open-cowork Open Cowork A ~150 Chinese-authored; Lima VM sandbox; SKILL.md catalog; deep Claude Code integration
3 thclaws thClaws A ~300 Rust binary; tmux multi-window UX; KMS grep-wiki; /dream background consolidation; 8 hook events
4 mateclaw MateClaw A ~180 Spring Boot JVM daemon; 8 IM channel adapters; 14+ LLM provider failover; LESSONS.md adaptive skills
5 openclaw-net OpenClaw.NET A ~120 .NET/NativeAOT; Passive Harness Contracts JSON; 48 native tools; 9 channel adapters; port 18789
6 utah-inngest Utah (Inngest) A ~90 Inngest durable execution; every LLM call is step.run(); cancelOn singleton; SOUL.md/USER.md/MEMORY.md
7 mini-coding-agent mini-coding-agent A ~400 Zero-dependency single Python file; Ollama-only; 6 labeled components; educational reference
8 deerflow DeerFlow A 69608 ByteDance; Docker sandbox isolation; ultra parallel sub-agents; claude-to-deerflow bridge; highest stars in batch
9 coleam00-your-claude-engineer your-claude-engineer A ~850 Arcade OAuth gateway; Linear as checkpoint store; screenshot evidence gate; per-agent model selection
10 backgrounder-dev backgrounder.dev C N/A Closed SaaS — insufficient public material

Intra-Batch Patterns

1. The Daemon Spectrum

This batch spans the full range of execution durability:

  • backgrounder.dev (Tier C) — cloud-hosted, always-on (inferred)
  • MateClaw — Spring Boot daemon, always-on, survives reboots via systemd/launchd
  • thClaws — Rust process with native daemon scheduling (launchd/systemd-user)
  • Utah/Inngest — serverless durable execution; "always on" via Inngest step checkpointing
  • your-claude-engineer — script-per-run; durability via Linear issue state

The spectrum from "long-running OS process" to "event-driven serverless with durable steps" to "external SaaS as checkpoint" represents three distinct architectural answers to the same question: how do agents survive restarts?

2. Language Runtime Diversity

Batch 29 is the most language-diverse in the entire Phase B corpus:

  • Rust binary (thClaws) — compiled, no runtime, tmux-native
  • JVM / Spring Boot (MateClaw) — enterprise runtime, port 8080
  • .NET / NativeAOT (OpenClaw.NET) — compiled binary, no runtime dependency
  • Python script (your-claude-engineer, mini-coding-agent, Utah/Inngest)
  • Node.js (Utah/Inngest, next.js frontend in DeerFlow)
  • Python + Node.js hybrid (DeerFlow, HexAgent, Open Cowork)

3. External Service Integration Trinity

your-claude-engineer is the only framework in the entire corpus that integrates Linear + GitHub + Slack simultaneously via a single OAuth gateway. Most frameworks treat external services as optional add-ons or avoid them entirely.

4. Isolation Mechanism Progression

Framework Isolation Level
mini-coding-agent None (host process)
your-claude-engineer None (host process)
thClaws tmux window separation
MateClaw JVM thread isolation
OpenClaw.NET Process boundary
Utah/Inngest Inngest step isolation
Open Cowork Lima VM (macOS)
HexAgent WSL2 (Windows) / Lima (macOS)
DeerFlow Docker container per task (AioSandboxProvider)

5. The "Educational" vs "Production" Axis

  • mini-coding-agent: Explicitly educational — 6 source components are labeled in comments; zero dependencies; the README names the reference papers and patterns
  • DeerFlow: Production-grade with 69K stars, ByteDance backing, Docker sandboxing, and v1→v2 complete rewrite
  • MateClaw, OpenClaw.NET, thClaws: Mid-spectrum production harnesses with real deployment infrastructure

Most Interesting Find

Utah/Inngest's cancelOn pattern is the most technically novel element in this batch. The cancelOn: [{ event: "app/chat.message", if: "async.data.threadId == event.data.threadId" }] configuration cancels any in-progress agent invocation when a new message arrives on the same thread. This solves the "rapid typing race condition" problem that most harnesses ignore entirely — sending two messages quickly would trigger two concurrent agent runs with conflicting tool calls. The Inngest event-driven model makes this a one-line config rather than a complex concurrency control system.

Tier C Items

Slug Reason
backgrounder-dev Closed SaaS; no public GitHub repo; no technical documentation available

Cross-References to Other Batches

  • DeerFlow → claude-to-deerflow skill: The skill bridges Claude Code → DeerFlow; relates to DeerFlow being callable from Claude Code terminal (inversion of normal direction). Cross-reference with Batch 5 (major coding agents) for Claude Code plugin ecosystem.
  • your-claude-engineer → Arcade MCP gateway: Arcade gateway approach for multi-service OAuth is referenced by other harness frameworks. Cross-reference with Batch 1 (OpenSpec ecosystem) for MCP gateway patterns.
  • thClaws KMS pattern: The grep+read wiki (no-embeddings) is the Karpathy LLM-wiki pattern. Cross-reference with Batch 12 (Memory compaction/KG/pruning) for alternative memory approaches.
  • OpenClaw.NET Passive Harness Contracts: The JSON work plan pattern that doesn't block execution relates to the spec-driven development workflows in Batch 3 (BMAD-aware + cross-runtime SDD).
  • Utah/Inngest durable execution: The Inngest step.run() checkpointing model is comparable to LangGraph StateGraph checkpointing in DeerFlow (v2 rewrite). Both solve long-horizon task durability.

Notable Gaps

  • MateClaw and OpenClaw.NET: Both are Chinese-authored frameworks with non-English primary documentation. The English README content was sufficient for analysis but deeper prompt engineering details may be underrepresented.
  • thClaws /dream feature: The background side-channel agent that consolidates KMS from session history is mentioned in the README but implementation details in Rust source weren't fully inspectable in this analysis pass.
  • DeerFlow v1 vs v2: The v2 rewrite is a complete ground-up rewrite. The main-1.x branch (v1 Deep Research framework) represents a substantially different architecture that was not analyzed.