Skip to content
/

RUBRIC-v3 — The evaluation schema

Every framework in the corpus was analyzed against RUBRIC-v3, a 25-dimension engineering schema. It deliberately excludes pricing, sentiment, and closed-source signals — those are derivable noise, not architectural differentiators. The schema was finalized in Phase B and extended with 5 new dimensions in Phase D (cross-vendor model routing, compiled enforcement, self-evolving plugins, token-efficient encoding, LLM-evaluated hooks).

What changed from v2

Dropped from v2: pricing_model, closed_source_ide, security_incidents, token_cost_concerns, reddit_sentiment, hn_sentiment. Added in v3: cli_binary, local_ui, multi_model, orchestration_pattern, isolation_mechanism, scripts_vs_hooks, subagent_definition_format, auto_validators, git_automation, audit_log, execution_mode, cross_tool_portability.

Output structure per framework (11 files)

Each analysis lives under spec-system/<slug>/ with 11 files:

# File Content focus
1 00-summary.md 5–7 sentence elevator pitch
2 01-overview.md Origin, philosophy, manifesto-style quotes verbatim
3 02-architecture.md Distribution, install, dir tree, deps, target AI tools
4 03-components.md Every named primitive (cmd / skill / agent / hook / mcp / script / template)
5 04-workflow.md Phases + artifacts per phase + approval gates
6 05-prompts.md Verbatim excerpts from ≥2 key prompt files
7 06-memory-context.md State storage, persistence, compaction, handoffs
8 07-orchestration.md Multi-agent pattern, multi-model routing, isolation, execution mode
9 08-ui-cli-surface.md Dedicated CLI binary, local UI/dashboard, IDE integration, observability
10 09-uniqueness.md differs_from_seeds paragraph + positioning + failure modes
11 METRICS.yaml Machine-readable, v3 schema exactly

The 25 engineering dimensions

Distribution
distribution_typeinstall_complexityrequired_runtimeconfig_files
CLI & UI surface
cli_binary (exists, name, subcommands)local_ui (type, port, stack, features)
Primitives
commandsskillssubagentshooksscriptsmcp_serverstemplates
Workflow
workflow_phasesapproval_gatesspec_formatdelta_or_whole_file_specs
Orchestration
orchestration_patternisolation_mechanismmax_concurrent_agentsprompt_chaining_pattern
Multi-model
multi_modelmodel_role_mappingsupports_byoklocked_to_model
Execution lifecycle
execution_modecrash_recoverycontext_compaction_handlingcross_session_handoff
State & memory
memory_typememory_persistencestate_filessearch_mechanism
Quality gates
tdd_enforcedtdd_enforcement_mechanismauto_validatorsself_review_pattern
Git automation
commits_automaticallycreates_pr_automaticallymerges_automaticallyworktree_per_feature
Observability
audit_logaudit_log_formatreplay_capability
Portability
target_toolsprimary_toolcross_tool_portability

What "good" looks like

A great v3 report answers these questions without reading prose: