Caliber — Summary
Caliber is a CLI tool and agent skill pack that continuously generates and maintains AI context files (CLAUDE.md, .cursor/rules/*.mdc, AGENTS.md, copilot-instructions.md) for any codebase, keeping them accurate as the code evolves. It provides a deterministic 100-point scoring system (caliber score) that evaluates context file quality without any LLM calls, an LLM-driven generation workflow (caliber init / /setup-caliber skill), pre-commit hooks for automatic refresh on every commit, and session-learning hooks that distill agent interaction patterns into CALIBER_LEARNINGS.md.
Problem it solves: Hand-written context files go stale the moment code is refactored — the AI agent hallucinates paths that no longer exist, misses new dependencies, and gives advice based on yesterday's architecture. Caliber automates the full create-score-refresh loop so context files stay accurate without manual maintenance.
Distinctive trait: The deterministic scoring system — 100 points across 6 categories (Files & Setup 25, Quality 25, Grounding 20, Accuracy 15, Freshness 10, Bonus 5) — evaluates context by cross-referencing config files against the actual project filesystem: do referenced paths exist? Are code blocks present? Is there config drift since the last commit? No LLM call needed for scoring, making it fast and reproducible.
Target audience: Teams using multiple AI tools simultaneously (Claude Code + Cursor + Codex + OpenCode + GitHub Copilot) who need synchronized context files across all agents. Multi-provider LLM support: Claude Code seat, Cursor seat, Anthropic API, OpenAI API, Vertex AI, custom endpoint.
Differs from seeds: Most similar to agent-os (Archetype 4 — generates and manages markdown context files) but adds a complete automation layer: deterministic quality scoring, LLM-driven generation, pre-commit hooks for continuous sync, session-learning hooks, and multi-platform output (5 AI tools instead of 1). Unlike agent-os which ships a fixed set of opinionated standards, Caliber generates project-specific content using LLM analysis of the actual codebase.