Vibe-Coded Evals with LLM-as-a-Judge
December 13, 2025 • 2 min read
Using Claude Code and OpenRouter infrastructure to rapidly build model evaluations with Claude Opus 4.5 as the judge