Agent Benchmark Kit
Automated quality assurance for Claude Code agents using LLM-as-judge evaluation. Built by BrandCast.
by BrandCast-Signage
Tags
community