PLUGIN 0 upvotes 3

Agent Benchmark Kit

Automated quality assurance for Claude Code agents using LLM-as-judge evaluation. Built by BrandCast.

by BrandCast-Signage View Source

Tags

community