Agent-EvalKit: Open-Source Evaluation for AI Agents in Claude Code11. June 2026AI Models, Claude AI, Claude CodeShare on:Agent-EvalKit automates the evaluation of AI agents through structured test-case generation, observability instrumentation, and combined code and LLM-based metrics directly in the development environment. Share on: