Evaluation framework for testing LLM agents' ability to use MCP tools
-
Updated
Apr 24, 2026 - Python
Evaluation framework for testing LLM agents' ability to use MCP tools
A pytest plugin integrating pydantic-evals
YAML Based Eval Specification Language and AI generation pipeline for LLMs and Developers.
Add a description, image, and links to the pydantic-evals topic page so that developers can more easily learn about it.
To associate your repository with the pydantic-evals topic, visit your repo's landing page and select "manage topics."