pydantic-evals

Here are 3 public repositories matching this topic...

Evaluation framework for testing LLM agents' ability to use MCP tools

mcp code-first async-first llm logfire pydantic-ai pydantic-evals

A pytest plugin integrating pydantic-evals

pytest pytest-plugin evals pydantic-ai pydantic-evals

YAML-based eval specification language and AI generation pipeline for LLMs and developers.

yaml specification llms evals pydantic-evals
