The Fellowship of Agentic Evaluations: How to Evaluate an Agent?
How do you know if your AI agent is actually doing the right thing? In this workshop, we'll explore practical evaluation frameworks for agentic systems. Forming a fellowship of evaluation techniques—from simple unit tests to complex behavioral evaluations—we'll apply them to real agent scenarios. You'll learn to define evaluation criteria, implement automated test suites, measure agent performance quantitatively, and track improvement over time.
Speakers
Want to know more?
Join PyCon Colombia newsletter and get a complete overview of our events, speakers and community participation.



