Design evaluation strategies for AI agents
Describe your agent to get eval recommendations
Coding Agent
Code generation, debugging, refactoring
Conversational
Customer support, chat assistants
Research Agent
Web research, data gathering, analysis
Computer Use
Browser automation, GUI interaction
Medium (mostly consistent)
Configure your agent to see recommendations