Framework for Evaluating AI Agents in Production: Lessons from 100+ Deployments | She Talks AI