AI Agent Evaluation Analyst - AI TrainerAt Mindrift, innovation meets opportunity. We believe in using the power... AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation...
and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance... (edge cases, failure modes, "what could go wrong") Some understanding of how scoring or evaluation works in agent testing...