. About the Role Mindrift is looking for passionate freelance contributors to join the Tendem project ( ) to help build the... future of hybrid agents — where human expertise and AI capabilities work hand in hand As an AI Agent Assistant, you’ll...
is unique, you might typically: Review pre-generated intents and transform them into a logical sequence of agent steps; Define... a test case: dialogue flow (user request, intermediate steps, agent’s final response); Create a Golden Path: the optimal...
evaluation and complex data annotation Training and evaluation of large language models Benchmarking and agent-based code.... Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm...
evaluation and complex data annotation Training and evaluation of large language models Benchmarking and agent-based code... and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet...
project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout...”). Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.). Benefits Get paid...
) servers and internal tools for running and evaluating agent behavior. You’ll implement base methods for agent action...-compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending...