Evaluation Scenario Writer AI Agent Testing Specialist

Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Bachelor's or Master's degree in Computer Science, Data Science, AI, NLP, or related fields., At least 3 years of professional experience in relevant areas., Advanced proficiency in English (C1 or above)., Strong analytical skills, attention to detail, and adaptability to new methods..

Key responsibilities:

  • Design realistic and structured evaluation scenarios for AI agents.
  • Define gold-standard behaviors and annotate task steps and edge cases.
  • Collaborate with developers to test and refine scenarios.
  • Review agent outputs and adapt tests to improve clarity and effectiveness.

Mindrift logo
Mindrift Information Technology & Services SME https://mindrift.ai/
501 - 1000 Employees
See all jobs

Job description

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.

At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.

What we do

The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into realworld expertise from across the globe.

About the Role

We’re looking for someone who can design realistic and structured evaluation scenarios for LLMbased agents. You’ll create test cases that simulate humanperformed tasks and define goldstandard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, wellscored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Although every project is unique, you might typically:

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Detail Oriented
  • Analytical Thinking
  • Adaptability
  • Problem Solving

Related jobs