Key Facts

Remote From:

Brazil

Full time

Senior (5-10 years)

English

Hard Skills

Other Skills

•
Communication
•
Teamwork
•
Analytical Thinking
•
Detail Oriented
•
Problem Solving

Roles & Responsibilities

Experience evaluating Generative AI, LLMs, and agentic AI systems.
Strong understanding of AI/ML evaluation metrics and error analysis.
Hands-on experience with Python and AI evaluation workflows.
Familiarity with RAG architectures, prompt evaluation, and agent orchestration.

Requirements:

Design and implement evaluation frameworks for AI agents, LLMs, and RAG-based systems.
Measure accuracy, relevance, consistency, hallucinations, and task success across AI outputs.
Establish baseline and comparative evaluations across models, prompts, and agent strategies.
Validate agent decision logic, reasoning paths, and tool usage for explainability and traceability.

Job description

We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions.

With over 8,000 CI&Ters around the world, we’ve built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality.

Role Overview

The AI Agent Evaluation Engineer is responsible for ensuring the quality, accuracy, explainability, and reliability of AI agent systems across Proof-of-Concept, Pilot, and Production. The role focuses on establishing enterprise-grade evaluation frameworks for agentic AI, LLMs, and AI-driven workflows to ensure outputs are trustworthy, measurable, and continuously improving.

Key Responsibilities

• Design and implement evaluation frameworks for AI agents, LLMs, and RAG-based systems.

• Measure accuracy, relevance, consistency, hallucinations, and task success across AI outputs.

• Establish baseline and comparative evaluations across models, prompts, and agent strategies.

• Validate agent decision logic, reasoning paths, and tool usage for explainability and traceability.

• Support human-in-the-loop (HITL) evaluation for high-impact or high-risk use cases.

• Partner with engineering teams to improve prompts, retrieval strategies, and agent orchestration.

• Validate AI observability, monitoring, drift detection, and regression controls.

• Support vendor PoCs, pilots, and RFP evaluations with fact-based assessments.

Required Qualifications
• Experience evaluating Generative AI, LLMs, and agentic AI systems.

• Strong understanding of AI/ML evaluation metrics and error analysis.

• Hands-on experience with Python and AI evaluation workflows.

• Familiarity with RAG architectures, prompt evaluation, and agent orchestration.

• Experience with cloud AI platforms (Azure or GCP preferred).

Preferred Qualifications
• Experience in Education, Healthcare, or other regulated domains.

• Exposure to synthetic data generation and test scenario design.

• Familiarity with AI governance, risk, and compliance practices.

Success Measures

• Measurable improvement in AI accuracy, reliability, and trustworthiness.

• Clear visibility into why AI agents made specific decisions.

• Standardized evaluation frameworks adopted across AI initiatives.

• Increased leadership confidence in AI-driven outcomes.

Our benefits:

-Health and dental insurance

-Meal and food allowance

-Childcare assistance

-Extended paternity leave

-Partnership with gyms and health and wellness professionals via Wellhub (Gympass) TotalPass;

-Profit Sharing and Results Participation (PLR);

-Life insurance

-Continuous learning platform (CI&T University);

-Discount club

-Free online platform dedicated to physical, mental, and overall well-being

-Pregnancy and responsible parenting course

-Partnerships with online learning platforms

-Language learning platform

And many more!

More details about our benefits here: https://ciandt.com/br/pt-br/carreiras

At CI&T, inclusion starts at the first contact. If you are a person with a disability, it is important to present your assessment during the selection process. See which data needs to be included in the report by clicking here.This way, we can ensure the support and accommodations that you deserve. If you do not yet have the assessment, don't worry: we can support you in obtaining it.

We have a dedicated Health and Well-being team, inclusion specialists, and affinity groups who will be with you at every stage. Count on us to make this journey side by side.

Ready to apply?

APPLY

Share ·

AI Operations (AI Ops) Engineer Related jobs

Brazil AI Operations (AI Ops) Engineer

Senior Data Engineer- AI/ML (Remote)

30+ days ago

Ad Hoc LLC

Fixed term

MLOps (Machine Learning Operations)PyTorch (Machine Learning Library)Python (Programming Language)EmbeddingMLflow

Principal Software Engineer, AI Developer Tools

30+ days ago

Docker

Full time

Artificial IntelligenceArgo CDPrompt EngineeringKubernetesObservability

AI Solutions Engineer

30+ days ago

SHI International Corp.

Full time

Nvidia CUDAData Center DesignSales ManagementProof Of Concept (POC) DevelopmentReference Application

Associate Distinguished Engineer - AI, Data Science & Agentic Solutions

30+ days ago

Nagarro

Full time

Multi-Agent SystemsMLOps (Machine Learning Operations)Data InfrastructureKubernetesObservability

Software Engineer AI/ ML

30+ days ago

Genesys

Full time

Java (Programming Language)Machine LearningAWS Cloud ServicesPython (Programming Language)Data Structures

Other jobs at Ci&T

[Job-28591] Analista de Suporte/ Sustentação (Mulheres), Brasil

Today

Ci&T

Full time

Java (Programming Language)SQL (Programming Language)Git (Version Control System)System MonitoringProduction Engineering

[Job-28294] Mid-Level Automation I Java Developer, Brazil

Today

Ci&T

Full time

Java (Programming Language)Java (Programming Language)Test AutomationSelenium (Software)Application Programming Interface (API)

[Job-28482] Senior .NET Developer, Brazil

Today

Ci&T

Full time
Senior (5-10 years)

.NETMicrosoft Azure.NET DevelopmentKubernetesC# (Programming Language)

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.

[Job - 28525] AI Quality Engineer Senior, QA

Key Facts

Hard Skills

Other Skills

Roles & Responsibilities

Requirements:

Job description

AI Operations (AI Ops) Engineer Related jobs

Senior Data Engineer- AI/ML (Remote)

Principal Software Engineer, AI Developer Tools

AI Solutions Engineer

Associate Distinguished Engineer - AI, Data Science & Agentic Solutions

Software Engineer AI/ ML

Other jobs at Ci&T

[Job-28591] Analista de Suporte/ Sustentação (Mulheres), Brasil

[Job-28294] Mid-Level Automation I Java Developer, Brazil

[Job-28482] Senior .NET Developer, Brazil

We help you get seen. Not ignored.

Auto-Apply

AI Match Feedback