Key Facts

Remote From:

Full time

Portuguese, Spanish, English, Italian, Polish

Hard Skills

Intelligent Agent Large Language Modeling TypeScript User Feedback Prompt Engineering Database Cursor Observability Azure OpenAI Controlled Experiments Solution Benchmarking Agent-Based Model CI/CD Issue Tracking Software Engineering Open Systems Architecture Ethical AI Messaging Strategy

Other Skills

•
Quality Assurance
•
Adaptability
•
Dealing With Ambiguity
•
Teamwork
•
Problem Solving

Roles & Responsibilities

Functional coding in TypeScript or Python
Demonstrated experience with LLMs (personal project or professional)
Experience with creating AI agents
Familiarity with LLM providers and model trade-offs

Requirements:

Maintain evals to keep close to 100% accuracy and update them as we get new feedback
Monitor user feedback to ensure our agents have a high level of accuracy
Run new experiments to proactively increase the performance of current agents (test new prompts, models, agent architectures)
Design new agents

Job description

Coverflex

Work changed. Pay didn’t.

Coverflex exists to make compensation work for everyone.
Pay is still rigid, fragmented, and hard to feel.
We turn compensation into choice - one platform, one card, one app - for benefits, meal allowance, insurance and more.

Our platform is simple for HR and meaningful for employees.
We provide choice, smarter compensation tools and empowerment.

⚙️ TL;DR (The Essentials)

Role: AI Engineer
Seniority Level: Mid
Type: Individual Contributor
Languages: English (main) / Portuguese, Spanish or Italian a plus
Main Tools:

→ OpenAI API

→ Vercel SDK or similar

→ Langsmith / Braintrust / Langfuse or similar

→ Cursor / Codex / Claude Code

Location: Remote (Europe only)
Compensation:

Base Salary: 50K to 60K gross annually
Equity: Yes – VSOPs
Benefits: See below
Contract Type: Permanent

💥 Your Impact

Your role will play a major role in our success because…

With our goals to scale AI, specially for expansion post-series B, we need AI that has some minimum threshold of reliability. Our AI already handles thousands of conversations per week and we expect that to increase significantly. We need dedicated effort in making sure that our AI is accurate, helpful and reliable.

You’ll know you’re successful when, after 90 days, you’ve…

1. Maintain evals to keep close to 100% accuracy and update them as we get new feedback

2. Monitor user feedback to ensure our agents have a high level of accuracy

3. Run new experiments to proactively increase the performance of current agents (test new prompts, models, agent architectures)

4. Keep up to date with AI developments

5. Design new agents

6. Maintain Evals infrastructure to run reliably and at reasonable time and cost in our CI/CD pipeline

How we’ll measure success:

1. Eval coverage exists for all production agents. Evals are updated shortly after new feedback. Accuracy stays above agreed threshold and matches real-world performance.

2. Feedback loop is active - issues are caught and logged. Time to detect accuracy problems decreases. Recurring issues are identified and addressed.

3. Number of experiments ran and documented per month, resulting in improvements shipped to production.

4. New agents go from concept to production. Agents meet quality bar on first iterations (fewer back-and-forth cycles).

⚡ Reality Check - What Makes This Role Hard

Let's be real - here's what makes this role challenging:

Biggest hurdle is the fact that this is a new field. Processes are being created and it requires faster adaptation to new developments in the industry. Also relevant is that we LLMs are inherently non-deterministic which can make the work more complex than with software.

👤 You

Must-haves (evidence, not years)

→ Functional coding in TypeScript or Python

→ Demonstrated experience with LLMs (personal project or professional)

→ Experience with creating AI agents

→ Familiarity with LLM providers and model trade-offs

Nice-to-have

→ Exposure to LLM observability tools

→ Experience with OSS models

→ Background in SW, product, QA, or data

🧬 Your DNA

Genuinely curious about AI — explores new models and tools for fun, not just work. More excited about making the model perform well than writing elegant code. Comfortable with ambiguity and iteration. Testing mindset: likes to measure, compare, and improve. Proactive in sharing findings and documenting learnings.

Who will probably find this frustrating…

Someone who sees this as a stepping stone to "real" ML/infra engineering. Pure coders who want to build systems but have no patience for prompt iteration. People who need clear specs to start working — this role requires experimentation. Anyone who thinks prompt engineering is "not real work".

👥 Manager & Team

Meet Your Manager

Hiring Manager: Fábio Domingues, Senior AI Engineer
Location: Portugal
LinkedIn Profile: https://www.linkedin.com/in/fjrdomingues/

Profile Snapshot:

Energy: Calm and steady. I genuinely enjoy the work — solving problems, making progress, seeing things improve. No artificial urgency or drama. I like to keep things relaxed while still moving forward.
Communication: Straightforward and low-ceremony. I value clarity over polish. I might skip the small talk and get straight to the point — not because I don't care, just because that's how I'm wired. I'm always open to questions and back-and-forth.
Feedback Style: Honest and collaborative. If something isn't working, I'll say it — but constructively, and I'll help fix it. I appreciate direct feedback in return. No egos, no politics

How to work with me - in the Manager's own words:

Relaxed but engaged. We'll share the workload, think through problems together, and iterate quickly. I don't have rigid processes — you'll have room to shape how we work. If you like autonomy, ownership, and a calm environment where the focus is on doing good work, we'll get along well.

Your Team

You’ll work day-to-day with:

Engineers, Product Manager, Product Designer, Engineering Manager

Key Stakeholders:

Bruno Oliveira, VP Engineering
João Caxaria, Head of AI
Vitor Pires, AI Engineer
Jorge Mestre, AI Engineer

💜 Access & Belonging (Equal Opportunity)

We hire for impact and potential, not pedigree.
We welcome applications from people with non-linear careers, career breaks, caregiving gaps, and those changing fields.

No discrimination on the basis of age, disability, gender identity/expression, marital or family status, pregnancy, neurodivergence, race/ethnicity, religion/belief, gender, sexual orientation, or any other protected ground.

Assessment fairness:
We anchor on evidence of outcomes (what you shipped, moved, or influenced).
We actively de-bias by using structured rubrics, multiple assessors, and blind screening most of the time (we won’t know your name, gender, or personal info until the interview stage).

📬 Application Clarity

No cover letter required.
Apply with your LinkedIn or upload your CV.
You may be asked a few short, relevant questions.

Total candidate time investment: ~3–5 hours end-to-end.

🧩 Hiring Stages (What to Expect, Why & How Long)

1. CV / LinkedIn Screen — Signal check vs must-haves
• You’ll hear from us within 7 business days.

2. Role-Fit Questionnaire (async)
Purpose: capture signals your CV can’t (languages, tools, scenario judgement) and calibrate seniority.
Format: multiple choice + short answers.
Accessibility: prefer a call? Tell us - we’ll swap for a short chat.

3. Hiring Manager Interview - Engineering Mindset deep-dive • 45–60 min\

4. Case / Work Sample - Walkthrough + Q&A • 20–30 min
You’ll get actionable feedback either way.

5. Cultural Interview - With People • 30 to 45min

6. Final Conversation (CEO / C-Level) — Values, strategy, and your growth • 30–45 min

Optional: References (2–3 people who’ve seen your recent work) - async.

⏱️ Speed & Communication

Decision: within 4 weeks of your application.
Updates: weekly if the process runs longer.
Scheduling: interviews between 10:00–16:00 CET (flexible across Europe).
Feedback: from the Case stage onwards, you’ll always receive written or verbal feedback - what went well, and what to strengthen next time.

Ready to apply?

APPLY

Share ·

Related jobs

Europe

Mesa West - Morgan Stanley, Real Estate Private Credit – Loan Originations, Analyst/Associate

30+ days ago

Morgan Stanley

Full time

Queer-Affirming Therapist (Fully Licensed, 1099, Remote/Telehealth)

30+ days ago

Edges Wellness Center LLC

Fixed term

Cognitive Behavioral TherapyDirectly Observed TherapyCase Management SystemsHolistic HealthTelepsychiatry

Technicien(ne) Concepteur(trice) Instrumentation et Contrôle

30+ days ago

WSP in Canada

Full time

Equipment Specification DevelopmentElectrical EngineeringAutoCADControl System DesignBuilding Inspections

Rekryterare till Datalenta

30+ days ago

Digitalenta

Full time

Data EngineeringData ArchitectureData ScienceIntelligent AgentNetwork Administration

Mobile Vehicle Inspector II, Manheim Express, Las Vegas, NV

30+ days ago

Cox Automotive Inc.

Full time

Damage AssessmentsMechanical InspectionMechanical InspectionAutomotive EngineeringSafety Standards

Other jobs at Coverflex

Account Executive (SMB)

30+ days ago

Coverflex

Full time
Mid-level (2-5 years)
30 - 42K

Sales ManagementSmall Business SalesSales ForecastingSocial MarketingLinkedIn Sales Navigator

Business Development Representative (BDR)

30+ days ago

Coverflex

Full time
20 - 25K

HubSpot CRMOutbound Sales SkillsPain AssessmentAftercareLinkedIn Sales Navigator

Spontaneous Application – Categorie Protette (L. 68/99)

30+ days ago

Coverflex

Full time

Artificial IntelligenceProduct ManagementGoogle DataCompensation AnalysisMarketing

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.

Ai Engineer - Performance Team

Key Facts

Hard Skills

Other Skills

Roles & Responsibilities

Requirements:

Job description

Coverflex

⚙️ TL;DR (The Essentials)

💥 Your Impact

⚡ Reality Check - What Makes This Role Hard

👤 You

Must-haves (evidence, not years)

Nice-to-have

🧬 Your DNA

👥 Manager & Team

Meet Your Manager

Your Team

💜 Access & Belonging (Equal Opportunity)

📬 Application Clarity

🧩 Hiring Stages (What to Expect, Why & How Long)

⏱️ Speed & Communication

Related jobs

Mesa West - Morgan Stanley, Real Estate Private Credit – Loan Originations, Analyst/Associate

Queer-Affirming Therapist (Fully Licensed, 1099, Remote/Telehealth)

Technicien(ne) Concepteur(trice) Instrumentation et Contrôle

Rekryterare till Datalenta

Mobile Vehicle Inspector II, Manheim Express, Las Vegas, NV

Other jobs at Coverflex

Account Executive (SMB)

Business Development Representative (BDR)

Spontaneous Application – Categorie Protette (L. 68/99)

We help you get seen. Not ignored.

Auto-Apply

AI Match Feedback