Offer summary

Qualifications:

Proven experience in infrastructure and reliability engineering, including deployment automation and monitoring., Solid programming skills in Python, Go, or TypeScript, with production-grade coding experience., Familiarity with cloud-native development and AWS services, especially AI/ML-related services., Experience with Kubernetes, CI/CD pipelines, Infrastructure-as-Code tools like Terraform, and containerization using Docker..

Key responsibilities:

Design and develop platform components for machine learning and GenAI use cases.

Manage deployment, maintenance, monitoring, and incident response for services.

Collaborate with ML Engineers, SREs, and platform teams to ensure operability and scalability.

Participate in code reviews, documentation, and team decision-making processes.

Job description

About the opportunity

We are seeking a Site Reliability Engineer to join the Platform Engineering domain in the AI Platform team.

The mission of Platform Engineering is to provide trusted, performant, selfservice platforms that empower product teams to build the bank the world loves to use. The AI Platform team contributes to this mission by creating scalable, secure, and compliant infrastructure solutions that support MLOps and GenAI capabilities.

As one of the first banks completely hosted in the cloud, our security, resilience, and productivity standards require not only the use of a modern technology stack but also building teams in line with our principles, supporting our product teams, the company, and our customers.

In this role, you will:

Contribute to the design and development of platform components that enable machine learning and generative AI use cases across the company

Take ownership of reliable deployment, maintenance, monitoring, and incident response for our services

Write highquality, maintainable code and help ensure our platform solutions are welldocumented and testable

Work alongside more senior engineers to evolve our infrastructure and build secure, compliant and scalable solutions across cloud, networking, observability and CICD domains

Collaborate with ML Engineers, SREs, and other Platform teams to ensure operability and maintainability of AI capabilities offered across the company
Participate in code reviews, RFCs, documentation and product discovery, contributing to the teams design and decisionmaking processes

Identify technical or knowledge gaps and proactively work to address them, either independently or with the team

Help improve our engineering practices and team ways of working

What you need to be successful

Background and skills:

Proven experience specifically in infrastructure and reliability engineering, including deployment automation, monitoring, incident management, and performance tuning

Solid programming skills, ideally in Python, Go or TypeScript, and experience writing productiongrade code

Familiarity with cloudnative development and AWS infrastructure, including some experience with services like SageMaker, Bedrock, or other AIMLrelated services

Experience with Kubernetes, CICD pipelines (e.g. ArgoCD, GitHub Actions), InfrastructureasCode tools (e.g. Terraform), and containerization (Docker)

Working knowledge of networking, security and compliance best practices in production environments

Appreciation for good documentation, testing and observability

Nice to have:

Exposure to MLOps practices or working with Data ScienceMachine Learning teams

Familiarity with promptbased or LLMdriven GenAI workflows

Interest or prior experience in building developerfacing platforms and reusable abstractions

Traits:

You take pride in writing clean, reliable, and welltested code

You’re a proactive team player who communicates openly and supports others

Comfortable working in a crossfunctional environment, with a focus on practical impact

Eager to learn from others and also share your knowledge with newer team members

Excited by the opportunity to work on cuttingedge AI platform capabilities while developing your expertise across a mix of SRE, ML, and platform domains

Were looking for someone who wants to grow their career in the AI infrastructure space, bringing battletested SRE practices to a crossfunctional team while also learning from experienced engineers in Machine Learning, Backend, and Platform Engineering. Youll have the chance to contribute meaningfully to the evolution of the AI platform that’s at the heart of the company’s highest value bets, helping to build a foundation for reliable, scalable and democratized access to Machine Learning and GenAI capabilities across N26.

If youre motivated by ownership, curious about AI and infrastructure, and energized by working in a highimpact, collaborative team: we’d love to hear from you!

What’s in it for you:

Accelerate your career growth by joining one of Europe’s most talked about disruptors 🚀.

Employee benefits that range from a competitive personal development budget, work from home budget, discounts to fitness & wellness memberships, language apps and public transportation.

As an N26 employee you will have access to a Premium subscription on your personal N26 bank account. As well as subscriptions for friends and family members.

Vacation days vary depending on your location of work. Additional day of annual leave for each year of service.

A high degree of autonomy and access to cutting edge technologies all while working with a friendly team of peers of diverse nationalities, life experiences and family statuses.

A relocation package with visa support for those who need it.

Who we are

N26 has reimagined banking for today’s digital world. Technology and design empower everything we do and it’s how we are building the global banking platform the world loves to use.

Weve eliminated physical branches, paperwork, and hidden fees for an elegant digital experience and supreme savings. Giving people the power to live and bank their way is what gets us out of bed in the morning and inspires the work that we do.

Founded in 2013, N26 now has more than 8 million customers in 24 markets. We are headquartered in Berlin with offices in multiple cities across Europe, including Vienna and Barcelona, and a 1,500strong team of more than 80 nationalities.

Sounds good? Apply now for this position.

N26 is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status or disability status.

Required profile

Are you interested?

Site Reliability Engineer (SRE) Related jobs

Senior Site Reliability Engineer, NIM Factory

Today

NVIDIA

Full time

MicroservicesCloud ComputingDocker (Software)Site Reliability Engineering

Senior Site Reliability Engineer - Observability and Telemetry Platform

Today

NVIDIA

Full time

Python (Programming Language)KubernetesOpenStackInfrastructure Automation

Senior Site Reliability Storage Engineer - GPU Clusters

Today

NVIDIA

Full time

ContainerizationCloud ComputingComputer Data StorageDistributed File Systems

Principal Site Reliability Engineer, AI Infrastructure

Today

NVIDIA

Full time

KubernetesPython (Programming Language)UnixSite Reliability Engineering

Principal Architect, Site Reliability Engineering - GeForce Now

Today

NVIDIA

Full time

MicroservicesKubernetesDistributed ComputingCloud Computing

See more Site Reliability Engineer (SRE) jobs

Site Reliability Engineer AI Platform

Offer summary

Qualifications:

Key responsibilities:

Job description

Required profile

Experience

Hard Skills

Other Skills

Site Reliability Engineer (SRE) Related jobs

Senior Site Reliability Engineer, NIM Factory

Senior Site Reliability Engineer - Observability and Telemetry Platform

Senior Site Reliability Storage Engineer - GPU Clusters

Principal Site Reliability Engineer, AI Infrastructure

Principal Architect, Site Reliability Engineering - GeForce Now