Logo for SimScale

Senior SRE / Platform Engineer (m/f/d)

Roles & Responsibilities

  • 5+ years of professional experience in SRE, platform, or infrastructure engineering
  • Software development experience in at least one of Python, Go, Rust, or Java
  • Strong systems foundation with understanding of Linux internals and distributed systems
  • Hands-on cloud and infrastructure experience including AWS, Terraform, and Kubernetes

Requirements:

  • Evolve the Kubernetes platform and coordinate platform evolution across multiple engineering teams
  • Drive organization-wide adoption of OpenTelemetry for distributed tracing and metrics
  • Support the transition to a global, multi-cloud architecture for disaster recovery and data residency
  • Improve tooling for AWS account provisioning and infrastructure management

Job description

The Role

We are looking for a Senior SRE / Platform Engineer (m/f/d) to own and improve the cloud infrastructure behind SimScale's browser-based simulation platform. The role spans AWS and EKS, observability, disaster recovery, security and compliance controls, multi-region architecture, elastic GPU/HPC capacity, and internal developer tooling.

SimScale's engineering teams run workloads directly on AWS; you will build the standards, guardrails, and self-service tooling that let them do so safely, raising reliability and security without slowing engineering velocity. You will join a small, tightly knit infrastructure team supporting 50+ engineers across the company. This is a hands-on senior individual contributor role; people management is not required, but there is a genuine path toward tech-lead ownership as the team grows.

Your Opportunity

  • Evolve our Kubernetes platform: Evaluate and adopt technologies such as Kubernetes Gateway API and service mesh patterns, and coordinate platform evolution across 10+ engineering teams.
  • Take observability to the next level: Drive organization-wide adoption of OpenTelemetry for distributed tracing and metrics, and help teams define meaningful SLOs.
  • Shape multi-region architecture and data residency: Support our move from an EU-centered footprint toward a global, multi-cloud architecture that satisfies disaster-recovery and data-residency requirements.
  • Own cloud cost and efficiency at scale: Keep petabyte-scale infrastructure cost-efficient, secure, and well-instrumented.
  • Improve tooling: Build self-service AWS account provisioning, guardrails and AI-assisted automations that help engineering teams manage infrastructure safely and efficiently at scale.

What We Expect from You

  • 5+ years of professional experience in SRE, platform, or infrastructure engineering.
  • Software development experience: Your background is rooted in software development, and you moved into SRE from there. You write production-quality software in at least one of Python, Go, Rust, or Java.
  • Strong systems foundation: You understand Linux internals and distributed systems well enough to debug complex production behavior.
  • Hands-on cloud and infrastructure experience: AWS (or GCP), declarative infrastructure (Terraform), gitops-workflow (ArgoCD) and container orchestration (Kubernetes).
  • Observability and reliability experience: You have worked with OpenTelemetry, Prometheus, distributed tracing, monitoring, and meaningful SLOs/SLIs.
  • Production debugging depth: You can investigate complex failures, communicate clearly during incidents, and turn findings into durable improvements.
  • Security and compliance awareness: You understand how infrastructure decisions affect access control, auditability, disaster recovery, logging, and standards such as SOC 2.
  • Clear communication: You can explain trade-offs to engineering teams and help others adopt better platform practices without unnecessary friction.

Bonus Points

  • An open source portfolio or contributions.
  • Prior technical leadership experience, especially in infrastructure, reliability, or platform engineering.

Location: Remote (within CET ±5h)

What you can expect from us 

  • Join a dedicated, supportive team with unlimited growth opportunities and leadership potential
  • Make an impact quickly by sharing ideas and contributing to creative, goal-oriented projects
  • Work in a diverse, inclusive environment with colleagues from over 35 countries
  • Enjoy flexible hours and the freedom to work remotely from anywhere in the world
  • Access comprehensive health coverage, retirement plans, paid time off, and wellness support
  • Enjoy fresh office lunches or gift cards as a remote employee
  • Grow as a professional with online/offline learning, language courses, and tech talks
  • Connect at team events, join support groups, and contribute to our ESG and DE&I initiatives
  • Participate in fun team challenges and competitions for added excitement and team spirit

Diversity, Equity and Inclusion at SimScale

At SimScale, we look beyond borders and hire great talent from all parts of the world. With our team consisting of people from various backgrounds, we truly embrace diversity and encourage everyone to be themselves. We are unified by curiosity, dedication and our team spirit! As an equal opportunity employer, we acknowledge that our employees have different aspirations and career goals, and therefore are committed to create a diverse environment. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. A copy of SimScale's full recruiting guideline can be made available on request. Kindly let us know how you would like to be addressed and whether you have specific requirements for the interview. 

Platform Engineer Related jobs

Other jobs at SimScale

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.