This is a remote position.
This is a full-time contract position offering a daily rate. The role provides Tier-3 operational ownership for Compute and Operating System services within a mission-critical production platform, ensuring high availability and performance for a private cloud infrastructure.
Fluent German and English (C1 level) are required. Only occasional onsite visits in Germany are expected.
Tier-3 Operations: Drive operational ownership for Compute & OS services, handling complex incidents, deep troubleshooting, and root cause analysis to implement permanent fixes.
Operational Readiness: Validate deployment artifacts and ensure infrastructure readiness for releases, including hardening, patch strategies, and rollback procedures.
Stability & Monitoring: Maintain system health and performance baselines across multi-tenant environments, ensuring robust monitoring and alerting coverage.
Automation & SRE: Execute and improve standard operational procedures through automation to reduce toil and improve Mean Time to Recovery (MTTR).
Technical Coordination: Collaborate with Kubernetes, Data, and Storage SMEs to resolve cross-domain production issues and ensure seamless application hosting.
Governance: Enforce quality assurance measures and document standard operating procedures and runbooks to ensure high-quality service delivery.
Security & Compliance: Implement logging strategies to support audit requirements and perform routine security scans to remediate vulnerabilities.
Senior-level professional with 5–10+ years in IT operations, platform operations, or service delivery within mission-critical environments.
Proven experience leading Incident, Problem, Change, and Release governance in production.
Expertise with ITSM tools, specifically Jira Service Management (JSM), Jira, and Confluence.
Strong background in modern platform operations, including Kubernetes, containerisation, and automation.
Hands-on experience with observability stacks such as Prometheus, Grafana, Mimir, and Loki.
Proficiency in platform delivery concepts, including GitOps and Infrastructure as Code (Terraform, OpenTofu, ArgoCD).
Experience managing SLI/SLA/SLO tracking and gathering operational insights.
Familiarity with enterprise DevOps toolchains (e.g., GitLab, JFrog Artifactory, Harness).
Proficiency in both speech and writing in English (at least C1).
Proficiency in both speech and writing in German (at least C1).
Eligibility: Residency in the EU, EEC, UK, or Switzerland.
As a freelancer / contractor with us, you will enjoy flexible working hours and the freedom to choose your own projects. Our platform gives you access to exciting projects in various industries and supports you in advancing your career. You'll benefit from competitive pay and a dedicated team to help you with any questions you may have. Work independently and utilise our strong network to achieve your professional goals.
