Logo for Interval Group

T3 Operations Specialist — Compute & OS (PID9057)

Roles & Responsibilities

  • 5–10+ years of IT operations, platform operations, or service delivery experience in mission-critical environments.
  • Proven experience leading Incident, Problem, Change, and Release governance in production.
  • Strong background in modern platform operations, including Kubernetes, containerisation, automation, and hands-on familiarity with observability stacks (Prometheus, Grafana, Mimir, Loki).
  • Fluency in English and German at C1 level and EU/EEA/UK/Switzerland residency eligibility; experience with ITSM tools (Jira/JSM, Jira, Confluence) and GitOps/IaC concepts (Terraform, OpenTofu, ArgoCD).

Requirements:

  • Tier-3 Operations: Drive operational ownership for Compute OS services, handling complex incidents, deep troubleshooting, and root cause analysis to implement permanent fixes.
  • Operational Readiness: Validate deployment artifacts and ensure infrastructure readiness for releases, including hardening, patch strategies, and rollback procedures.
  • Stability Monitoring: Maintain system health and performance baselines across multi-tenant environments, ensuring robust monitoring and alerting coverage.
  • Automation SRE: Execute and improve standard operational procedures through automation to reduce toil and improve MTTR.

Job description

This is a remote position.

This is a full-time contract position offering a daily rate. The role provides Tier-3 operational ownership for Compute and Operating System services within a mission-critical production platform, ensuring high availability and performance for a private cloud infrastructure.

Fluent German and English (C1 level) are required. Only occasional onsite visits in Germany.

Responsibilities

  • Tier-3 Operations: Drive operational ownership for Compute & OS services, handling complex incidents, deep troubleshooting, and root cause analysis to implement permanent fixes.

  • Operational Readiness: Validate deployment artifacts and ensure infrastructure readiness for releases, including hardening, patch strategies, and rollback procedures.

  • Stability & Monitoring: Maintain system health and performance baselines across multi-tenant environments, ensuring robust monitoring and alerting coverage.

  • Automation & SRE: Execute and improve standard operational procedures through automation to reduce toil and improve Mean Time to Recovery (MTTR).

  • Technical Coordination: Collaborate with Kubernetes, Data, and Storage SMEs to resolve cross-domain production issues and ensure seamless application hosting.

  • Governance: Enforce quality assurance measures and document standard operation procedures and runbooks to ensure high-quality service delivery.

  • Security & Compliance: Implement logging strategies to support audit requirements and perform routine security scans to remediate vulnerabilities.



Requirements



  • Senior-level professional with 5–10+ years in IT operations, platform operations, or service delivery within mission-critical environments.

  • Proven experience leading Incident, Problem, Change, and Release governance in production.

  • Expertise with ITSM tools, specifically Jira Service Management (JSM), Jira, and Confluence.

  • Strong background in modern platform operations, including Kubernetes, containerisation, and automation.

  • Hands-on experience with observability stacks such as Prometheus, Grafana, Mimir, and Loki.

  • Proficiency in platform delivery concepts, including GitOps and Infrastructure as Code (Terraform, OpenTofu, ArgoCD).

  • Experience managing SLI/SLA/SLO tracking and gathering operational insights.

  • Familiarity with enterprise DevOps toolchains (e.g., GitLab, JFrog Artifactory, Harness).

  • Proficiency in both speech and writing in English (at least C1).

  • Proficiency in both speech and writing in German (at least C1).

  • Eligibility Residency in the EU, EEC, UK, or Switzerland.




Benefits

As a freelancer / contractor with us, you will enjoy flexible working hours and the freedom to choose your own projects. Our platform gives you access to exciting projects in various industries and supports you in advancing your career. You'll benefit from competitive pay and a dedicated team to help you with any questions you may have. Work independently and utilise our strong network to achieve your professional goals.



Operations Specialist Related jobs

Other jobs at Interval Group

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.