This is a remote position.
This is a full-time contract position offering a daily rate. The role provides Tier-3 operational ownership for storage products within an on-premises production platform, ensuring high availability and performance for mission-critical business applications.
Fluent German and English (C1 level) are required.
Only occasional onsite visits in Germany.
Storage Product Ownership: Provide Tier-3 ownership for storage services, including file, block, and object storage (S3-like concepts), and ensure operational readiness for all storage-related changes.
Incident & Troubleshooting: Handle complex incidents and perform deep root cause analysis to drive permanent fixes and preventive measures within the storage infrastructure.
Kubernetes Integration: Manage and troubleshoot storage integration within Kubernetes environments, focusing on CSI driver concepts and the PV/PVC lifecycle.
Automation: Reduce operational toil by automating standard tasks such as capacity checks, validation procedures, and provisioning workflows.
Operational Readiness: Validate deployment artifacts and enforce quality assurance measures, including documented standard operating procedures and successful test reports.
Stability & Monitoring: Maintain monitoring and alerting coverage, performance baselines, and hardening strategies for storage services across multi-tenant environments.
Security & Compliance: Implement logging strategies to support audit requirements and perform routine security scans to remediate identified vulnerabilities.
Senior-level professional with 5+ years in IT storage operations or platform operations in mission-critical environments.
Expert knowledge of storage types (File, Block, Object) and protocols such as NFS and S3-like services.
Proven experience with storage virtualisation in enterprise environments.
Strong understanding of Kubernetes storage integration and CSI driver troubleshooting.
Demonstrated leadership in Incident, Problem, Change, and Release governance.
Technical proficiency with observability tools such as Prometheus, Grafana, Mimir, or Loki.
Familiarity with GitOps and Infrastructure as Code (IaC) concepts (Terraform, OpenTofu, ArgoCD) for governing deployment standards.
Experience with ITSM toolsets including Jira Service Management (JSM), Jira, and Confluence.
Proficiency in both speech and writing in English (at least C1).
Proficiency in both speech and writing in German (at least C1).
Eligibility Residency in the EU, EEC, UK, or Switzerland.

UST HealthProof

BP

KeyBank

Guidehouse

Medtronic

Interval Group

Interval Group

Interval Group