Logo for RxBenefits, Inc.

Director, DevOps & Cloud Infrastructure

Roles & Responsibilities

  • 10+ years hands-on experience in DevOps, SRE, cloud engineering and infrastructure.
  • 5+ years in a Director or senior leadership role managing managers and large teams.
  • Deep expertise in cloud platforms (AWS, Azure) and modern toolchains: CI/CD (GitHub Actions, GitLab CI, Jenkins, ArgoCD), containers (Kubernetes, Docker, Helm), IaC (Terraform, Pulumi, Crossplane), and monitoring/observability tooling.
  • Strong communication, strategic thinking, and cross-functional collaboration; proven ability to drive cultural change and scale high-performance teams.

Requirements:

  • Own uptime, availability, scalability, and performance of all production systems; define and manage SLOs/SLAs, error budgets, and incident response; lead post-incident reviews and drive systemic reliability improvements; implement observability standards.
  • Own cloud infrastructure strategy (AWS, Azure, hybrid); lead infrastructure-as-code; ensure disaster recovery, backup, and business continuity tested and compliant; monitor and optimize cloud spend through cost governance and FinOps practices.
  • Enable CI/CD pipelines, deployment automation, and release strategies; enable safe, frequent releases (blue/green, canary, feature flags); standardize DevOps tooling and platform capabilities; partner with Engineering to remove friction and increase delivery velocity.
  • Embed security into DevOps practices (DevSecOps); partner with Security, Legal, and Compliance on audits and certifications (SOC 2, HIPAA, HITRUST, PCI); ensure secrets management, access controls, and vulnerability remediation.

Job description

The Director of DevOps and Cloud Infrastructure is a senior leadership role that combines deep technical expertise in DevOps/Cloud/SRE practices with strategic business alignment, team leadership, and organizational transformation. This position has evolved significantly through 2026, with strong emphasis on cloud-native architectures, cost optimization, AI/ML infrastructure support, GitOps, security/governance at scale, and driving faster, more reliable software delivery while aligning with business outcomes. This role owns Cloud infrastructure, development environment strategy, platform reliability, CI/CD, cloud cost optimization and management, and operational excellence, enabling engineering teams to deliver software faster and more safely.

 

This leader will balance hands-on technical depth with strategic leadership, setting standards, tooling, and operating models while growing and mentoring a high-performing DevOps/SRE organization.

 

Essential Job Responsibilities Include:

Platform & Reliability

  • Own uptime, availability, scalability, and performance of all production systems.
  • Define and manage SLOs, SLAs, error budgets, and incident response practices.
  • Lead post-incident reviews and drive systemic reliability improvements.
  • Implement observability standards (logging, metrics, tracing).

Cloud & Infrastructure

  • Own cloud infrastructure strategy (AWS, Azure, hybrid).
  • Lead infrastructure-as-code (Terraform, CloudFormation, ARM, etc.).
  • Working with GRC, Ensure disaster recovery, backup, and business continuity plans are tested and compliant.
  • Monitor and optimize cloud spend through cost governance and FinOps practices.

DevOps, CI/CD & Engineering Enablement

  • Own CI/CD pipelines, deployment automation, and release strategies.
  • Enable safe, frequent releases (blue/green, canary, feature flags).
  • Standardize DevOps tooling and platform capabilities across teams.
  • Partner with Engineering to remove friction and increase delivery velocity.

 

Monitoring, Alerting, and Event Management

  • Set plan and manage execution of dashboards, availability management and reporting.
  • Align with Product Engineering teams to define NFRs related to definition, instrumentation and logging


Security & Compliance

  • Embed security into DevOps practices (DevSecOps).
  • Partner with Security, Legal, and Compliance on audits and certifications (SOC 2, HIPAA, HITRUST, PCI, etc.).
  • Ensure secrets management, access controls, and vulnerability remediation.

Leadership & Strategy

  • Build and lead DevOps, SRE, and Cloud Engineering teams.
  • Define the DevOps operating model (centralized, embedded, hybrid).
  • Establish KPIs for reliability, deployment frequency, MTTR, and cost efficiency.
  • Partner closely with Engineering, Product, IT, Security, and Data teams.
  • Contribute to long-term technology and architecture roadmap.

 

These areas are separated into two teams with subordinate Managers:

  1. Cloud/Infrastructure Operations
  2. DevOps

 

Required Skills / Experience:

 

  • 10+ years of hands-on experience in DevOps, SRE, cloud engineering, and infrastructure.
  • 5+ years as a Director in leadership/people management role (leading managers and/or large teams)
  • Deep expertise in modern tools and practices:
    • Cloud platforms (AWS, Azure).
    • CI/CD (GitHub Actions, GitLab CI, Jenkins, ArgoCD).
    • Containers & orchestration (Kubernetes, Docker, Helm).
    • Infrastructure as Code (Terraform, Pulumi, Crossplane).
    • Monitoring/Observability (DataDog, Sumo, Grafana, ELK, Datadog, New Relic).
    • Scripting/automation (Python, Go, Bash).
  • Strong understanding of Agile/Scrum/SAFe methodologies.
  • Proven track record of building high-performance teams and driving cultural change.
  • Excellent communication, strategic thinking, and cross-functional collaboration skills.
  • Experience with large-scale, high-availability environments.


Preferred Skills/Experience:

  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator, Terraform Associate, SRE-related).
  • Experience in regulated industries (healthcare) or with ML/AI infrastructure.
  • Background in cost management and cloud financial operations (FinOps).

Related jobs

Other jobs at RxBenefits, Inc.

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.