Logo for Alta Technology Group

Lead DevOps/MLOps Engineer

Key Facts

Remote From: 
Category:  Lead Developer
Full time
Senior (5-10 years)
English

Roles & Responsibilities

  • Experience operating production infrastructure at meaningful scale
  • Strong in practical DevOps execution and operational reliability
  • Focus on automation, observability, and deployment safety
  • Comfortable improving developer workflows and infrastructure tooling

Requirements:

  • Improve CI/CD pipelines, deployment workflows, and release reliability
  • Standardize infrastructure and deployment patterns across environments
  • Improve observability through logging, metrics, and monitoring
  • Support ML-oriented infrastructure including SageMaker and GPU scaling patterns

Job description


We're looking for a strong DevOps engineer who can help scale and operationalize our infrastructure as the platform grows. This is not a pure platform-architecture role — the focus is CI/CD, infrastructure automation, deployment reliability, observability, and GPU-oriented workload scaling.
What You'll Own
  • Improve CI/CD pipelines, deployment workflows, and release reliability
  • Standardize infrastructure and deployment patterns across environments
  • Improve observability through logging, metrics, tracing, dashboards, and rollout monitoring
  • Partner closely with backend engineering on:
    • deployment strategies
    • infrastructure automation
    • environment consistency
    • migration workflows
    • possible Kubernetes migration efforts
  • Support ML-oriented infrastructure as a secondary responsibility:
    • SageMaker workloads
    • Ray clusters
    • GPU scaling patterns
    • distributed batch execution
    • autoscaling behavior
    • runtime/image management
    • artifact delivery/versioning
The Kind of Problems You'll Work On
  • Deployment safety and rollback strategies
  • Infrastructure consistency across environments
  • Release automation and environment promotion flows
  • Autoscaling and runtime stability
  • GPU workload orchestration and scaling efficiency
  • Operational tooling that reduces friction for engineering teams
Stack
  • AWS
  • Terraform
  • Docker
  • Kubernetes
  • CI/CD systems
  • SageMaker
  • Ray
  • GPU compute infrastructure
You'll Probably Do Well Here If
  • You've operated production infrastructure at meaningful scale
  • You're strong in practical DevOps execution and operational reliability
  • You care about automation, observability, and deployment safety
  • You're comfortable improving developer workflows and infrastructure tooling
  • You've worked with distributed systems or GPU-oriented workloads before

Lead Developer Related jobs

Other jobs at Alta Technology Group

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.