Logo for Upshop

SRE / DevOps Manager

Roles & Responsibilities

  • 10+ years of experience in DevOps, SRE, or infrastructure engineering.
  • 2+ years in a leadership or managerial role.
  • 3+ years of experience with cloud platform deployments (AWS, GCP, Azure).
  • 3+ years of experience working with MongoDB and Cosmos DB.

Requirements:

  • Team Leadership: Manage and mentor a team of SRE and DevOps engineers; drive hiring, onboarding, and professional development; set clear goals and performance metrics.
  • Reliability and Incident Management: Own system uptime, performance, and reliability; lead incident response and root cause analysis; define and monitor SLAs, SLOs, and SLIs.
  • Infrastructure Automation: Oversee Azure cloud infrastructure; implement Infrastructure as Code (IaC) using Terraform; drive automation of CI/CD pipelines and DevSecOps processes (integrating with Azure DevOps, GitLab, etc.).
  • Monitoring, Observability, and Security: Implement and maintain monitoring, alerting, and logging systems (Datadog, Prometheus, Grafana, ELK); ensure infrastructure security and compliance with industry standards; collaborate with InfoSec on audits and vulnerability management.

Job description

About the Role

We are seeking a seasoned SRE / DevOps Manager to lead our reliability and operations engineering team. You will be responsible for ensuring the scalability, security, and performance of our infrastructure while fostering a culture of automation, ownership, and continuous improvement.

At Upshop, we believe that great businesses are built by great people. Our People function is at the heart of our company’s growth, ensuring we attract, develop, and retain A Players who drive our mission forward.

Our Values:

  • Extremely Accountable
  • Customer Obsessed
  • Always Innovating
  • Demand Excellence
  • Biased for Action

Key Responsibilities

Team Leadership

  • Manage and mentor a team of SRE and DevOps engineers.
  • Drive hiring, onboarding, and professional development.
  • Set clear goals and performance metrics.

Reliability & Incident Management

  • Own system uptime, performance, and reliability.
  • Lead incident response and root cause analysis.
  • Define and monitor SLAs, SLOs, and SLIs.

Infrastructure & Automation

  • Oversee cloud infrastructure (Azure).
  • Implement Infrastructure as Code (IaC) using tools like Terraform or other similar tools
  • Drive automation of CI/CD pipelines and operational tasks.
  • Build and manage a DevSecOps process to connect CI/CD pipelines with AzureDevOps, Gitlab etc.

Monitoring & Observability

  • Implement and maintain monitoring, alerting, and logging systems.
  • Use tools like Datadog or other similar tools like Prometheus, Grafana, ELK stack.

Security & Compliance

  • Ensure infrastructure security and compliance with industry standards.
  • Collaborate with InfoSec teams on audits and vulnerability management.

Cross-functional Collaboration

  • Work closely with software engineering, product, and QA teams.
  • Advocate for DevOps and SRE best practices across the organization.

Qualifications

  • 10+ years of experience in DevOps, SRE, or infrastructure engineering.
  • 2+ years in a leadership or managerial role.
  • 3+ years of expertise with Cloud platform deployments
  • 3+ years of experience working with MongoDB and cosmosdb
  • Strong experience with cloud platforms (AWS, GCP, Azure).
  • Proficiency in scripting languages (Power shell scripting, Python, Bash, Go).
  • Hands-on experience with Kubernetes, Docker, CI/CD tools.
  • Excellent communication and leadership skills.

Preferred Qualifications

  • Experience with compliance frameworks (SOC 2, ISO 27001).
  • Familiarity with Agile and DevOps methodologies.
  • Certifications in cloud technologies or DevOps practices.

Benefits/Perks

  • Hybrid Opportunity
  • Competitive salary
  • Employer-matched 401(k) plan
  • Attractive paid time off policy
  • Career growth and development opportunities

Related jobs

Other jobs at Upshop

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.