Senior Manager of Site Reliability Engineering

Remote: Full Remote
Salary: 129 - 174K yearly
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • Bachelor’s degree in Computer Science, IT, or a related field
  • 8+ years in site reliability engineering
  • 5+ years in a leadership role
  • Strong knowledge of Kubernetes, Docker, and Terraform
  • Experience with AWS and/or Azure

Key responsibilities:

  • Lead and mentor SRE team members
  • Implement monitoring and observability tools
  • Optimize system performance and reliability
  • Manage reliability enhancement projects
  • Provide updates to stakeholders on project progress
WebPT https://www.webpt.com
501 - 1000 Employees

Job description

Who We Are Looking For

We are seeking a highly skilled and experienced Senior Manager of Site Reliability Engineering (SRE) to lead our SRE team. The ideal candidate will have a strong background in site reliability engineering, with a proven track record of managing large-scale, highly available systems. This role requires excellent leadership, strategic planning, and technical skills to ensure the reliability, performance, and security of our infrastructure.

What You’ll Be Doing As A Part of Our Team

  • Team Leadership:
    • Lead, mentor, and develop a team of database administrators and engineers.
    • Align team efforts with company goals and strategic initiatives.
  • Reliability Engineering:
    • Implement and maintain monitoring and observability tools to ensure comprehensive visibility into system health.
    • Collaborate with Architecture and Engineering teams to design and implement scalable, reliable infrastructure architectures.
    • Optimize system performance through tuning and industry best practices.
    • Work closely with operations, development, and other teams to ensure seamless system integration and performance tuning.
    • Implement best practices for incident management, post-incident analysis, and measures to prevent recurrence.
    • Continuously enhance processes to improve system reliability and performance.
    • Drive automation initiatives to minimize manual intervention and boost efficiency.
  • Project Oversight:
    • Manage reliability enhancement projects from planning through execution.
    • Lead and continuously improve controls and practices related to availability and disaster recovery.
    • Coordinate with cross-functional teams to ensure project deadlines and budgets are met.
    • Provide regular updates to stakeholders on project progress.

What You Should Have To Qualify

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 8+ years of experience in site reliability engineering or a related field.
  • 5+ years of experience in a leadership or managerial role.
  • Strong knowledge of infrastructure and configuration management systems (Kubernetes, Docker, Terraform).
  • Experience with cloud-based solutions (AWS and/or Azure preferred).
  • Proficiency in observability, performance tuning and optimization.
  • Excellent problem-solving and analytical skills.
  • Strong communication and interpersonal skills.
  • Experience with ITIL’s incident management, problem management, and Continuous Service Improvement functions.
  • Must have experience managing and collaborating with globally distributed teams.

Ideally, You Would Also Have These

  • Certifications in cloud administration, development, or SRE.
  • Experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack).
  • Knowledge of DevOps practices and tools.
  • Keen proponent of automation and continuous improvement.

Culture is at our Core

  • Service: Create Raving Fans
  • Accountability: F Up; Own Up
  • Attitude: Possess True Grit
  • Personality: Be Minty
  • Work Ethic: Be Rock Solid
  • Community Outreach: Give Back
  • Health and Wellness: Live Better
  • Resource Efficiency: Do Más With Menos

About Us

Here, we work hard—but we have lots of fun doing it. We believe in equal opportunity for all, autonomy, trailblazing, and always doing right by our Members. Most importantly, though, we believe in empowering rehab therapy professionals to achieve greatness in practice. So, if you’re a can-do kinda person who loves to help Members win and enjoys working from just about anywhere—then you’ll fit right in. We’ve got big plans, but we can’t achieve them without you. Join us, and let’s achieve greatness.

Company Perks

  • Ample Time Off for fun and rest
  • Work from nearly anywhere in the US
  • WFH supply budget
  • Time Off to make an impact through volunteering
  • Multiple Employee Resource Groups (ERGs)
  • Health, Dental, Vision, 401k, HSA, and many other benefits
  • Authenticity and Acceptance

At WebPT, we're dedicated to fair and competitive compensation based upon our industry peer benchmarks. While job postings offer a pay range as a general reference, the final offer depends on candidate qualifications and experience. Our aim is to provide equitable compensation that recognizes your unique skills and contributions. During interviews, we'll discuss your qualifications and expectations, striving for a competitive and fair offer. The initial hiring range for this position is: $128,900 - $174,000.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Analytical Skills
  • Social Skills
  • Leadership
  • Strategic Planning
  • Verbal Communication Skills
  • Problem Solving
