Senior Manager, Site Reliability Engineering

Remote: 
Full Remote

Offer summary

Qualifications:

  • 5+ years of experience leading Site Reliability Engineering (SRE) teams with a proven track record of success.
  • Hands-on experience with Linux systems, preferably Debian-based, and proficiency in programming languages like Go and Python.
  • In-depth knowledge of cloud SRE best practices and platforms such as GCP, AWS, and Azure.
  • Excellent communication skills and experience collaborating with diverse, global teams.

Key responsibilities:

  • Oversee day-to-day SRE operations, including managing on-call schedules and deployment tasks.
  • Collaborate with Support and Engineering teams to foster a culture of accountability and mutual respect.
  • Lead SRE projects from initiation to completion, focusing on quality, timing, and operational readiness.
  • Attract, retain, and nurture a standout SRE team, promoting a culture of innovation and collaboration.

Platform.sh Internet Scaleup https://linkstre.am/platform.sh
201 - 500 Employees

Job description

About Platform.sh

Platform.sh is a Platform-as-a-Service (PaaS) that removes the complexities of cloud infrastructure management and optimizes development-to-production workflows, reducing the time it takes to build and deploy applications. It delivers efficiency, reliability, and security, giving development teams both control and peace of mind. Built for developers, by developers.

Adopted and loved by 16,000+ developers and 7,000 customers, Platform.sh has for nearly a decade provided innovative capabilities that serve as the launchpad for creative development teams’ out-of-the-box thinking.

We provide 24x7 support, managed cloud infrastructure, and automated security and compliance with an all-in-one PaaS. We give our customers complete control over their data by keeping applications secure and available around the clock.

Platformers are a remote, global workforce, and we thrive in a multicultural team. We are committed to open source and an open, welcoming environment. Our team spans the globe and the experience spectrum. What's our commonality, our cultural fabric? A curious spirit and a thirst for knowledge; an eagerness for innovative ideas and cultures. We believe we can build anything together in an environment that frees you to do your best work.

Bring your expertise and enthusiasm to our growing, global organization. Your contributions, collaboration, and unique point of view are recognized and valued here.

Impact of a Senior Manager, Site Reliability Engineering

As a Senior Manager, Site Reliability Engineering, you lead the Site Reliability Engineering (SRE) team in the Americas timezone. You are responsible for growing and supporting a team of engineers, shaping team strategy, and evolving our SRE practices in close collaboration with peers across the company.

You focus on building and improving the processes that support monitoring, automation, alerting, and incident management. You help define the future of SRE at Platform.sh. You ensure our systems are reliable, our operations are efficient, and our teams are set up to scale.

Your leadership helps drive the success and resilience of our SRE organization.

What to expect
  • Oversee day-to-day SRE operations: Manage on-call schedules and deployment tasks.
  • Collaborate across teams: Work closely with Support and Engineering to foster a culture of accountability and mutual respect.
  • Deploy and maintain SRE tooling: Take charge of deploying and maintaining tooling, actively contributing to our mission of reliability and transparency.
  • Lead innovation and technology assessment: Engage in assessing new technologies, leading innovation in forward-thinking SRE strategies.
  • Drive project delivery: Lead SRE projects from initiation to completion, focusing on quality, timing, and operational readiness, with a dynamic and proactive approach.
  • Build and grow your team: Attract, retain, and nurture a standout SRE team, defining clear goals and promoting a culture of innovation and collaboration.
  • Enable agile feedback loops: Use agile leadership to establish swift feedback loops with the business, ensuring efficient and effective problem-solving.
  • Manage external partnerships: Cultivate strong connections with external partners and vendors, ensuring alignment with internal and external priorities.
What you bring
  • SRE leadership and strategy: 5+ years leading SRE teams with proven success driving initiatives and communicating across the company. Extensive experience in day-to-day SRE operations and developing strategies aligned with business goals.
  • Linux expertise: Hands-on experience with Linux systems, preferably Debian-based, including troubleshooting and tuning.
  • Engineering background and programming skills: Hands-on experience in software development and infrastructure engineering, with proficiency in Go and Python for building automation and tools.
  • Communication and collaboration: Excellent skills working with diverse, global teams, including stakeholder management and clear articulation of complex concepts.
  • Team leadership and mentoring: Passion for leading and mentoring a diverse, global team of engineers, with a track record of guiding both individual contributors and leaders.
  • Cloud infrastructure knowledge: In-depth experience with cloud SRE best practices and platforms such as GCP, AWS, and Azure.
  • Global cross-functional collaboration: Proven ability to collaborate closely with engineering, support, and other teams worldwide.
Where we hire

At Platform.sh, remote work isn't just a trend - it's our foundation. The freedom of remote work with the support of a diverse, global team has been our successful model for nearly a decade. Our culture celebrates flexibility and collaboration, and while we have team members in over 30 countries around the globe, we are currently focused on hiring for this role in Canada. Although we’re unable to provide visa sponsorship at this time, we welcome applications from all qualified candidates who are legally authorized to work in Canada. 

How we hire

We know that a great hire won’t meet every requirement that we’ve outlined. If you can see yourself elevating the team, we want to hear your story. Few of us would be here had we not taken a chance.

You can expect 4 interviews on Google Meet, in the order below. Should you successfully move through the entire process, you will have the opportunity to meet with a variety of Platformers. Our goal is to ensure you can make the most informed decision on whether this role and our culture align with what you’re looking for in your future working environment.

  1. 45 Minutes with Talent Acquisition 
  2. 60 Minutes with Hiring Manager (Senior Director, Site Reliability Engineering)
  3. 60 Minutes with Team (Senior Director, Site Reliability Engineering; Director, Site Reliability Engineering)
  4. 45 Minutes with Executive (CTO)

All roles require background checks.

What we offer

💡 A product you can believe in - Join us in transforming how businesses build and manage web applications, driven by making a positive impact as a proud B Corp.

🏆 An Award-Winning Workplace - We’ve been recognized by Forbes’ Top 30 Companies for Remote Jobs and France’s Best Workplaces for Women.

🗣️ A culture that values your voice - Join a flexible, open, and inclusive work environment where your voice is encouraged, and your ideas shape our growth and evolution.

🌎 A global team - Collaborate with colleagues from diverse backgrounds across the world, embracing different perspectives.

🎉 Benefits and perks - Make the most of what matters to you

🩺 Comprehensive health coverage (CA)

🏝 Flexible PTO

📈 Company stock options

🧠 Professional development budget

💻 Office equipment budget

💆‍♀️ Wellness budget

🧳 Annual team gatherings

🛜 Internet reimbursement

👶 Inclusive parental leave

✈️ Remote work travel program

You belong here

At Platform.sh, we celebrate diversity in all its forms and are committed to fostering an inclusive, equitable, and supportive workplace where everyone can thrive. We embrace and value different perspectives, backgrounds, and experiences, because they make us stronger as a team. Whoever you are, wherever you're from, and whatever path you've taken, you are welcome here. We encourage you to bring your whole self to work, connect with others, and share your passion.

If you need accommodations at any stage of our hiring process, please let us know. We're here to ensure an accessible and comfortable experience for you.

Required profile

Experience

Industry:
Internet
Spoken language(s):
English

Other Skills

  • Mentorship
  • Team Leadership
  • Collaboration
  • Communication
