Staff Site Reliability Engineer

extra holidays - fully flexible
Work set-up: 
Full Remote
Contract: 
Experience: 
Expert & Leadership (>10 years)
Work from: 

Offer summary

Qualifications:

10+ years of experience in SRE/DevOps or related fields., Deep expertise in cloud platforms such as AWS, GCP, and Azure., Proficiency in programming languages like Go and Python., Experience with orchestration systems like Kubernetes, Nomad, or Mesos..

Key responsibilities:

  • Set technical direction for system reliability and scalability.
  • Lead large-scale projects and cross-team initiatives.
  • Mentor and develop senior engineers within the organization.
  • Manage post-incident reviews and drive systemic improvements.

SentinelOne logo
SentinelOne Large http://www.sentinelone.com
1001 - 5000 Employees
See all jobs

Job description

About Us

At SentinelOne, we’re redefining cybersecurity by pushing the limits of what’s possible—leveraging AI-powered, data-driven innovation to stay ahead of tomorrow’s threats.

From building industry-leading products to cultivating an exceptional company culture, our core values guide everything we do. We’re looking for passionate individuals who thrive in collaborative environments and are eager to drive impact. If you’re excited about solving complex challenges in bold, innovative ways, we’d love to connect with you.

What are we looking for?

SRE organization’s mission at SentinelOne (S1) is to keep our uptime promise to our customers by ensuring we meet our SLOs/SLAs, help our engineering teams ship software to our customers fast and with quality and ensure our customers are successful.

As a Staff Site Reliability Engineer, you will be a technical leader within the SRE organization, responsible for setting the technical direction and driving the long-term reliability  vision for SentinelOne's production service. You will be empowered to solve systemic, cross-team challenges and improve the reliability, scalability, and performance of our entire service ecosystem. You will not just contribute to major initiatives like our Monitoring and Observability Uplift and Logging Pipeline modernization; you will be instrumental in leading the strategy and architecture for these large-scale projects, ensuring they meet the long-term needs of the business.

What will you do? 

As a Staff SRE, you will be a key technical leader, strategist, and mentor. You will operate across teams to solve the most challenging reliability and scalability problems at SentinelOne. Your responsibilities will include:

  • Setting the technical direction for reliability across multiple services, partnering with engineering leaders to create and execute long-term roadmaps.
  • Identifying and eliminating entire classes of operational work by designing and building scalable, automated platforms for use by all of SRE and Engineering.
  • Leading post-mortems for major, multi-system incidents and owning the strategic follow-up to address systemic root causes across the organization.
  • Mentoring and developing senior engineers within the SRE organization, acting as a force multiplier to level up the entire team.
  • You will join a like minded team of SRE’s who help run our operations smoothly at scale by building a platform on which S1’s services can run. If the thought of running a large scale cybersecurity platform on various cloud providers and air gapped environments excite you, you’ve found the right place!
  • As a team we value good written communication skills, data driven decisions and a keen eye for continuous improvements. You’ll help simplify, have a passion for new ideas and know how to execute iteratively towards the final goal. We value candor and collaboration.

What skills and knowledge should you bring?

  • An extensive and proven track record (e.g., 10+ years) in SRE/DevOps, with deep experience leading large, cross-functional technical projects from inception to completion.
  • Deep, architectural-level expertise across multiple cloud providers (AWS, GCP, Azure), with proven experience designing, running, and troubleshooting highly-available systems in complex, multi-cloud and air-gapped environments.
  • Great proficiency in one or more mainstream languages (e.g., Go, Python), with demonstrable experience building scalable software and automation platforms.
  • Strong Production experience with orchestration systems like Kubernetes, Nomad or Mesos (We are a Kubernetes shop)
  • Proven ability to set technical direction and influence the roadmap of multiple engineering teams without direct authority.
  • Experience with SecOps & Compliance processes and their touch points with SRE is desired
  • Polyglot experience with other SRE tools – we integrate with more tools every day

Apart from the above technical skills, following soft skills are required:

  • A strong sense of business acumen and the ability to evaluate technical decisions in the context of cost, risk, and long-term company strategy
  • Demonstrated experience in mentoring and growing senior engineers.
  • Exceptional communication skills, with the ability to articulate complex technical concepts to diverse audiences, from junior engineers to executive leadership.
  • Curiosity, fast-learning, pursuit to improvements, great communication
  • Ability to work in a diverse and distributed team
  • A self-starter that is passionate and motivated by new technologies and has empathy for legacy systems
  • A quick learner that can navigate through unfamiliar programming languages, systems and processes

Why Us? 

You will be joining a cutting-edge company, where you will tackle extraordinary challenges and work with the very best in the industry along with competitive compensation. 

  • Flexible working hours and hybrid/remote work model
  • Flexible Time Off.
  • Flexible Paid Sick Days.
  • Global gender-neutral Parental Leave (16 weeks, beyond the leave provided by the local laws) 
  • Generous employee stock plan in the form of RSUs (restricted stock units)
  • On top of RSUs, you can benefit from our attractive ESPP (employee stock purchase plan)
  • Gym membership by Cultfit.
  • Wellness Coach app, with 3,000+ on-demand sessions, daily interactive classes, audiobooks, and unlimited private coaching. 
  • Private medical insurance plan for you and your family.
  • Life Insurance covered by S1 (for employees)
  • Telemedical app consultation (Practo)
  • Global Employee Assistance Program (confidential counseling related to both personal and work life matters)
  • High-end MacBook or Windows laptop.
  • Home-office-setup allowances (one time) and maintenance allowance. 
  • Internet allowances.
  • Provident Fund and Gratuity (as per govt clause)
  • NPS contribution (Employee contribution)
  • Half yearly bonus program depending on the individual and company performance.
  • Above standard referral bonus as per policy.
  • Udemy Business platform for Hard/Soft skills Training & Support for your further educational activities/trainings
  • Sodexo food coupons.

SentinelOne is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

SentinelOne participates in the E-Verify Program for all U.S. based roles. 

Required profile

Experience

Level of experience: Expert & Leadership (>10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Communication
  • Teamwork
  • Business Acumen
  • Curiosity
  • Mentorship

Site Reliability Engineer (SRE) Related jobs