Senior Site Reliability Engineer

Requirements

  • 5 years of hands-on experience handling production environments
  • Proficiency with cloud platforms (AWS, GCP, IBM or Azure)
  • Experience with containerization technologies (Docker, Kubernetes)
  • Comfortable writing scripts in Bash, Python, Go, or similar languages and working with Linux and command line interfaces

Roles & Responsibilities

  • Design, implement, monitor, and maintain Sysdig's infrastructure at scale across multiple clouds and on-prem
  • Collaborate with development teams to improve system reliability, performance, and scalability
  • Participate in on-call rotation, respond to incidents, conduct root cause analyses, and implement preventive measures
  • Manage cloud infrastructure using Infrastructure as Code practices, and implement security and data protection best practices and compliance requirements

Job description

At Sysdig, we believe cloud security isn't a compromise - it's a promise. From the start, our mission has been clear: to help organizations secure innovation in the cloud, the right way.

We created Falco, the open standard for cloud threat detection, and continue to lead the cloud security market with runtime insights, open innovation, and agentic AI. Creators of technology trusted by over 60% of the Fortune 500, Sysdig gives teams the real-time clarity to move fast and defend what matters most.

Culture matters here. We believe diversity fuels stronger ideas, and open dialogue drives sharper decisions. Recognized as a Best Place to Work and one of Deloitte's fastest-growing companies for the past 5 years, we're here to raise the standard for what cloud security and workplace culture should be.

If you have the passion to dig deeper, the desire to challenge convention, and the curiosity to build something better, Sysdig is the right place for you.

What you will do
  • Design, implement, monitor, and maintain Sysdig's infrastructure at scale across different clouds and on-prem
  • Collaborate with development teams to improve system reliability, performance, and scalability
  • Participate in on-call rotation, respond to incidents, conduct root cause analyses, and implement preventive measures
  • Manage cloud infrastructure using Infrastructure as Code practices
  • Implement security and data protection best practices and compliance requirements

What you will bring with you
  • 5 years of hands-on experience handling production environments
  • Proficiency with cloud platforms (AWS, GCP, IBM or Azure)
  • Experience with monitoring and observability tools
  • Background in automating operational tasks and reducing toil
  • Experience with containerization technologies (Docker, Kubernetes)

What we look for
  • Comfortable writing scripts in Bash, Python, Go, or similar languages and working with Linux and command line interfaces
  • Problem-solving mindset focused on automation, prevention, and operational excellence
  • Deep understanding of distributed systems and microservices architecture
  • Fast learner with proven experience working in dynamic environments

When you join Sysdig, you can expect:
  • Extra days off to prioritize your well-being
  • Mental health support for you and your family through the Modern Health app
  • Great compensation package

We would love for you to join us! Please reach out even if your experience doesn't perfectly match the job description. We can always explore other options after starting the conversation. Your background and passion will set you apart, especially if your career path is different.

Some of our hiring managers are globally distributed, so an English version of your CV will be appreciated.

Sysdig values a diverse workplace and encourages women, people of color, LGBTQIA+ individuals, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply. Sysdig is an equal-opportunity employer. Sysdig does not discriminate on the basis of race, color, religion, sex, national origin, age, disability, genetic information, sexual orientation, gender identity, or any other legally protected status.
