Match score not available

Staff Site Reliability Engineer

Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

8+ years of backend software development experience, Strong understanding of operating systems, Knowledge of containerization technologies, Expertise in at least one programming language, Experience with CI/CD pipelines.

Key responsabilities:

  • Build and maintain monitoring and alerting systems
  • Troubleshoot and resolve outages effectively
  • Develop tools to reduce manual toil
  • Analyze complex problems for effective solutions
  • Conduct chaos engineering experiments for resilience
Forma logo
Forma Human Resources, Staffing & Recruiting Scaleup https://www.joinforma.com/
51 - 200 Employees
See more Forma offers

Job description

About Forma

The market for employee benefits is broken. Companies spend millions annually on employee benefits that employees neither value nor regularly use. Founded in 2017, Forma set out to build a better model by challenging traditional one-size-fits-all approaches.

Forma’s flexible benefits software helps companies offer competitive benefits packages while reducing costs and inefficiencies, by giving employees more choice and flexibility in how they spend their benefit allowances. The platform also saves HR professionals countless hours managing and supporting various point solutions.

Using Forma, companies can select from a suite of products that include Lifestyle Spending Accounts, Health Spending Accounts, Health Reimbursement Arrangements, Flexible Spending Accounts, and more to design and deliver customized benefits programs–all through a single platform. Employees then have three choices to spend account funds: The Forma Store with discounted products and services, The Forma Visa Card, or claim reimbursement backed by Forma’s world-class member support team.

Forma has helped hundreds of the world’s most admired companies, including Stripe, Zoom, Lululemon, and Affirm, design and support flexible, inclusive benefits programs for nearly a million employees. And, we are seeing great success with 98% customer retention, 75 NPS, and 98 CSAT ratings from members.

Forma is backed by Emergence Capital and Ribbit Capital and has received numerous awards for its exponential growth, its software innovation, and as a “Great Place to Work.” 

About the Role

The SRE team is responsible for managing foundational infrastructure and developer experience at Forma.  We're looking for a Site Reliability Engineer with strong cloud (GCP, Azure, or AWS) experience.  In this role, you will be designing and implementing across the tools and services that our engineering teams build upon, such as our container orchestration platform, databases, and cloud infrastructure. You will have an opportunity to help shape the direction of infrastructure and tooling at Forma as we grow, as well as to help improve the reliability and resilience of our platform.

You Will
  • Monitoring and Observability: 

    • SREs build and maintain on-call, monitoring and alerting systems to proactively detect issues before they impact users. They also ensure systems have robust SLOs, logging and tracing capabilities for effective troubleshooting.  

    Incident Response and Management: 

    • SREs are often on the front line when things go wrong. They troubleshoot and resolve outages, conduct post-incident reviews (postmortems), and implement changes to prevent future occurrences.  

  • Automation and Tooling: 

    • SREs are strong advocates for automation. They develop tools and scripts to streamline operations, reduce manual toil, and improve efficiency.   

  • Architecture Reviews:

    • Ability to analyze complex problems, identify root causes, and develop effective solutions.   

    • Mentor Engineering team members to promote a culture of technical excellence and innovation

  • Reliability Engineering: 

    • SREs apply software engineering principles to infrastructure and operations. They conduct chaos engineering experiments to identify system weaknesses and improve resilience. 

       

Preferred Skills  
  • 8+ years of backend software development experience

  • Strong understanding of operating systems (Linux, Windows, etc.), networking, and cloud infrastructure (AWS, GCP, Azure).  

  • Knowledge of containerization technologies (Kubernetes) and orchestration tools.   

  • Expertise in at least one programming language (Python, Go, Java, etc.) and scripting languages (Bash, PowerShell).  

  • Experience with CI/CD pipelines and tools (Circle CI, Github Actions, etc.) to automate software delivery and deployments. 

  • Working knowledge of IAC tools such as Terraform

  • Experience at an early-stage startup is a plus

  • Fintech experience a plus

Benefits and Perks

  • Remote-first working environment

  • Medical, dental and vision insurance plans

  • Employee wellness program

  • One-time home office stipend

  • 401(k) savings plan

  • Flexible PTO policy

  • 12 weeks Parental Leave + 4 additional weeks for the Birthing Parent

At Forma, we value diversity, and always treat all employees and job applicants based on merit, qualifications, competence, and talent. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Duties and responsibilities may not all be covered in the description, or may change over time at the discretion of Forma. You're encouraged to apply even if your experience doesn't precisely match the job description. Your skills and passion will stand out—and set you apart—especially if your career has taken some extraordinary twists and turns. At Forma, we welcome diverse perspectives, and people who think rigorously / aren't afraid to challenge assumptions. Join us!

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Innovation
  • Mentorship
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs