Senior Site Reliability Engineer

Remote: Full Remote
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • Experienced in cloud infrastructure design and management
  • Strong background in incident management
  • Proficient in developing CI/CD pipelines
  • Solid understanding of software reliability practices

Key responsibilities:

  • Design, build, and maintain cloud infrastructure
  • Develop SLIs and SLOs for software reliability
  • Lead incident management processes
  • Create efficient development workflows

Invert — We're hiring!
2 - 10 Employees

Job description

The company

At Invert, we are on a mission to dramatically reduce the dollar and time cost of using biology to manufacture ~everything. Our customers use bioprocessing to do things like: produce new therapies to combat disease, create new biomaterials to solve the environmental crisis, and manufacture essential chemicals cleanly. We provide them with tools to automate the design, execution, and analysis of all that amazing work!

The Invert team is made up of creative and talented engineers, data scientists, biologists, and more, and we are supported by amazing investors. We value diversity and welcome individuals from all backgrounds to join our remote-first, collaborative environment.

The team

You will be joining our Site Reliability Engineering team, a critical part of our Engineering organization. Our SRE team is at the heart of ensuring our software's reliability, performance, and seamless delivery from code to customer.

Key Responsibilities

Infrastructure and Reliability

  • Design, build, and maintain scalable and secure cloud infrastructure as code

  • Develop and enforce Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure software reliability

  • Enable cost transparency and optimize infrastructure spending

Developer Experience and Productivity

  • Reduce cognitive load for product engineers by creating streamlined, efficient development workflows

  • Build and maintain robust CI/CD pipelines that accelerate time from code to customer

  • Create and maintain intuitive, comprehensive observability solutions for end-to-end system monitoring

Incident Management and On-Call

  • Lead and continuously improve our Incident Management process

  • Participate in the on-call rotation, serving as a First Responder to quickly address and resolve system issues

  • Develop and maintain incident response playbooks and post-mortem practices

The role

You will work closely with:

  • Our Software Engineers

  • Our Product, CX, Growth, and Sales teams

  • Our CTO office

Competencies:

  • Adaptable: Resilient in the face of changing priorities

  • Ambitious: Intrinsically motivated, driven to succeed

  • Communicates effectively: Ensures that the right information gets to the right people at the right time

  • Mentors effectively: Educates and empowers others

  • Takes ownership: Takes accountability, prioritizes team success

  • Technically skilled: Experienced in the relevant tech stack

  • Technically productive: Prioritizes velocity while maintaining sufficient quality

  • Trustworthy: Acts in the company’s best interests

The package

  • High-growth startup with impactful work

  • Fully remote, distributed across US and European timezones

  • Competitive salary, equity, and benefits

  • New laptop, monitor, and accessories of your choice

  • Frequent team offsites

  • Unlimited PTO

The interview process

The interview process consists of the four stages described below. Candidates are assessed between each of these stages. The hiring manager is responsible for communicating decisions and next steps throughout the process. We aim to complete all stages within two weeks.

  1. Discovery: A 30-minute conversation with the hiring manager to determine whether there is mutual interest in moving forward.

  2. Non-Technical Competencies: Two 60-minute interviews with two different employees to assess non-technical competencies.

  3. Technical Competencies: A 90-minute working session with two employees to assess technical competencies.

  4. References and Founder Chat: Three 15-minute conversations between the hiring manager and previous colleagues to gather external input. In parallel, a 30-minute meet-and-greet with one or both of the founders, depending on whether they have already taken part in earlier interviews.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Technical Acumen
  • Adaptability
  • Communication
  • Trustworthiness
  • Mentorship
