Senior Site Reliability Engineer

Work set-up: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Proficiency in scripting languages like Bash or Python., Fundamental knowledge of Linux internals, networking, and storage., Experience with cloud services such as AWS or GCP and infrastructure-as-code tools like Terraform or CloudFormation., Strong understanding of cloud native tools like Kubernetes, Prometheus, and Istio..

Key responsibilities:

  • Monitor and maintain mission-critical production systems for maximum uptime.
  • Design and implement scalable distributed systems for autonomous vehicles.
  • Develop incident management frameworks and promote a culture of continuous learning.
  • Automate system reliability and build comprehensive documentation and runbooks.

Stack AV logo
Stack AV Information Technology & Services Scaleup http://www.stackav.com/
51 - 200 Employees
See all jobs

Job description

About Stack:

Stack is developing revolutionary AI and advanced autonomous systems designed to enhance safety, reliability, and efficiency of modern operations. Stacks autonomous technology incorporates cuttingedge advancements in artificial intelligence, robotics, machine learning, and cloud technologies, empowering us to create innovative solutions that address the needs and challenges of the dynamic trucking transportation industry. With decades of experience creating and deploying real world systems for demanding environments, the Stack team is dedicated to developing an autonomous solution ecosystem tailored to the trucking industrys unique demands.

About the Role:

Stack AV Site Reliability Engineers are responsible for enabling and ensuring our production systems meet their servicelevel objectives. Through the implementation of centralized observability and automation, the SRE team constantly ensures the health, reliability, scalability, and performance of Stack AV’s infrastructure. Members of the team are expected to contribute to a culture of continuous learning, provide consultation on architecting for highavailability, and ultimately drive the uptime and performance of our systems.

Responsibilities:

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Teamwork
  • Communication

Site Reliability Engineer (SRE) Related jobs