Senior Site Reliability Engineer

extra holidays - extra parental leave
Remote: Full Remote

Offer summary

Qualifications:

  • 5+ years of programming experience in Python, Go, or Shell Script.
  • In-depth knowledge of containerization and orchestration technologies like Docker and Kubernetes.
  • Experience with public cloud platforms such as GCP, Azure, or AWS.
  • Strong understanding of networking concepts and Linux system administration.

Key responsibilities:

  • Build and manage systems for application and infrastructure lifecycle management.
  • Troubleshoot problems, downtime, and alerts effectively.
  • Contribute to the direction and goals of the Site Reliability Engineering team.
  • Automate delivery pipelines and simplify processes to enhance efficiency.

Clover Health (Health Care, Scaleup) https://www.cloverhealth.com/
501 - 1000 Employees

Job description

At Counterpart Health, a subsidiary of Clover, we are transforming healthcare and improving patient outcomes with our innovative primary care tool, Counterpart Assistant. Incubated by Clover Health as Clover Assistant, it has helped improve plan performance and clinical outcomes for Medicare members through proprietary AI technology.

We are looking for someone who is experienced in site reliability and infrastructure to join our engineering team. You will support Counterpart Health's existing technology infrastructure, which includes reviewing and improving our processes, helping develop new automation tools to manage and remove toil, and troubleshooting issues as they arise. You will partner with technical leads in other engineering disciplines, as well as data scientists and other technology professionals, to develop and maintain a modern, scalable infrastructure platform that hosts domestic and international workloads with a variety of compute, storage, and networking needs. We're looking for someone with prior experience deploying and maintaining containerized infrastructure and workloads. Kubernetes competency is highly valued.

As a Senior Site Reliability Engineer, you will:

  • Build systems for declarative application and infrastructure lifecycle management: continuous deployment, continuous integration, Kubernetes cluster management, service and workload inventory.
  • Prioritize and help troubleshoot problems, downtime, and alerts.
  • Contribute to setting the direction for the Site Reliability Engineering team and clearly establish goals that are aligned with Clover's company-level goals.
  • Foster a healthy, motivated, and interdisciplinary culture that is the bedrock of high-performing teams.
  • Simplify the process by automating the delivery pipeline and database changes.

You will love this job if:

  • You enjoy working in a fluid, collaborative environment, defining and taking ownership of priorities that add to our larger goals. You can bring clarity to ambiguity while remaining open-minded to new information that might change your mind.
  • You are not hesitant to jump in to help fix things that are broken and you get a sense of accomplishment from making sustainable systems. You are happy to fill in the gaps to reach a goal where necessary, even if it does not always fit your job description.
  • You want to be part of building a team that emphasizes delivery, reliability and security.
  • You have a genuine interest in what good technology can do to help people and take pride in tackling hard problems in an important industry.

You should get in touch if:

  • You have 5+ years of programming experience and are proficient in at least one of the following programming languages: Python, Go, or Shell Script.
  • You have in-depth knowledge of containerization and orchestration technologies, such as Docker, containerd, and Kubernetes, as well as experience with CNCF-based technologies like Helm, gRPC, and Prometheus.
  • You have experience with public cloud platforms such as GCP, Azure or AWS.
  • You are knowledgeable in basic networking such as TCP/IP, UDP, firewall, routing, DNS, and load balancing.
  • You have experience with Linux system administration and basic knowledge of Linux’s design.
  • You understand the key concepts in SRE such as monitoring, performance tuning, and automation. 
  • You are able to work autonomously with limited guidance.
  • You have excellent communication and collaboration skills, can work effectively with cross-functional teams, and adapt quickly to new challenges and technologies.

Benefits Overview

  • Financial Well-Being: Our commitment to attracting and retaining top talent begins with a competitive base salary and equity opportunities. Additionally, we offer a performance-based bonus program, 401k matching, and regular compensation reviews to recognize and reward exceptional contributions.
  • Physical Well-Being: We prioritize the health and well-being of our employees and their families by providing comprehensive medical, dental, and vision coverage. Your health matters to us, and we invest in ensuring you have access to quality healthcare.
  • Mental Well-Being: We understand the importance of mental health in fostering productivity and maintaining work-life balance. To support this, we offer initiatives such as No-Meeting Fridays, monthly company holidays, access to mental health resources, and a generous flexible time-off policy. Additionally, we embrace a remote-first culture that supports collaboration and flexibility, allowing our team members to thrive from any location.
  • Professional Development: Developing internal talent is a priority for Clover. We offer learning programs, mentorship, professional development funding, and regular performance feedback and reviews.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. We are an E-Verify company.

A reasonable estimate of the base salary range for this role is $165k to $266k. Final pay is based on several factors, including but not limited to internal equity, market data, and the applicant’s education, work experience, and certifications.

Required profile

Industry: Health Care
Spoken language(s): English

Other Skills

  • Collaboration
  • Communication
  • Problem Solving
