Site Reliability Engineer (SRE)

unlimited holidays
Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)

Offer summary

Qualifications:

Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, Mathematics, or related field., At least 3 years of professional experience in a high-growth environment., Extensive experience with Kubernetes and building scalable infrastructure., Knowledge of infrastructure-as-code tools and CI/CD pipelines..

Key responsibilities:

  • Build and maintain scalable infrastructure for machine learning models.
  • Establish standards for reliability and performance across infrastructure.
  • Automate processes, especially for CI/CD pipelines.
  • Collaborate with teams to understand requirements and translate them into technical solutions.

Baseten logo
Baseten Startup https://www.baseten.co/
11 - 50 Employees
See all jobs

Job description

ABOUT BASETEN

Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market fast. Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we’re trusted by leading AIdriven innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, and Zed to deliver industryleading performance, security, and reliability for their missioncritical workloads. With our recent $75M Series C funding, we’re growing fast to make AI accessible across all products.

THE ROLE

As a Site Reliability Engineer, youll envision and build robust systems and processes that ensure our infrastructure is scalable, reliable, and efficient. This can range from automating deployments and monitoring systems to optimizing performance and managing incidents.

We all work closely with our users, learning from their past struggles in operationalizing ML, onboarding them onto our platform, and turning our learnings into ideas for improving Baseten.

EXAMPLE INITIATIVES

Youll get to work on these types of projects as part of our Infrastructure team:

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Mentorship
  • Empathy
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs