Site Reliability Engineer (SRE)

Requirements:

  • 5+ years of professional experience in Site Reliability Engineering (SRE) with hands-on software development background
  • Strong experience with AWS cloud infrastructure and Terraform for infrastructure provisioning and automation
  • Advanced Datadog experience for monitoring, telemetry, and observability, plus solid Python scripting and automation skills
  • Experience with Kubernetes, CI/CD pipelines, Git, and implementing SLOs/SLIs and error budgets

Roles & Responsibilities

  • Operate in a hybrid SRE/software engineering role, contributing to the codebase while establishing and scaling reliability practices
  • Define and implement observability, reliability metrics, and automation across the platform to ensure scalability, resilience, and high availability
  • Collaborate with engineering teams to implement SLOs/SLIs, telemetry, and instrumentation, while writing the code required to support reliability initiatives
  • Develop and maintain CI/CD pipelines and automation workflows to support frequent, incremental deployments

Job description

At ioet, a leading software company with a talented team across LATAM, we provide Software Engineering as a service to clients worldwide. Join us for exciting professional challenges, working on projects ranging from innovative startups to globally recognized brands. Our positions are full-time, remote, and offer competitive compensation in USD.

We are looking for an experienced Site Reliability Engineer (SRE) who is eager to grow professionally within our dynamic and highly skilled software development team. This role is ideal for engineers who combine strong software development experience with hands-on reliability engineering practices.

In this position, you will operate in a hybrid role combining SRE and software engineering responsibilities, helping establish and scale reliability practices while contributing directly to the codebase. You will be responsible for defining and implementing observability, reliability metrics, and automation across the platform, ensuring systems remain scalable, resilient, and highly available.

You will work closely with engineering teams to implement Service Level Objectives (SLOs), Service Level Indicators (SLIs), telemetry, and instrumentation, while also writing the code required to support these reliability initiatives.

Requirements

  • 5+ years of professional experience in a Site Reliability Engineering role.

  • Previous hands-on experience as a Software Engineer, with strong coding fundamentals.

  • Strong experience working with AWS cloud infrastructure.

  • Extensive hands-on experience with Terraform for infrastructure provisioning and automation.

  • Advanced experience with Datadog for monitoring, telemetry, and observability (must be highly proficient).

  • Solid scripting and automation skills using Python.

  • Experience with Kubernetes.

  • Strong understanding of SRE principles, including reliability engineering practices, SLOs, SLIs, and error budgets.

  • Experience building and maintaining CI/CD pipelines, automation workflows, and deployment systems.

  • Experience working with Git and modern development workflows.

  • Comfortable with frequent, incremental deployments and testing.

  • Strong English communication skills – Advanced level required.

  • Send your application and CV in English (mandatory).

  • Based in Latin America.

Nice to Have

  • Development experience using Java.

  • Development experience using TypeScript or JavaScript.

  • Experience building instrumentation and telemetry frameworks for distributed systems.

  • Experience contributing to reliability programs in production environments.

Benefits

  • Remote work

  • Flexible schedule

  • Collaboration with international clients

  • USD compensation

  • Paid holidays and vacation

  • Paid family and sick leave

  • English classes

  • Educational and wellness bonus

  • Structured career plan with regular salary reviews

  • Emphasis on personal growth and mentorship

Are you ready to be part of the ioet journey?
Prepare your CV in English and apply now.

If you are curious to know more about our culture, technologies, and blogs, v
