Match score not available

Site Reliability Engineer (SRE)

Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

FusionHit logo
FusionHit https://www.fusionhit.com
51 - 200 Employees
See all jobs

Job description

We are looking for a Site Reliability Engineer (SRE) to join our fast-paced, dynamic environment. You will be working with a high-performing team to enhance system reliability, scalability, and automation. This position offers the opportunity to work on cutting-edge infrastructure, collaborating with engineers to improve performance, reduce downtime, and optimize cloud-native applications.

Our client is a leader in the technology sector, providing innovative, scalable solutions to businesses worldwide. Their mission is to ensure highly available and resilient services, leveraging automation, performance monitoring, and infrastructure optimization.

This project focuses on building and maintaining reliable, scalable platforms that support business-critical applications.

Location: Must reside and have work authorization in Latin America.

Availability: Must be available to work with significant overlap with Mountain Standard Time (MST). 


The Ideal Candidate Has: 

  • BS/MS in Computer Science, Information Technology, or related field with 5+ years of experience in site reliability engineering, DevOps, or infrastructure management.
  • Proven experience with web applications and distributed systems.
  • Strong automation skills with experience in CI/CD pipelines (Jenkins, Bamboo, Concourse, or similar).
  • Experience with monitoring and observability tools like Dynatrace, Splunk, or Prometheus.
  • Strong troubleshooting and root cause analysis skills to resolve production issues.
  • Experience with performance tuning and cloud-native application optimization.
  • Knowledge of Infrastructure as Code (IaC) and automation (Terraform, Ansible, or similar).
  • Experience working with SQL and NoSQL databases.
  • Programming experience in Java, Python, Scala, or other object-oriented languages.
  • Excellent communication skills in English (C1 preferred, strong B2 may be considered). 

Key Responsibilities: 

  • Ensure system reliability and availability by monitoring key service metrics, identifying issues, and implementing proactive solutions.
  • Automate repetitive tasks to reduce operational overhead and improve system efficiency.
  • Perform performance testing and capacity planning to optimize infrastructure.
  • Investigate and resolve incidents to improve system stability and prevent recurring issues.
  • Collaborate with development teams to ensure seamless deployments and infrastructure improvements.
  • Improve observability by building and maintaining dashboards for system health monitoring.
  • Contribute to technology roadmaps by recommending improvements in reliability, performance, and automation.

Perks of working at FusionHit: 

  • Certified as a Great Place to Work, offering a supportive and inclusive work culture. 
  • Work from home position
  • Private Medical Insurance 
  • Corporate Access to FusionHit Udemy Account 
  • Personal and Professional Development Courses & Certifications 
  • Flexible Schedule 
  • 3 Sick Days per year 
  • Birthday Off 
  • Extra Days for Special Occasions 
  • Team Building Meal Reimbursement 
  • Equipment Granted 
  • Monthly Recognitions 
  • High Impact Committees 

Are you curious already? 

 

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Troubleshooting (Problem Solving)
  • Communication

Site Reliability Engineer (SRE) Related jobs