
Site Reliability Engineer (Colombia)

Remote: Full Remote

Offer summary

Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • Extensive experience in DevOps practices and Kubernetes orchestration.
  • Strong programming skills in Java.

Key responsibilities:

  • Automate infrastructure using IaC tools like Terraform.
  • Manage CI/CD pipelines for Java Spring applications.
  • Orchestrate containerized workloads with Kubernetes.
  • Implement monitoring solutions using Prometheus or Grafana to identify performance issues.
  • Respond to incidents, conduct root cause analysis, and optimize system performance.
Captivate Chat Startup https://captivatechat.com/
11 - 50 Employees

Job description

Position Overview: This role is part of a “Follow the Sun” support model, with coverage from New Zealand, the Philippines, and Colombia. We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have extensive experience in DevOps practices, continuous integration and continuous deployment (CI/CD) pipelines, and container orchestration with Kubernetes. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our integration platforms, with a focus on Java Spring applications.

Key Responsibilities:

  1. Infrastructure Automation: Design, implement, and maintain infrastructure as code (IaC) using tools such as Terraform, Ansible, or Chef to automate the deployment and management of cloud infrastructure.
  2. CI/CD Pipeline Management: Develop and optimize CI/CD pipelines using GitHub Actions or similar tools to automate build, test, and deployment processes for Java Spring applications.
  3. Kubernetes Orchestration: Deploy, configure, and manage Kubernetes clusters to orchestrate containerized workloads, ensuring high availability, scalability, and reliability.
  4. Monitoring and Alerting: Implement monitoring and alerting solutions using tools like Prometheus, Grafana, or the ELK stack to proactively identify and address performance issues and service disruptions.
  5. Incident Response and Troubleshooting: Respond to and resolve incidents in a timely manner, conducting root cause analysis and implementing preventive measures to minimize the risk of recurrence.
  6. Performance Optimization: Identify opportunities for performance optimization and efficiency improvements in the infrastructure and application stack, collaborating with development teams to implement solutions.
  7. Security and Compliance: Implement security best practices and compliance standards (e.g., GDPR, HIPAA) in the infrastructure and application environments, ensuring data privacy and regulatory compliance.
  8. Documentation and Knowledge Sharing: Document system configurations, procedures, and troubleshooting steps, and share knowledge with the team to foster collaboration and continuous learning.

Requirements:

  1. Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  2. Extensive experience in DevOps practices, including infrastructure automation, configuration management, and CI/CD pipelines.
  3. Proficiency in GitHub pipelines and CI/CD practices, with hands-on experience in configuring and managing GitHub Actions.
  4. Strong expertise in container orchestration with Kubernetes, including cluster management, deployment, scaling, and monitoring.
  5. Solid programming skills in Java and experience with the Java Spring framework.
  6. Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  7. Knowledge of networking concepts, security principles, and best practices.
  8. Excellent problem-solving skills, attention to detail, and ability to work effectively in a fast-paced environment.
  9. Strong communication and collaboration skills, with the ability to work closely with cross-functional teams.

Location: Remote, Colombia

Contract Type: Full-time

Salary: 2,500 – 3,000 USD (Full-time)

How to Apply:
Interested candidates are invited to submit their resume and a cover letter detailing their relevant experience to [email protected]. Please include “Site Reliability Engineer” in the subject line.

Job Category: Site Reliability Engineer
Job Type: Full Time
Job Location: Colombia

Required profile

Spoken language(s): English

Other Skills

  • Verbal Communication Skills
  • Ability To Meet Deadlines
  • Organizational Skills
  • Analytical Skills
