Match score not available

Mid-Level Site Reliability Engineer - #33772

72% Flex
Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

2-3 years experience in related roles, Proficiency in Grafana, Kubernetes, Linux.

Key responsabilities:

  • Ensure system reliability, scalability, performance
  • Optimize KPIs, resolve issues, automate tasks
  • Architect advanced features for better performance
  • Document risks, ensure business continuity
  • Collaborate with team, foreign counterparts
Manila Recruitment logo
Manila Recruitment Human Resources, Staffing & Recruiting SME https://www.manilarecruitment.com/
11 - 50 Employees
See more Manila Recruitment offers

Job description

Logo Jobgether

Your missions

As a Mid-level Site Reliability Engineer, you will will be responsible for ensuring the reliability, scalability, and performance of the production environment. This role involves proactive monitoring, issue resolution, and optimization of key performance indicators (KPIs) across infrastructure.


Company Profile :

Our client is headquartered in Manchester, UK where they leverage cutting-edge technology to offer robust and scalable cloud solutions. They are the forefront of the private cloud sector, pioneering advanced solutions within the cloud-native ecosystem. They are dedicated to pushing the boundaries of what’s possible in cloud technology.

Due to their continued success, they are looking to expand their team in the Philippines and are seeking individuals with a high degree of motivation, technical capabilities, and a genuine desire to develop themselves while the business grows further. They are looking for an energetic and proactive Mid-level Site Reliability Engineer to be part of their growing team.

This is an amazing career opportunity for someone who is passionate and has proficiency in providing IT services that exceeds the business requirements. This is the perfect career move for someone who wants to work in a more challenging environment while working hand-in-hand with their foreign counterparts to help and learn from each other.

Duties and Responsibilities:

  • Ensure our production environment operates within defined SLAs through vigilant monitoring and proactive issue resolution.
  • Propose and develop solutions to maintain and enhance key performance indicators (KPIs) across our infrastructure.
  • Optimise system performance, automate routine operational tasks, and implement disaster recovery solutions to ensure business continuity and data integrity
  • Conduct architecture reviews, identify risks, and develop recommendations for improving system performance and reliability.
  • Calculate and communicating risks to ensure and rule out possibility of downtime
  • Advanced High Availability Capacity Building.
  • Architect and implement advanced features, focusing on scalability, security, and performance within the Kubernetes ecosystem.
  • Ensure the reliability and maintainability of our systems by documenting pr

Requirements

Must-have Skills / Qualification:

  • .Have at least 2-3 years working experience as a System Administration, DevOps Engineering, SRE, or other similar roles.
  • Experience with Grafana for at least 2-3 years
  • Experience in Kubernetes
  • Knowledge in managing sensitive data and hardening security via Vault, AWS Secrets management, etc
  • Experience managing Linux OS Machines
  • Knowledge in managing large scale systems using infrastructure as a code such as Crossplane and Terraform
  • Basic networking
  • Open source tools: knowledge dealing with open source tools such and external-dns, reloader, descheduler, cert-manager, etc and the eagerness to learn and explore new tools
  • Familiarity with the broader cloud-native ecosystem, including CI/CD practices, container orchestration, and major cloud services (AWS, GCP, Azure).
  • Excellent problem-solving skills and the ability to work both independently and collaboratively in a team setting.
  • Amenable to work on a shifting schedule
  • Excellent English communication skills to effectively collaborate with foreign counterpart

Advantageous skills:

  • (Loki, Grafana, Tempo, and Mimir)
  • Experience in Docker, and Helm

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Go Premium: Access the World's Largest Selection of Remote Jobs!

  • Largest Inventory: Dive into the world's largest remote job inventory. More than half of these opportunities can't be found on standard platforms.
  • Personalized Matches: Our AI-driven algorithms ensure you find job listings perfectly matched to your skills and preferences.
  • Application fast-lane: Discover positions where you rank in the TOP 5% of applicants, and get personally introduced to recruiters with Jobgether.
  • Try out our Premium Benefits with a 7-Day FREE TRIAL.
    No obligations. Cancel anytime.
Upgrade to Premium

Find more Site Reliability Engineer jobs