Site Reliability Engineer

Remote: Full Remote

Offer summary

Qualifications:

5+ years as an SRE, experience with large-scale distributed systems, in-depth knowledge of operating systems and cloud services, proficiency in Python, Go, or Java.

Key responsibilities:

  • Collaborate with R&D engineers on production operations
  • Design and maintain infrastructure solutions
Infinity Group http://www.infinity-group.pl
51 - 200 Employees

Job description

Are you a skilled Site Reliability or DevOps Engineer looking for a challenging role at a global tech company? Our client, a leading provider of cloud communication solutions, is seeking talented engineers to join their team.

This is a fully remote position. We welcome candidates from Georgia, Armenia, Kazakhstan, Serbia, Moldova, Cyprus and the European Union.

You’ll work on core infrastructure, collaborate with brilliant engineers, and contribute to ensuring the stability and performance of our systems.

If you’re passionate about technology and want to make a real impact, this is the opportunity for you.

Responsibilities

  • Collaborate with R&D engineers on coordination, communication, and execution of production-related operations;
  • Design, implement, and maintain scalable and reliable infrastructure solutions to support our applications and services;
  • Develop and deploy monitoring, alerting, and logging systems to proactively identify and mitigate operational issues;
  • Build an SRE dashboard with KPIs to measure application reliability;
  • Conduct capacity planning and performance tuning to optimize system performance and resource utilization for improved user experience;
  • Automate repetitive tasks and processes to streamline operations and improve efficiency;
  • Participate in incident response and resolution, including root cause analysis and post-mortem reviews;
  • Continuously evaluate and adopt new technologies and methodologies to enhance our infrastructure and operations;
  • Create and maintain documentation, runbooks, and knowledge base articles covering system configurations, procedures, and best practices.

Requirements

  • 5+ years as an SRE with a passion for technology and strong motivation to build highly reliable solutions;
  • Proven experience in managing large-scale distributed systems and understanding the principles of scalability and reliability;
  • In-depth understanding of operating systems, networking, and cloud services;
  • Experience with observability and monitoring tools (mandatory);
  • Experience with Git, virtualization, and containers (Docker, Kubernetes);
  • Cloud providers: GCP preferred; AWS or Azure is an advantage;
  • Proficiency in programming languages such as Python, Go, or Java;
  • Strong communication skills, both verbal and written, with the ability to adapt the messaging to different perspectives (technical, business) and levels of detail;
  • Ability to grasp new technologies quickly and to prioritize across multiple responsibilities;
  • Excellent problem-solving skills and the ability to work effectively in a fast-paced, dynamic environment.

Advantages

  • Experience with IaC (Infrastructure as Code).
  • Knowledge of the Russian language.

Our offer

  • B2B contract.
  • 100% remote work.
  • 20 working days of paid vacation annually.
  • Compensation for professional development courses directly related to the candidate's role.
  • Access to resources, including sessions with professional psychologists.
  • No screen-capture tools or keyloggers.
  • No bureaucracy.
  • International contract with a multinational company, denominated in USD.

Pay scales

B2B: 4 000-5 500 USD gross

Online recruitment process

  • CV analysis
  • HR interview
  • Client's technical interview (with short coding task)
  • Client's technical interview with the CTO
  • Decision

Required profile

Spoken language(s):
English

Other Skills

  • Time Management
  • Adaptability
  • Communication
  • Problem Solving
