Match score not available

Senior SRE Engineer

72% Flex
Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Proven experience in SRE or Software Engineering, Knowledge of major cloud providers and Kubernetes, Proficiency in Go language, GitOps, and other languages, Experience with cluster management systems, Experience with MongoDB, Redis, MySQL.

Key responsabilities:

  • Designing, analyzing, and troubleshooting large-scale distributed systems
  • Monitoring and optimizing system capacity and performance
  • Leading incident management process and adoption
  • Collaborating with cross-functional teams on system design
  • Maintaining and improving monitoring tools and observability
ESL FACEIT Group [EFG] logo
ESL FACEIT Group [EFG] Gaming SME https://eslfaceitgroup.com/
501 - 1000 Employees
See more ESL FACEIT Group [EFG] offers

Job description

Logo Jobgether

Your missions

At EFG (ESL FACEIT Group) we create worlds beyond gameplay, where players and fans become a community. We pride ourselves in having a corporate social responsibility which is that “IT’S NOT GG, UNTIL IT’S GG FOR ALL”.

Our passion, craft, and DNA are aligned to create and shape the world of esports, gaming tournaments, leagues, events, and holistic ecosystems through our millions of players, fans, and heroes, as well as through our people, and culture.

About FACEIT

With more than 25m users playing 30m matches every month FACEIT is the leading competitive gaming platform. We provide gamers the best experience possible by making sure we are always on top of our tech – and continue to deliver industry-leading features to our already awesome platform.

The Team:

As a Senior Site Reliability Engineer at EFG, you will be designing, analyzing, and troubleshooting large-scale distributed systems. You will demonstrate a systematic problem-solving approach, and the ability to debug and optimize code and to automate routine tasks. You will ensure that EFG’s services and systems are reliable, that they have uptime appropriate to users' needs and they have a fast rate of improvement. 

Apart from monitoring our systems' capacity and performance, you will also focus on optimizing existing systems, on building infrastructure and on eliminating work through automation.  You will work collaboratively with the software engineering teams to deploy and operate our systems, and you will help to automate and streamline our operations and processes. Within this role, you will be given real responsibilities, and you have the opportunity to drive change and have a big impact on our products and platform.

  • Maintaining and improving the monitoring and observability tools (Grafana/Prometheus/Thanos/Jaeger)
  • Working closely with your team and with other cross-functional teams to help design, maintain and operate systems at scale
  • Developing and driving the adoption of SRE best practices across the company
  • Leading on incident management process and adoption
  • Using your troubleshooting skills to help identify and fix operational issues
  • Working with Cloud Native technologies such as Kubernetes, Envoy, Istio, Prometheus and Helm
  • Working with the “Hashi Stack” (terraform, packer, vault)
  • Experimenting with and introducing cutting-edge technologies

Requirements

  • Proven experience as an SRE Engineer or Software Engineer, focusing on building and maintaining scalable infrastructures
  • Excellent working knowledge on at least one of the major cloud providers (GCP/AWS/Azure)
  • You have experience with cluster management systems (Kubernetes)
  • Knowledge of incident management: ability to investigate, troubleshoot, recover and prevent the recurrence of incidents that interfere with the normal delivery of IT services
  • Proficient in Go language and proficiency in at least another language: Java, Python, Rust…
  • You have knowledge of GitOps practices
  • You have production scale experience with one of the following; MongoDB, Redis, MySQL
  • Experience contributing to open source technologies would be an added bonus

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Gaming
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Soft Skills

  • Problem Solving
  • Team Collaboration

Go Premium: Access the World's Largest Selection of Remote Jobs!

  • Largest Inventory: Dive into the world's largest remote job inventory. More than half of these opportunities can't be found on standard platforms.
  • Personalized Matches: Our AI-driven algorithms ensure you find job listings perfectly matched to your skills and preferences.
  • Application fast-lane: Discover positions where you rank in the TOP 5% of applicants, and get personally introduced to recruiters with Jobgether.
  • Try out our Premium Benefits with a 7-Day FREE TRIAL.
    No obligations. Cancel anytime.
Upgrade to Premium

Find other similar jobs