LatAM Site Reliability Engineer

Remote: Full Remote
Experience: Mid-level (2-5 years)
Work from: Latin America

Offer summary

Qualifications:

4+ years of SRE/DevOps experience, experience independently leading project deployments, expertise in cloud platforms (especially AWS), and knowledge of networking protocols, cybersecurity, caching, and CI/CD workflows.

Key responsibilities:

  • Collaborate with DevOps/SRE and DBA teams
  • Enhance current systems and streamline processes for future expansion
  • Monitor cloud infrastructure efficiently and ensure security compliance
  • Liaise with external security agencies and optimize architecture for rapid deployments to new countries
Sporty Group Scaleup https://careers.sporty.com/
501 - 1000 Employees

Job description

Sporty's sites are some of the most popular on the internet, consistently staying in Alexa's list of top websites for the countries they operate in.

In addition to our DevOps Team, we are building a Site Reliability Team whose purpose is to focus on site reliability and security. It will also involve deployment, configuration, and monitoring, as well as the availability, latency, change management, emergency response, and capacity management of services in production.

Responsibilities

Work with a team of DevOps/SRE and DBA professionals
Improve the existing infrastructure and processes currently deployed, and streamline the processes for deploying to new countries in the future
Holistically improve all aspects of our current infrastructure, including reducing costs, streamlining environment provisioning, lowering response times, and incorporating the latest techniques and technologies
Monitor and maintain the existing cloud infrastructure via autoscaling, automated alerts, and OpsWorks and Grafana dashboards
Take ownership and responsibility for our cloud operation activities
Liaise with external security agencies for annual audits as well as perform our own internal security sweeps
Aid in reconfiguring existing architecture to allow for rapid deployments to new countries
Mentor less experienced team members

Requirements

4+ years SRE/DevOps experience
Be based in Latin America
Experience independently leading the planning and deployment of a project
Experienced with cloud platforms, especially AWS, including solid knowledge of how to utilize cloud resources to fulfill the demand from other teams and production
Familiar with one programming or scripting language (Python, Java, etc.)
Experience managing multiple Kubernetes clusters in production (virtualization, orchestration, scalability, security, and high availability), with tools such as Helm, Rancher, and ArgoCD
Solid knowledge of networking protocols and cybersecurity, especially the TCP/IP stack and HTTP
A strong understanding of caching, including CDNs and HTTP caching (Cloudflare, AWS CloudFront)
Experienced with cloud-native monitoring solutions in large distributed systems using the observability model (traces, metrics, logs), with tools such as Prometheus, Jaeger, Loki, ELK, and Grafana
Excellent troubleshooting skills, including Linux OS issue diagnosis and OS parameter optimization

Beneficial

Experience working with other cloud platforms (GCP, Azure, AliCloud) is a plus
Familiar with at least one infrastructure-as-code tool (Terraform, CloudFormation)
Experience designing and implementing CI/CD workflows (Jenkins, GitHub Actions) is a plus
Experience with system automation tools (Ansible, Salt, Chef)
Understanding of modern microservices and service mesh concepts (containers, Istio) is a plus

Benefits

Quarterly and flash bonuses
Core hours of 10am-3pm in your local time zone, with flexible hours outside of this
Education allowance
Referral bonuses
28 days paid annual leave
2 x annual company retreats (Lisbon + Dubai in 2022 / Phuket in Q2 2023 + 1 more TBC!)
Highly talented, dependable co-workers in a global, multicultural organisation
Payment via world-class online wallet system DEEL
Top-of-the-line equipment supplied by market leader Hofy
We score 100% on The Joel Test
Our teams are small enough for you to be impactful
Our business is globally established and successful, offering stability and security to our Team Members

Our Mission

Our mission is to be an everyday entertainment platform for everyone

Our Operating Principles

1. Create Value for Users
2. Act in the Long-Term Interests of Sporty 
3. Focus on Product Improvements & Innovation 
4. Be Responsible 
5. Preserve Integrity & Honesty 
6. Respect Confidentiality & Privacy 
7. Ensure Stability, Security & Scalability 
8. Work Hard with Passion & Pride

Interview Process

HackerRank Test 
Remote video screening with our Talent Acquisition Team + live ID check
Remote 90 min video interview loop with 3 x Team Members (30 mins each)
Pre offer call with Talent Acquisition Team
ID check via Zinc & 2 References from previous employers
24-72 hour feedback loops throughout process

Working at Sporty

The top-down mentality at Sporty is high-performance based, meaning we trust you to do your job, with an emphasis on support to help you achieve, grow, and unblock any issues when they're in your way.
Generally, employees can choose their own hours, as long as they are collaborating, attending stand-ups, and so on. The emphasis is really on results.

As we are a highly structured and established company, we are able to offer the security and support of a global business with the allure of a startup environment. Sporty is independently managed and financed, meaning we don’t have arbitrary shareholder or VC targets to cater to.

We literally build, spend and make decisions based on the ethos of building THE best platform of its kind. We are truly a tech company to the core and take excellent care of our Team Members.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry: Leisure & Entertainment
Spoken language(s): English
