Offer summary
Qualifications:
10+ years in SRE or infrastructure engineering, 5+ in leadership, Experience managing large-scale cloud systems (AWS, GCP, Azure), Strong skills in automation (Terraform, Ansible) and scripting (Python, Bash), Expertise in Docker, Kubernetes, and network infrastructure, Strong knowledge of CI/CD pipelines, incident management, and security practices.
Key responsabilities:
- Lead infrastructure design, ensuring high availability and scalability
- Build and mentor a global SRE team with 24/7 support
- Develop SLAs for uptime and performance, focusing on automation
- Implement strategies for monitoring, incident response, and rapid recovery
- Collaborate with engineering teams on scalable architecture and processes