Offer summary
Qualifications:
BS or MS in CS/CE/EE or equivalent experience, At least 2 years of k8s experience on-prem and in AWS, At least 4 years building automation software for large scale computing clusters, Versatile with at least one programming language like Go or Python, Deep knowledge of networking fundamentals.
Key responsabilities:
- Propose and create solutions to improve availability of the AI platform
- Automate critical processes for distributed GPU clusters
- Drive development of new SRE automation tools
- Work on enhancing observability tooling
- Impact the efficiency of the AV Perception team