Offer summary
Qualifications:
BS/MS/PhD or equivalent in Computer Science, Data Science, Engineering, or Mathematics, At least 8 years experience with Python/C++ and software development, Experience with medium to large scale AI training and key libraries like NeMo Framework, Familiarity with HPC systems, cloud architectures (AWS, Azure, GCP), and deployment tools, Expertise in coding, debugging, and deploying high-performance AI solutions.
Key responsabilities:
- Build robust AI/HPC infrastructure for customers
- Support performance, monitoring, and reliability of AI clusters
- Engage in and improve service lifecycle from design to operation
- Measure progress of AI jobs and suggest improvements
- Travel regionally for on-site customer interactions