Offer summary
Qualifications:
- Master's, PhD, or equivalent experience in Computer Science, AI, Applied Math, or a related field
- 5+ years of relevant work or research experience in Deep Learning
- Strong proficiency in Python, PyTorch, and related ML tools
- Excellent software design skills, including debugging and performance analysis
- Strong algorithms and programming fundamentals
Key responsibilities:
- Train, develop, and deploy generative AI models like LLMs
- Develop high-performance optimization techniques for inference
- Analyze and profile GPU kernel-level performance to identify optimizations
- Collaborate with teams across NVIDIA to implement performant kernels
- Innovate on inference performance of NVIDIA's AI software solutions