Offer summary
Qualifications:
MSc or PhD in Computer Science, Engineering, or related field., Over 5 years of confirmed experience in model inference optimization., Expertise in modern machine learning frameworks, particularly PyTorch and ONNX., Strong programming proficiency in CUDA, Python, and C++..Key responsabilities:
- Develop strategies to optimize AI model inference for on-device deployment.
- Benchmark performance, identify bottlenecks, and implement solutions.