Offer summary
Qualifications:
Degree in Computer Science or related field with 5+ years experience, Knowledge of HPC and AI technologies, job scheduling, Linux networking, Experience in storage solutions, Python, bash scripting, network protocols.
Key responsabilities:
- Design, implement and maintain large scale HPC/AI clusters
- Manage job/workload schedules, develop CI/CD pipelines
- Automate deployment, monitoring and alerting of infrastructure
- Troubleshoot bottom-up technical issues, define standard methodologies
- Support Research & Development, engage in POCs for improvements