Match score not available

Distinguished Sr Engineer, AI Systems - Remote | WFH

Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Bachelor's degree in Computer Science or related field, 9+ years in distributed computing and ML systems, 6+ years developing AI/ML algorithms, 3+ years with AI/ML frameworks in public cloud environments, Expertise in HPC and large-scale ML systems.

Key responsabilities:

  • Design resilient infrastructure for training tasks
  • Develop serving infrastructure for ML models
  • Deploy a thousand-node training cluster with optimal resources
  • Create benchmarks to evaluate AI system performance
  • Innovate applications using large language models
Get It Recruit - Information Technology logo
Get It Recruit - Information Technology Human Resources, Staffing & Recruiting TPE https://www.get.it/
2 - 10 Employees
See more Get It Recruit - Information Technology offers

Job description

Logo Jobgether

Your missions

Job Overview

Join our innovative team in the pursuit of creating reliable and human-centric AI systems that are revolutionizing the banking industry. We are at the forefront of utilizing machine learning technologies to deliver intelligent and automated customer experiences. Our applications leverage AI to enhance customer interactions, whether it is alerting them to unusual transactions or providing real-time support for their inquiries.

In light of our substantial investments in public cloud infrastructure and machine learning platforms, we are exceptionally positioned to exploit the transformative power of AI. Our commitment to assembling top-tier applied science and engineering teams is aimed at enhancing our capabilities and delivering exceptional product experiences anchored in scalable, high-performance AI infrastructure.

Key Responsibilities

  • Design and construct resilient infrastructure that ensures reliable support for extensive training tasks, even during individual node failures, utilizing containers and checkpointing libraries.
  • Develop infrastructure for serving large-scale machine learning models within a public cloud setting.
  • Deploy and optimize a thousand-node training cluster, ensuring efficient storage and networking, alongside tightly coupled training pipelines that leverage various parallelism strategies.
  • Create and implement benchmarks to evaluate AI system performance, providing informed recommendations regarding technology selection.
  • Innovate applications that harness the potential of large language models and foundational models.
  • Establish and maintain MLOps capabilities for foundational models.

Required Skills

  • Extensive experience in designing and constructing high-performance computing (HPC) and large-scale machine learning systems.
  • Proficiency in AI and machine learning algorithm development, primarily using Python or C/C++.
  • Comprehensive understanding of the full machine learning development lifecycle utilizing AI and ML frameworks in public cloud environments.
  • Familiarity with large-scale distributed platforms or systems in cloud environments such as AWS, Azure, or GCP.
  • Expertise in architecting cloud systems with a focus on security, scalability, performance, and cost efficiency.
  • Knowledge of the full stack for distributed training of large models, including ML compilers and frameworks like PyTorch and TensorFlow.

Qualifications

  • Bachelor's degree in Computer Science, Computer Engineering, or a related field.
  • A minimum of 9 years of relevant experience in distributed computing and large-scale machine learning systems.
  • At least 6 years of experience developing AI and machine learning algorithms.
  • A minimum of 3 years of hands-on experience with AI and ML frameworks in public cloud environments.

Career Growth Opportunities

Our organization places a high value on continuous learning and professional development. You will have the opportunity to engage in diverse initiatives and work alongside exceptional engineers and researchers, thereby enhancing your skills and contributing to impactful projects in the realm of AI and machine learning.

Company Culture And Values

We are committed to fostering a culture of collaboration and innovation, where creativity and the application of cutting-edge technology are encouraged to solve real-world challenges. Our commitment to diversity and inclusion ensures that every team member feels valued and empowered to make meaningful contributions.

Compensation And Benefits

This position offers a competitive salary range based on geographic location, supplemented by performance-based incentives. Our comprehensive benefits package is designed to support your overall well-being and includes health and financial benefits. Eligibility for specific benefits may vary based on employment status and management level.

Join Us

Become a vital part of a team that is at the forefront of innovation in the banking sector through advanced AI solutions. We look forward to reviewing your application!

Employment Type: Full-Time

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
Check out the description to know which languages are mandatory.

Soft Skills

  • Collaboration
  • Problem Solving
  • Leadership Development
  • Innovation

Artificial Intelligence Engineer Related jobs