Senior Machine Learning Engineer

Work set-up: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Strong experience in deploying diffusion-based generative models into production., Proficiency in building automated MLOps pipelines for training, deployment, and monitoring., Hands-on experience with AWS services such as S3, EC2, and SageMaker, and infrastructure-as-code tools like Terraform., Excellent Python skills with a focus on maintainable and collaborative coding..

Key responsibilities:

  • Collaborate with researchers to bring new models from prototype to production.
  • Build and optimize pipelines for generative models and inference speed.
  • Develop and maintain automated MLOps workflows, including CI/CD and data pipelines.
  • Support other teams with ML expertise to accelerate AI-powered feature development.

Leonardo.Ai logo
Leonardo.Ai https://leonardo.ai/
11 - 50 Employees
See all jobs

Job description

Leonardo.Ai is building one of the world’s highestthroughput Generative AI platforms, enabling millions of users, from beginners to professionals, to create highquality images and videos with ease. Now part of the Canva family, we’re growing our global R&D team to deliver AI tools, products, and infrastructure that make creativity limitless for nearly a quarter of a billion users.

The Role:

We’re looking for a Senior Machine Learning Engineer to join our Platform Tribe, the team that provides the tools, infrastructure, and expertise that help every other product team at Leonardo move faster.

In this role, you’ll work at the intersection of Generative AI and MLOps. You’ll partner with researchers and engineers, be embedded in other teams for highimpact projects, and help design the pipelines, GPU infrastructure, and automation that transform cuttingedge research into productionready features, making creativity accessible to millions of people.

Your work will have a direct and visible impact: you’ll enable teams across Leonardo to deliver AIpowered features faster, more reliably, and at scale.

What You’ll Do:

Generative AI & Inference:

  • Collaborate with researchers to bring new models from prototype to production and ensure they deliver meaningful value.

      • Build and maintain production pipelines for diffusionbased and related generative models (e.g. LoRA, ControlNet).

      • Optimise inference for speed, reliability, and efficiency using techniques such as quantisation, distillation, caching, and multiGPU parallelism.

      • Tackle complex challenges like orchestrating multiGPU video pipelines while ensuring systems are intuitive and maintainable.

        • MLOps & Infrastructure:

          • Develop and maintain automated MLOps pipelines covering training, deployment, monitoring, and retraining.

          • Build CICD workflows for machine learning that make handovers from research to production seamless and safe.

          • Create scalable data pipelines and storage solutions to support highthroughput workloads.

          • Set up clear monitoring and alerting for model performance (e.g. Prometheus, Grafana, CloudWatch).

          • Design secure, reliable infrastructure on AWS (S3, EC2, SageMaker) using InfrastructureasCode tools like Terraform.

            • Platform Acceleration:

              • Be embedded in other teams, from Generations to Enterprise, to support highimpact projects requiring deep ML expertise.

              • Develop shared tooling, reusable workflows, and architecture patterns that help teams across Leonardo build and ship faster.

              • Promote knowledgesharing, best practices, and scalable solutions across the organisation.

                • Skills We Love:

                  • Generative AI Experience: Experience deploying diffusionbased or similar generative models into production and working with inference optimisation techniques.

                  • MLOps Expertise: Skilled in building automated, reliable workflows for training, deploying, and monitoring ML models at scale.

                  • Infrastructure & Cloud: Handson with AWS (S3, EC2, SageMaker), Kubernetes, Docker, and InfrastructureasCode (Terraform).

                  • Performance & Efficiency: Proficient in techniques such as quantisation, distillation, caching, and distributed inference.

                  • Data Foundations: Ability to design scalable data pipelines and storage solutions (SQLNoSQL).

                  • Engineering Craft: Strong Python skills and a focus on writing clear, maintainable, and collaborative code.

                  • Collaboration & Growth: Thrive in crossfunctional teams, value open feedback, and enjoy supporting others’ success while learning continuously.

                    • Our Culture:

                      • Inclusive Culture: We celebrate diversity and are committed to creating an inclusive environment where everyone feels valued and empowered. At Leonardo AI, your unique perspectives and experiences are welcomed and essential to our success.

                      • Flexible Work Environment: We understand the importance of worklife balance. Enjoy the flexibility to work remotely or from our vibrant offices. We have employees all over the world, ensuring you can thrive personally and professionally.

                      • Empowering Growth: Your development is our priority. We offer continuous learning opportunities and career growth tailored to your goals. You’ll be encouraged to grow and excel in your career at Leonardo AI.

                      • Impactful Work: Join us in shaping the future of AI. Youll work on innovative projects that have a meaningful impact, and your contributions will help drive advancements in AI creativity.

                        • Leonardo.Ai Benefits:

                          • A range of benefits to set you up for every success in and outside of work. Heres a taste of whats on offer:

                          • Impact the future of AI

                          • Reward package including equity we want our success to be yours too

                          • Inclusive parental leave policy that supports all parents & carers with 18 weeks paid leave

                          • An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more

                          • Flexible leave options that empower you to be a force for good, take time to recharge and support you personally, including remote working abroad

                          • Support with your professional development

                          • Fun and engaging company events, both virtual and inperson

                          • 20 days annual leave

                            • Please apply…

                              We’re realistic about experience, even if you haven’t worked at this scale before. We encourage anyone exposed to deploying generative models to production, working with techniques like LoRA or diffusion, and optimising inference across GPUs to apply for this role.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration

Machine Learning Engineer Related jobs