Expertise in machine learning optimization, Experience with Pytorch and Kubernetes, Strong understanding of model deployment, Background in speech and audio processing.
Key responsabilities:
Collaborate with multidisciplinary teams on models
Optimize machine learning infrastructure for scaling
Report This Job
Help us maintain the quality of our job listings. If you find any issues with this job post, please let us know.
Select the reason you're reporting this job:
Our mission is to unlock the potential of human creativity—by giving a million creative artists the opportunity to live off their art and billions of fans the opportunity to enjoy and be inspired by it.
Spotify transformed music listening forever when it launched in Sweden in 2008. Discover, manage and share over 70m tracks for free, or upgrade to Spotify Premium to access exclusive features including offline mode, improved sound quality, and an ad-free music listening experience.
Today, Spotify is the most popular global audio streaming service with 365m users, including 165m subscribers across 178 markets. We are the largest driver of revenue to the music business today.
The Personalization team makes deciding what to play next easier and more enjoyable for every listener. From Discover Weekly to AI DJ, we’re behind some of Spotify’s most-loved features. We built them by understanding the world of music and podcasts better than anyone else. Join us and you’ll keep millions of users listening by making great recommendations – and providing valuable context – to each and every one of them.
Do you want to help Spotify invent new personalized sessions with generative voice AI to delight users? In this role, you’ll work with Spotify’s Text-to-Speech (TTS) team, Speak, to create generated voice audio that enriches users’ experience of music and podcast recommendations.
What You'll Do
Collaborate with a multidisciplinary team to optimize machine learning models for production use cases, ensuring they are highly efficient and scalable
Design and build efficient serving infrastructure for machine learning models that supports large-scale deployments across different regions
Optimize machine learning models in Pytorch or other libraries for real-time serving and production applications
Lead the effort to transition machine learning models from research and development into production, working closely with researchers and machine learning engineers
Build and maintain scalable Kubernetes clusters to manage and deploy machine learning models, ensuring reliability and performance
Implement and monitor logging metrics, diagnose infrastructure issues, and contribute to an on-call schedule to maintain production stability
Influence the technical design, architecture, and infrastructure decisions to support new and diverse machine learning architectures
Collaborate with stakeholders to drive forward initiatives related to the serving and optimization of machine learning models at scale.
Who You Are
You have a passion for speech, audio and/or generative machine learning
You have world-class expertise in optimizing machine learning models for production use cases, and extensive experience with machine learning frameworks like Pytorch
You are experienced in building efficient, scalable infrastructure to serve machine learning models, and managing Kubernetes clusters in multi-region setups
You have a strong understanding of how to bring machine learning models from research to production and are comfortable working with innovative, cutting-edge architectures
You are familiar with writing logging metrics and diagnosing production issues, and are willing to take part in an on-call schedule to maintain uptime and performance
You have a collaborative mindset, enjoy working closely with research scientists, machine learning engineers, and backend engineers to innovate and improve model deployment pipelines
You thrive in environments that require solving complex infrastructure challenges, including scaling and performance optimization
Experience with low-level machine learning libraries (e.g., Triton, CUDA) and performance optimization for custom components is a bonus
Where You'll Be
We offer you the flexibility to work where you work best! For this role, you can be within the European region as long as we have a work location.
This team operates within the GMT/CET time zone for collaboration.
Excluding France due to on-call restrictions.
Required profile
Experience
Level of experience:Senior (5-10 years)
Industry :
Music
Spoken language(s):
English
Check out the description to know which languages are mandatory.