Production Engineering Lead

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Over 5 years of experience managing production environments, such as SRE, DevOps, or Production Engineering., Deep knowledge of monitoring and observability tools like Datadog, Prometheus, Grafana, and ELK., Experience with incident response and on-call programs in a 24/7 environment., Fluency with cloud-native technologies including AWS, Kubernetes/ECS, containers, and CI/CD pipelines..

Key responsibilities:

  • Own the health, visibility, and reliability of production systems.
  • Build and scale monitoring infrastructure and incident response processes.
  • Collaborate with cross-functional teams to ensure system resilience and scalability.
  • Drive operational practices to enable rapid and safe deployment of features.

Riverside.fm logo
Riverside.fm Scaleup https://riverside.fm/
51 - 200 Employees
See all jobs

Job description

Description

For many of us, there’s that one podcast we never miss—and video content is a daily habit, whether for work or play. But few truly understand the effort behind the scenes. At Riverside, we do. That’s exactly why we built an AI-powered platform that helps creators, podcasters, marketers, and teams at brands like Netflix, Disney, Google, and Microsoft produce high-quality content with ease.


Our technology streamlines the entire content creation process—from idea to professional-grade output—without the need for expensive equipment or external production services. The secret? AI-driven tools that automate traditional production roles like editing, directing, and design, making studio-level content possible at the click of a button.


About the Engineering Team

We’re a team of smart, curious engineers building scalable, reliable systems that power content creation for millions. We work with modern web technologies, tackle real-world challenges in distributed systems, and keep things practical—no overengineering, just solid solutions. If you love solving tough problems, moving fast, and building tech that creators actually use, you’ll fit right in.


On your day to day

We’re now looking for a hands-on Production Engineering Lead to take ownership of the health, visibility, and reliability of our production systems. You’ll build and scale the monitoring infrastructure, incident response processes, and service ownership models that enable engineers to ship quickly and safely. Working closely with the VP of R&D and collaborating across platform and product teams, you’ll drive how we operate in production—ensuring our systems are resilient, observable, and ready to scale with us.


Requirements

What Will Make You Stand Out

  • 5+ years of experience managing production environments at scale (as SRE, DevOps, Production Engineer, or similar)
  • Deep knowledge of monitoring and observability stacks (e.g., Datadog, Prometheus, Grafana, ELK)
  • Experience running or building incident response and on-call programs in a 24/7 environment
  • Fluency with cloud-native stacks (AWS, Kubernetes/ECS, containers, CI/CD pipelines)
  • Excellent communication skills under pressure with a strong sense of ownership and accountability
  • Bonus: experience working in high-scale SaaS or regulated uptime environments (SOC 2, HIPAA, etc.)


Bottom line? If you wanna take part in transforming how people and businesses share their stories globally, Riverside’s your place. The work is challenging, the culture is fast-paced, and the people are exceptionally brilliant. And if that’s not enough, we guarantee that your ideas will genuinely make an impact.


Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Accountability
  • Communication

Related jobs