Role overview

Qualifications

Extensive experience as a Staff Engineer or Senior/Backend/Platform Engineer with proven technical leadership.
Experience building backend services, distributed systems, developer platforms, or workload orchestration systems.
Strong understanding of async processing, queues, scheduling, retries, back pressure and failure handling.
Proficiency in Python, Go or similar languages with solid engineering fundamentals and ability to balance reliability, latency, scale and developer experience while communicating clearly.

Responsibilities

Build the core systems behind Runware’s serverless platform (workload execution, routing, scheduling, isolation and scaling) leveraging the inference engine.
Enable a simple developer experience to deploy models and run AI workloads without managing GPUs, servers, queues or infrastructure via a simple SDK.
Design and improve the control plane for serverless execution (APIs, workers, lifecycle management, retries and failure handling).
Collaborate with infrastructure and ML teams to improve workload startup time, GPU utilization, model warm-up, caching and placement, and build global observability for monitoring at scale.

Key facts

Remote from: United Kingdom
Full time
Mid-level (2-5 years)
Software Engineer
English

Other skills

Mentorship
Communication
Collaboration

About the company

Runware

Runware provides lightning-fast text-to-image and image-to-image generation speeds—0.3 seconds for SD 1.5 and 1.9 seconds for SDXL. All through one powerful API.By combining proprietary hardware with accelerated software and orchestration, Runware sets new industry benchmarks with 10x inference efficiency and significant cost savings compared to other providers. No image degradation, just supercharged Stable Diffusion, supporting the entire SD ecosystem, including technologies like ControlNet, IP-Adapters, InstantID, and more.Get complete flexibility in model selection, with more than 150k open source SD models included – and the option to bring any of your own privately trained models or fine-tunes.Runware is backed by some of the world’s leading investors, including: a16z Speedrun, Lunar Ventures, Zero Prime, Begin Capital, and more.

Company details

Company size2 - 10

Links

LinkedIn See all jobs

Your match analysis

See how your profile stacks up against this role.

We compared the job requirements to your profile to show where you're strong and where you fall short.

Job description

We’re looking for a Staff/Senior Software Engineer to technically lead and build Runware’s serverless platform. You’ll be part of a small team working closely with senior leadership to bring a new product line to market.

Runware is evolving beyond model APIs into a platform where developers can deploy models and run AI workloads without managing GPUs, servers or infrastructure. You’ll build the software layer that connects our proprietary high-performance sonic inference engine and turns complex infrastructure into a compelling developer experience.

This is a role for someone who enjoys technical leadership, hard platform problems and clean, simple systems. Your work will help enterprises, model labs and developers move from idea to production faster, then scale without needing to build and operate the infrastructure themselves.

What you’ll do

Build the core systems behind Runware’s serverless platform, including workload execution, routing, scheduling, isolation and scaling leveraging our inference engine
Make it simple for developers to deploy models and run AI workloads without managing GPUs, servers, queues or infrastructure through a simple SDK interface
Design and improve the control plane for serverless execution, including APIs, workers, lifecycle management, retries and failure handling
Work closely with our infrastructure and ML teams to improve workload startup time, GPU utilisation, model warm-up, caching and placement
Build observability that makes serverless workloads easy to monitor, debug and operate at scale globally
Lead the technical design, mentor other engineers and help define the engineering standards for a new product area

Requirements

Strong experience as a Staff Engineer, Senior Software Engineer, Backend Engineer, Platform Engineer or similar
Experience building backend services, distributed systems, developer platforms or workload orchestration systems
Strong understanding of async processing, queues, scheduling, retries, back pressure and failure handling
Comfortable working across APIs, control planes, workers, databases and observability systems
Strong engineering fundamentals in one or more backend languages such as Python, Go or similar
Good judgement around trade-offs between reliability, latency, scale, cost and developer experience
Clear communication, strong ownership and the ability to lead technical direction in a fast-moving environment

Nice to have

Experience building serverless platforms, job execution systems, container platforms or compute orchestration systems
Experience with GPU-backed workloads, AI/ML inference, model serving, batch processing or high-performance compute
Familiarity with technologies such as vLLM, TensorRT, Triton, Kubernetes, Nomad or Knative
Experience improving workload performance through batching, autoscaling, model warm-up, caching, request routing or queue management
Experience with multi-tenant isolation, sandboxing, quotas, rate limits, resource accounting or usage-based billing

Benefits

We’re a remote-first collective, meeting in person twice a year to plan, brainstorm, celebrate wins, and enjoy some face-to-face time. We have core hours for cooperative working and calls, but outside of that your calendar is yours. Work the hours that let you perform at your peak while also building a healthy life.

Our release cycles are fast and intense, but they’re followed by real downtime. After big pushes we expect the team to unplug, recharge, and come back ready & stronger than ever for the next leap.

Generous paid time off – vacation, sick days, public holidays
Meaningful stock options – share in the upside you create
Remote-first setup – work from home anywhere we can employ you
Flexible hours – own your schedule outside core collaboration blocks
Family leave – paid maternity, paternity, and caregiver time
Company retreats – twice-yearly gatherings in inspiring locations