Logo for Runware

Staff Software Engineer - Serverless

Roles & Responsibilities

  • Extensive experience as a Staff Engineer or Senior/Backend/Platform Engineer with proven technical leadership.
  • Experience building backend services, distributed systems, developer platforms, or workload orchestration systems.
  • Strong understanding of async processing, queues, scheduling, retries, back pressure and failure handling.
  • Proficiency in Python, Go or similar languages with solid engineering fundamentals and ability to balance reliability, latency, scale and developer experience while communicating clearly.

Requirements:

  • Build the core systems behind Runware’s serverless platform (workload execution, routing, scheduling, isolation and scaling) leveraging the inference engine.
  • Enable a simple developer experience to deploy models and run AI workloads without managing GPUs, servers, queues or infrastructure via a simple SDK.
  • Design and improve the control plane for serverless execution (APIs, workers, lifecycle management, retries and failure handling).
  • Collaborate with infrastructure and ML teams to improve workload startup time, GPU utilization, model warm-up, caching and placement, and build global observability for monitoring at scale.

Job description

We’re looking for a Staff/Senior Software Engineer to technically lead and build Runware’s serverless platform. You’ll be part of a small team working closely with senior leadership to bring a new product line to market.

Runware is evolving beyond model APIs into a platform where developers can deploy models and run AI workloads without managing GPUs, servers or infrastructure. You’ll build the software layer that connects our proprietary high-performance sonic inference engine and turns complex infrastructure into a compelling developer experience.

This is a role for someone who enjoys technical leadership, hard platform problems and clean, simple systems. Your work will help enterprises, model labs and developers move from idea to production faster, then scale without needing to build and operate the infrastructure themselves.

What you’ll do

  • Build the core systems behind Runware’s serverless platform, including workload execution, routing, scheduling, isolation and scaling leveraging our inference engine
  • Make it simple for developers to deploy models and run AI workloads without managing GPUs, servers, queues or infrastructure through a simple SDK interface
  • Design and improve the control plane for serverless execution, including APIs, workers, lifecycle management, retries and failure handling
  • Work closely with our infrastructure and ML teams to improve workload startup time, GPU utilisation, model warm-up, caching and placement
  • Build observability that makes serverless workloads easy to monitor, debug and operate at scale globally
  • Lead the technical design, mentor other engineers and help define the engineering standards for a new product area

Requirements

  • Strong experience as a Staff Engineer, Senior Software Engineer, Backend Engineer, Platform Engineer or similar
  • Experience building backend services, distributed systems, developer platforms or workload orchestration systems
  • Strong understanding of async processing, queues, scheduling, retries, back pressure and failure handling
  • Comfortable working across APIs, control planes, workers, databases and observability systems
  • Strong engineering fundamentals in one or more backend languages such as Python, Go or similar
  • Good judgement around trade-offs between reliability, latency, scale, cost and developer experience
  • Clear communication, strong ownership and the ability to lead technical direction in a fast-moving environment

Nice to have

  • Experience building serverless platforms, job execution systems, container platforms or compute orchestration systems
  • Experience with GPU-backed workloads, AI/ML inference, model serving, batch processing or high-performance compute
  • Familiarity with technologies such as vLLM, TensorRT, Triton, Kubernetes, Nomad or Knative
  • Experience improving workload performance through batching, autoscaling, model warm-up, caching, request routing or queue management
  • Experience with multi-tenant isolation, sandboxing, quotas, rate limits, resource accounting or usage-based billing

Benefits

We’re a remote-first collective, meeting in person twice a year to plan, brainstorm, celebrate wins, and enjoy some face-to-face time. We have core hours for cooperative working and calls, but outside of that your calendar is yours. Work the hours that let you perform at your peak while also building a healthy life.

Our release cycles are fast and intense, but they’re followed by real downtime. After big pushes we expect the team to unplug, recharge, and come back ready & stronger than ever for the next leap.

  • Generous paid time off – vacation, sick days, public holidays
  • Meaningful stock options – share in the upside you create
  • Remote-first setup – work from home anywhere we can employ you
  • Flexible hours – own your schedule outside core collaboration blocks
  • Family leave – paid maternity, paternity, and caregiver time
  • Company retreats – twice-yearly gatherings in inspiring locations

Software Engineer Related jobs

Other jobs at Runware

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.