
Senior Software Engineer - API Gateway

Requirements

  • Experience with popular open LLMs and building applications with LLMs
  • Experience with web technologies (REST, WebSockets, DNS, networking, observability)
  • Experience with core stack components (Kubernetes, Node.js, MikroORM, Fastify, Redis, MongoDB, Python, Elastic Cloud, Cloudflare, Sentry)
  • Ability to debug complex issues across a wide stack and build instrumentation as needed

Roles & Responsibilities

  • Develop features and fix bugs to meet client needs, resolve user issues, and onboard new models
  • Improve API reliability through instrumentation, monitoring, and right-sizing infrastructure
  • Respond to availability incidents and triage issues related to inference quality and reliability
  • Manage the infrastructure on which the API gateway runs

Job description

About the Role

Featherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.

We’re hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, which is responsible for

  • authentication and inference access for all models

  • subscription management and subscription entitlement (e.g. context-length, concurrency limits)

  • and providing the necessary API surface for applications and builders

The API gateway is constantly evolving in response to the unending stream of new models, modalities, clients, and inference load.

What you'll do

The API gateway is managed by the Platform Team, which aims to make Featherless the best place to find and use models. As a member of the Platform Team, you will

  • undertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new models

  • improve the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure)

  • respond to availability incidents

  • triage and resolve issues of inference quality and reliability

  • manage the infrastructure on which our gateway runs

What you'll bring

  • first-hand experience of the users we’re building for (familiarity with popular open LLMs, common clients, and experience building with LLMs)

  • experience with the technologies and paradigms of the web (REST, WebSockets, DNS, networking, OpenTelemetry)

  • experience with significant components of our stack (Kubernetes, Node.js, MikroORM, Fastify, Redis, MongoDB, Python, Elastic Cloud, Cloudflare, Sentry, OpenTelemetry)

  • ability to debug complex issues across a wide stack and build instrumentation as necessary

  • desire to work collaboratively as part of a skilled team

  • alignment with team and company values, including

    • bias to action

    • responsiveness to users (bug-fixes over features)

    • instinct to iterate

    • belief that "done" means proven by usage data

Other

This team operates on Eastern Time. We are remote, with a preference for hiring in Toronto, Canada.
