Logo for baasi

Backend Engineer (Node.js/NestJS) — San Jose, Costa Rica

Roles & Responsibilities

  • Strong experience with Node.js and the NestJS framework.
  • Proficiency in PostgreSQL and Redis for persistence and caching.
  • Hands-on experience with Socket.IO or other WebSocket libraries.
  • Experience with secure configuration and secrets management (HashiCorp Vault preferred).

Requirements:

  • Build and maintain a Node.js-based proxy backend (NestJS) that accepts inference requests, schedules prompts, manages QKV cache, and exposes APIs to manage LoRA adapters, while integrating with authentication, RBAC, logging, and metrics.
  • Extend the dashboard backend to support dataset uploads, training job views, model management, inference usage, request history, and adapter selection; reuse Auth0, Stripe, and user management code; add endpoints for new UI flows.
  • Develop the core stack using NestJS with PostgreSQL, Redis, and HashiCorp Vault; use Socket.IO for real-time updates; ensure secure Stripe and Auth0 integration and collaborate on deployment pipelines (Proxmox, Docker, CI/CD).
  • Collaborate with C++/CUDA engineers on low-level runtime features, reason about scheduling, state management, and concurrency; maintain strong debugging and systems-thinking.

Job description

About Us

We are a stealth-mode startup building a new AI platform. Our mission is to make advanced language models deployable, customizable, and secure across diverse environments.

Role

We are seeking a Backend Engineer (Node.js/NestJS) to extend our platform using our existing codebase. You’ll build the proxy backend that interacts with our custom inference runtime and extend dashboards.

This role requires strong backend engineering skills, the ability to integrate existing systems, and comfort working closely with C++/CUDA engineers building low-level runtime features.

Responsibilities

Proxy Backend for Inference Runtime

  • Build and maintain a Node.js-based proxy backend that:
    • Accepts inference requests from the frontend.

    • Schedules and serializes prompts.

    • Manages QKV cache load/unload.

    • Provides APIs to manage LoRA adapters.

  • Integrate with authentication, RBAC, and logging already provided by the existing stack.

  • Expose metrics and logs for monitoring inference usage and performance.

Dashboards

  • Extend the existing Dashboard with Dataset upload, training job view, model management, inference usage, request history, and adapter selection.

  • Reuse auth, billing, and user management code (Auth0, Stripe).

  • Add necessary backend endpoints to support new UI flows.


Core Stack & Infrastructure

  • Develop using NestJS as the main backend framework.

  • Work with PostgreSQL, Redis, and HashiCorp Vault for persistence, caching, and secrets.

  • Use Socket.IO for real-time updates (job status, inference progress).

  • Ensure secure integration with Stripe (billing) and Auth0 (identity).

  • Collaborate with DevOps on deployment pipelines (Proxmox, Docker, CI/CD).

Requirements

  • Strong experience with Node.js and NestJS framework.

  • Proficiency in PostgreSQL and Redis for persistence and caching.

  • Hands-on experience with Socket.IO or other WebSocket libraries.

  • Experience with secure configuration and secrets management (HashiCorp Vault preferred).

  • Comfortable working with microservices and integrating with existing codebases.

  • Strong debugging and systems thinking — able to reason about scheduling, state management, and concurrency.

Nice to Have

  • Experience integrating with AI runtimes (gRPC/REST backends for inference).

  • Experience with RAG and MCP.

  • Experience with authentication/authorization frameworks (Auth0, JWT, RBAC).

  • Familiarity with Stripe API or similar billing systems.

  • Contributions to backend open-source projects.

  • Experience with WebRTC.

Why Join

  • Extend a proven SaaS foundation into a new AI runtime platform.

  • Competitive compensation, equity potential.

Back-End Engineer Related jobs

Other jobs at baasi

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.