Logo for TechBiz Global

Lead AI Application Engineer (Infrastructure & LLMOps)

Key Facts

Remote From: 
Full time
Senior (5-10 years)
English

Other Skills

  • •
    Problem Solving
  • •
    Teamwork
  • •
    Communication

Requirements:

  • Build and maintain a multi-tenant AI Platform that supports the full ML lifecycle across cloud and on-premises environments.
  • Implement LLMOps/MLOps best practices, including automated deployment pipelines for models.
  • Develop and expose 'as-a-service' capabilities: Inference-as-a-Service, Embeddings-as-a-Service, and RAG-as-a-Service.
  • Own the deployment and scaling of Vector Databases and Feature Stores.

Job description

At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio.

We are currently looking for a dedicated Lead AI Aplication Engineer to join one of our clients' teams. If you're looking for an exciting opportunity to grow in an innovative environment, this could be the perfect fit for you.

Key Responsibilities:

  1. Build & Run the Shared AI Platform

  • Architect and maintain a multi-tenant AI Platform that supports the full ML lifecycle across cloud and on-premises environments.

  • Ensure high availability, low latency, and cost-efficiency for all shared AI resources.

  • Implement LLMOps/MLOps best practices, including automated deployment pipelines for models.

2. Curate the AI Services Catalogue

  • Develop and expose "as-a-service" capabilities: Inference-as-a-Service, Embeddings-as-a-Service, and RAG-as-a-Service.

  • Standardize how squads interact with LLMs, providing unified APIs and abstraction layers to prevent vendor lock-in.

3. Manage AI Data Infrastructure

  • Own the deployment and scaling of Vector Databases (e.g., Pinecone, Milvus, Weaviate) and Feature Stores (e.g., Feast, Tecton, Hopsworks).

  • Optimize data retrieval patterns to support real-time AI applications and agentic workflows.

  • Oversee Model Hosting environments, utilizing Kubernetes (K8s) and GPU orchestration to manage compute resources efficiently.

4. Enable Developer Self-Service

  • Build and maintain a Self-Service Portal or CLI that allows product squads to provision AI environments, models, and data stores independently.

  • Reduce "Time-to-Inference" for new features by providing pre-configured templates and blueprints.

  • Conduct internal workshops and provide documentation to empower squads to use the platform effectively.

AI Operations (AI Ops) Engineer Related jobs

Other jobs at TechBiz Global

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.