Inference Engineer

Remote: 
Full Remote
Contract: 

Offer summary

Qualifications:

Advanced degree in Computer Science or related field, Strong background in Model Inference, Experience scaling AI models in production, Python expertise and interest in systems programming.

Key responsabilities:

  • Architect and improve cloud-based deployment systems
  • Create efficient solutions for concurrent model serving
  • Lead the transition from Open Source and third-party tools to custom frameworks
  • Drive innovation in model compression and performance

techire ai logo
techire ai http://www.techire.ai
2 - 10 Employees
See all jobs

Job description

Want to scale AI Agents through innovative ML infrastructure?


A pioneering AI company is looking for an experienced Engineer to revolutionize how their agent technology is deployed and served. While others follow conventional paths, they're creating new approaches to agent-specific serving challenges.


Short term, you'll focus on cloud deployment and performance optimization, enhancing their current infrastructure. Long term, you'll help design and build proprietary frameworks that challenge the status quo of model serving.


What You'll Do:

  • Architect and improve cloud-based deployment systems
  • Create efficient solutions for concurrent model serving
  • Lead the transition from Open Source and third-party tools to custom frameworks
  • Drive innovation in model compression and performance
  • Design new approaches to large-scale model deployment


You Should Have:

  • Advanced degree in Computer Science or related field
  • Strong background in Model Inference
  • Experience scaling AI models in production
  • Python expertise and interest in systems programming
  • Track record of solving complex deployment challenges


Bonus Points For:

  • LLM model serving such as vLLM or similar
  • Building custom serving solutions
  • Knowledge of GPU optimization
  • Experience with large language models
  • Background in high-performance computing


You'll join a world-class team pushing the boundaries of what's possible with AI agents. Work remotely (EU/US East Coast) or hybrid from their London office.


This role is perfect for someone who:

  • Enjoys tackling unprecedented technical challenges
  • Thinks creatively about infrastructure problems
  • Values practical solutions while innovating for the future
  • Thrives in fast-paced, research-driven environments


Compensation is highly competitive (£125,000-£200,000 basic salary), negotiable based on experience.


Ready to help define the future of AI infrastructure? Contact Marc at Techire AI to learn more. All applications will receive a response.

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Creativity
  • Teamwork
  • Innovation
  • Problem Solving

Related jobs