
Machine Learning Engineer — Distillation

Requirements

  • Strong background in machine learning or deep learning
  • Hands-on experience with model distillation (LLMs or other neural networks)
  • Solid understanding of training dynamics, loss functions, and optimization
  • Experience with PyTorch (or JAX) and modern ML tooling

Roles & Responsibilities

  • Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.)
  • Distill large foundation models into smaller, faster, and cheaper models for inference
  • Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs
  • Collaborate with research to translate new distillation ideas into production-ready code

Job description

About the Role

We’re looking for a Machine Learning Engineer focused on model distillation to help us build smaller, faster, and more efficient models without sacrificing quality. You’ll work at the intersection of research and production—taking cutting-edge techniques and turning them into systems that scale.

This is a hands-on role with real ownership: you’ll design distillation pipelines, run large-scale experiments, and ship models used in production.

What You’ll Do

  • Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.)

  • Distill large foundation models into smaller, faster, and cheaper models for inference

  • Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs

  • Collaborate with research to translate new distillation ideas into production-ready code

  • Optimize training and inference performance (memory, throughput, latency)

  • Contribute to internal tooling, evaluation frameworks, and experiment tracking

  • (Optional) Contribute back to open-source models, tooling, or research
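To give a flavor of the kind of work involved, a classic teacher–student pipeline combines a soft-target loss (KL divergence between temperature-scaled teacher and student distributions) with a standard hard-label loss. The sketch below is purely illustrative, not code from Featherless AI; the function name, temperature `T`, and mixing weight `alpha` are assumptions chosen for clarity.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Hinton-style knowledge distillation loss (illustrative sketch)."""
    # Soft-target term: KL divergence between temperature-softened
    # teacher and student distributions, scaled by T^2 so gradient
    # magnitudes stay comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard-target term: ordinary cross-entropy against ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Example: a batch of 4 examples over a 10-class output space.
student_logits = torch.randn(4, 10, requires_grad=True)
teacher_logits = torch.randn(4, 10)
labels = torch.randint(0, 10, (4,))
loss = distillation_loss(student_logits, teacher_logits, labels)
```

In practice pipelines like this are extended with sequence-level objectives, multi-teacher ensembles, or intermediate-layer matching, but the soft/hard mixture above is the common starting point.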

What We’re Looking For

  • Strong background in machine learning or deep learning

  • Hands-on experience with model distillation (LLMs or other neural networks)

  • Solid understanding of training dynamics, loss functions, and optimization

  • Experience with PyTorch (or JAX) and modern ML tooling

  • Comfort running experiments on multi-GPU or distributed setups

  • Ability to reason about model quality vs. performance tradeoffs

  • Pragmatic mindset: you care about shipping, not just papers

Nice to Have

  • Experience distilling LLMs or large sequence models

  • Experience with inference optimization (quantization, pruning, kernels, etc.)

  • Familiarity with evaluation for language models

  • Open-source contributions or research publications

  • Experience in early-stage or fast-moving startups

Why Join

  • Work on core model quality and cost efficiency—not side projects

  • High ownership and direct impact on product and roadmap

  • Small, senior team with strong research + engineering culture

  • Competitive compensation + meaningful equity

  • Remote-friendly, async-first environment
