Key Facts

Remote From:

Anywhere

Full time

Mid-level (2-5 years)

English

Hard Skills

Other Skills

•
Teamwork
•
Lateral Communication
•
Analytical Thinking

Job description

We’re looking for a Machine Learning Engineer to own and scale our multilingual data pipeline—from sourcing and curation to evaluation and continuous improvement. You’ll work closely with researchers and infra engineers to ensure our models perform robustly across languages, scripts, and cultural contexts.

This role sits at the intersection of data, research, and production ML and is ideal for someone who cares deeply about data quality, linguistic diversity, and model generalization beyond English.

What You’ll Do

Design, build, and maintain large-scale multilingual datasets across high- and low-resource languages
Develop data pipelines for collection, cleaning, normalization, deduplication, and labeling
Implement quality filters using statistical, heuristic, and model-based methods
Work with researchers to define language coverage, benchmarks, and evaluation metrics
Analyze dataset bias, coverage gaps, and failure modes across regions and scripts
Support training, fine-tuning, and distillation workflows with high-quality multilingual data
Continuously iterate on datasets based on model performance and real-world usage

What We’re Looking For

3+ years of experience as an ML Engineer, Applied Scientist, or similar role
Strong experience working with multilingual or non-English datasets
Solid understanding of NLP fundamentals (tokenization, embeddings, language modeling)
Experience building scalable data pipelines (Python, Spark, Ray, or similar)
Familiarity with Unicode, scripts, tokenization challenges, and language-specific quirks
Comfort collaborating with researchers and translating research needs into production systems

Nice to Have

Experience with low-resource languages or multilingual benchmarks (e.g. FLORES, XTREME)
Exposure to LLM training, fine-tuning, or distillation
Linguistics background or experience working with native language experts
Contributions to open-source datasets or ML tooling
Experience with data quality evaluation at scale

Why Join

Real ownership over a core differentiator of the product
Work on models used globally, not just in English-speaking markets
Small, high-caliber team with deep ML and systems experience
Competitive compensation + meaningful equity at Series A stage

Ready to apply?

APPLY

Share ·

Machine Learning Engineer Related jobs

Worldwide Machine Learning Engineer

Senior Machine Learning Engineer I

30+ days ago

Parexel

Full time

Natural Language Processing (NLP)Machine LearningDeep LearningPython (Programming Language)Data Structures

Senior Staff Engineer, Machine Learning

30+ days ago

Nagarro

Full time

Machine LearningPython (Programming Language)KubernetesProof Of Concept (POC) DevelopmentRoot Cause Analysis

Senior Data Engineer- AI/ML (Remote)

30+ days ago

Ad Hoc LLC

Fixed term

MLOps (Machine Learning Operations)PyTorch (Machine Learning Library)Python (Programming Language)EmbeddingMLflow

Staff Software Engineer, Machine Learning Infrastructure

30+ days ago

Clarifai

Full time

Lifecycle ManagementScalabilityOpen Source DevelopmentDev TestingPerformance Improvement

Machine Learning Engineer II

30+ days ago

Parexel

Full time

Natural Language Processing (NLP)Machine LearningPython (Programming Language)Deep LearningData Structures

Other jobs at Featherless AI

Senior Software Engineer - API Gateway

30+ days ago

Featherless AI

Full time

Node.js (Javascript Library)Application Programming Interface (API)KubernetesObservabilityApplication Programming Interface (API)

Developer Relations Associate/Intern (Partnerships) Boston-Based

30+ days ago

Featherless AI

Internships
120 - 120K

JavaScript (Programming Language)API TestingPython (Programming Language)EcologyCloud Computing

Developer Relations (DevRel)

30+ days ago

Featherless AI

Full time
Senior (5-10 years)
250 - 250K

Large Language ModelingCommunity DesignDevelopment SupportCustomer Success ManagementBusiness Analysis

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.

Machine Learning Engineer — Multilingual Data

Key Facts

Hard Skills

Other Skills

Job description

What You’ll Do

What We’re Looking For

Nice to Have

Why Join

Machine Learning Engineer Related jobs

Senior Machine Learning Engineer I

Senior Staff Engineer, Machine Learning

Senior Data Engineer- AI/ML (Remote)

Staff Software Engineer, Machine Learning Infrastructure

Machine Learning Engineer II

Other jobs at Featherless AI

Senior Software Engineer - API Gateway

Developer Relations Associate/Intern (Partnerships) Boston-Based

Developer Relations (DevRel)

We help you get seen. Not ignored.

Auto-Apply

AI Match Feedback