Logo for Inworld AI

Senior / Lead Research Scientist - Germany

Roles & Responsibilities

  • Foundation models: training, new architectures, RL, reward modeling, scaling
  • Evaluation: benchmarks, eval loops, quality measurement, LLM-as-judge, and failure analysis
  • PhD in ML/NLP or equivalent practical experience with evidence of research impact (publications, open-source work, or notable projects)
  • Full-stack research ownership: ability to frame questions, run experiments, write papers, and ship results

Requirements:

  • Frame research questions, design and run experiments, analyze results, and ship outcomes (papers, models, or systems)
  • Own end-to-end evaluation as a first-class research product: build benchmarks, eval loops, metrics, and failure analysis
  • Collaborate with cross-functional teams, including US leadership and engineering, and communicate clearly in English
  • Produce visible impact by sharing work that moves the field forward and delivering results with a bias for impact over purely academic output

Job description

About Inworld

Inworld is a product-oriented research lab of top AI researchers and engineers, developing best-in-class realtime multimodal models and the only realtime orchestration platform optimized for thousands of queries per second.

We’ve raised more than $125M from Lightspeed, Section 32, Kleiner Perkins, Microsoft’s M12 venture fund, Founders Fund, Meta and Stanford, among others. Our technology has powered experiences from companies such as NVIDIA, Microsoft Xbox, Niantic, Logitech Streamlabs, Wishroll, Little Umbrella and Bible Chat. We’ve also been recognized by CB Insights as one of the 100 most promising AI companies globally and have been named one of LinkedIn's Top 10 Startups in the USA.

Who We're Looking For

A year ago, reliably working agentic systems barely existed. Nobody has a decade of experience here. So we're not screening for a resume template — we're looking for strong people from varied backgrounds who learn fast, thrive in ambiguity, and can show us what they've built, broken, and understood.

Experience We Find Useful

You don't need all of this. But you need enough to make a case.

  • Foundation models: training, new architectures, RL, reward modeling, scaling

  • Evaluation: benchmarks, eval loops, quality measurement, LLM-as-judge, failure analysis

  • Frontier topics: multimodal models, agents, tool use, test-time compute, world models

  • Published research at ICML, ICLR, NeurIPS, EMNLP, ACL, or AAAI

  • PhD in ML/NLP — or equivalent practical experience you can point to

  • Public work: non-trivial AI side projects, interdisciplinary experiments, open-source contributions

  • Full-stack research ownership: you frame the question, run the experiments, write the paper, ship the result

If you learned through building, competitions, or collaborations outside academia — that counts. We care about evidence, not credentials.

Who Thrives Here

  • Pathfinders: You don’t need a roadmap to start walking; you’re comfortable picking a direction and building the map as you go.

  • Full-Cycle Researchers: You believe research isn't finished until it’s shipped. You have a bias for impact over purely academic output.

  • First-Principles Engineers: You don't just ship code; you obsess over the why. You’re the first to question an approach if you think there’s a better way to solve the core problem.

  • Mission Owners: You aren't satisfied with "the PM said so." You thrive on deep context and want to understand the fundamental logic behind every decision we make.

What Working Here Is Like

We hand you unclear problems and expect you to make them clear. We value researchers who say "I don't know yet" — and then design the experiment that finds out. We treat evaluation as a first-class research product, not a box to check before launch. Impact comes before publications though we support sharing work that moves the field forward. Your work should be visible. Flat structure, fast iterations, minimal process theater.

We don't need a cover letter. A link to something you've built tells us more.


Professional fluency in English (written and spoken) is required, as you will be collaborating daily with our US-based leadership and engineering teams.

Related jobs

Other jobs at Inworld AI

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.