Research Scientist Engineer – Data

Work set-up: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Strong programming skills in Python and PyTorch., Experience with large-scale datasets and multimodal data processing., Understanding of computer vision, audio processing, and natural language processing techniques., Preferred experience with interleaved multimodal data and multimodal models..

Key responsibilities:

  • Identify capability gaps and research solutions.
  • Design datasets and data ablation experiments to enhance model capabilities.
  • Develop evaluation frameworks and benchmarking methods for multimodal AI.
  • Create prototypes and demonstrations of new multimodal functionalities.

Luma AI logo
Luma AI https://lumalabs.ai/dream-machine
11 - 50 Employees
See all jobs

Job description

About the Role

Data is a fundamental layer in Luma that unlocks advanced capabilities in our foundation models. We tackle the fundamental data questions around how different modalities can be combined to enable new behaviors and capabilities, working on the openended challenges of what makes multimodal AI systems truly powerful and versatile.

Responsibilities
  • Identify capability gaps and research solutions

  • Design datasets and datamixture ablations to systematically improve model capabilities across vision, audio, and language

  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities

  • Create prototypes and demonstrations that showcase new multimodal capabilities

    • Experience
      • Strong programming skills in Python and PyTorch

      • Experience with largescale dataset

      • Experience with multimodal data processing pipeline

      • Understanding of computer vision, audio processing, and or natural language processing techniques

      • (Preferred) Expertise working with interleaved multimodal data

      • (Preferred) Handson experience with Vision Language Models, Audio Language Models, or generative video models

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Related jobs