Research engineerScientist Post Training

Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Master's or PhD in Computer Science, AI, or related fields with focus on deep learning and computer vision., Industry experience in large-scale deep learning model training, especially with generative AI architectures., Proficiency in deep learning frameworks like PyTorch, JAX, or TensorFlow., Strong skills in translating product requirements into technical solutions and enhancing visual content quality..

Key responsibilities:

  • Optimize and fine-tune image and video generative models to improve quality and performance.
  • Implement reinforcement learning techniques to align models with human preferences.
  • Collaborate with research and product teams to identify requirements and execute fine-tuning initiatives.
  • Develop and evaluate advanced post-training capabilities for generative models.

Luma AI logo
Luma AI https://lumalabs.ai/dream-machine
11 - 50 Employees
See all jobs

Job description

About the Role

At Luma, the Posttraining team is responsible for unlocking creative control in the world’s largest and most powerful pretrained multimodal models. The team works closely with the Fundamental Research team and the Product teams across Luma to train our image and video generative models improving their capabilities in the final step refining them to be better aligned with what our users expect.

What You’ll Do
  • Optimize Lumas image and video generative models through targeted finetuning to improve visual quality, instruction adherence, and overall performance metrics.

  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards.

  • Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Lumas platforms, and execute targeted finetuning initiatives to address performance gaps and enhance userfacing capabilities.

  • Conduct comprehensive sidebyside evaluations comparing model performance against leading market competitors, systematically analyzing the impact of posttraining techniques on downstream performance metrics and identifying areas for improvement.

  • Develop advanced posttraining capabilities for Luma’s video models including Camera control, Object & character Reference, Image & Video Editing, Human Performance & Motion Transfer Approaches.

  • Architect data processing pipelines for largescale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories.

  • Research and deploy cuttingedge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks.

  • Research emerging posttraining methodologies in generative AI, evaluate their applicability to Lumas product ecosystem, and integrate promising techniques into our Posttraining recipe.

    • Qualifications
      • Advanced degree (Masters or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies. Demonstrated ability to do independent research in Academic or Industry settings.

          • Substantial industry experience in largescale deep learning model training, with demonstrated expertise in at least one of Large Language Models, VisionLanguage Models, Diffusion Models, or comparable generative AI architectures.

          • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization.

          • Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content.

          • Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation.

          • Established track record of effective crossfunctional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Teamwork
  • Communication
  • Problem Solving

Related jobs