Match score not available

Research engineerScientist Post Training

Work set-up:

Full Remote

Contract:

Experience:

Mid-level (2-5 years)

Work from:

California (USA)

Offer summary

Qualifications:

Master's or PhD in Computer Science, AI, or related fields with focus on deep learning and computer vision., Industry experience in large-scale deep learning model training, especially with generative AI architectures., Proficiency in deep learning frameworks like PyTorch, JAX, or TensorFlow., Strong skills in translating product requirements into technical solutions and enhancing visual content quality..

Key responsibilities:

Optimize and fine-tune image and video generative models to improve quality and performance.
Implement reinforcement learning techniques to align models with human preferences.
Collaborate with research and product teams to identify requirements and execute fine-tuning initiatives.
Develop and evaluate advanced post-training capabilities for generative models.

Luma AI https://lumalabs.ai/dream-machine

11 - 50 Employees

Job description

About the Role
At Luma, the Posttraining team is responsible for unlocking creative control in the world’s largest and most powerful pretrained multimodal models. The team works closely with the Fundamental Research team and the Product teams across Luma to train our image and video generative models improving their capabilities in the final step refining them to be better aligned with what our users expect.
What You’ll Do
Optimize Lumas image and video generative models through targeted finetuning to improve visual quality, instruction adherence, and overall performance metrics.
Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards.
Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Lumas platforms, and execute targeted finetuning initiatives to address performance gaps and enhance userfacing capabilities.
Conduct comprehensive sidebyside evaluations comparing model performance against leading market competitors, systematically analyzing the impact of posttraining techniques on downstream performance metrics and identifying areas for improvement.
Develop advanced posttraining capabilities for Luma’s video models including Camera control, Object & character Reference, Image & Video Editing, Human Performance & Motion Transfer Approaches.
Architect data processing pipelines for largescale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories.
Research and deploy cuttingedge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks.
Research emerging posttraining methodologies in generative AI, evaluate their applicability to Lumas product ecosystem, and integrate promising techniques into our Posttraining recipe.
Qualifications
Advanced degree (Masters or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies. Demonstrated ability to do independent research in Academic or Industry settings.
Substantial industry experience in largescale deep learning model training, with demonstrated expertise in at least one of Large Language Models, VisionLanguage Models, Diffusion Models, or comparable generative AI architectures.
Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization.
Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content.
Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation.
Established track record of effective crossfunctional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists.

Required profile

Experience

Level of experience: Mid-level (2-5 years)

Spoken language(s):

English

Check out the description to know which languages are mandatory.

Hard Skills

Generative Artificial Intelligence System Optimization Computer Vision Deep Learning Large Language Modeling Diffusion Process JAX-WS TensorFlow Reinforcement Learning PyTorch (Machine Learning Library)Data Architecture Digital Prototyping

Other Skills

Teamwork
Communication
Problem Solving

Are you interested?

Share

Related jobs

Research Accountant

Research Accountant

Research Accountant

30+ days ago

Drexel University's Westphal College of Media Arts & Design

Full time

Financial AnalysisSecurities ResearchU.S. Regulatory ComplianceInvestment Account Management

Enterprise Technical Account Manager (MSP)

Enterprise Technical Account Manager (MSP)

Enterprise Technical Account Manager (MSP)

30+ days ago

Managed Solution

Full time

System AdministrationAccount ManagementCustomer Relationship Management

Advance BOARD Developer in Dallas, Texas

Advance BOARD Developer in Dallas, Texas

Advance BOARD Developer in Dallas, Texas

3 days ago

SRI Tech Solutions Inc.

Full time

Data IntegrationSmart BoardAgile Software Development

Administrador/a Senior Linux/Ansible/Kubernetes

Administrador/a Senior Linux/Ansible/Kubernetes

Administrador/a Senior Linux/Ansible/Kubernetes

30+ days ago

Inetum

Full time

AnsibleCloud ComputingKubernetesLinux

Implementation Project Manager

Implementation Project Manager

Implementation Project Manager

11 days ago

Entrata

Full time

Risk ManagementProject ManagementClient Server Models