Data Scientist - Language Model Fine-Tuning Specialist 

extra parental leave
Work set-up: 
Full Remote
Contract: 
Work from: 

Orion Innovation logo
Orion Innovation XLarge https://www.orioninc.com/
5001 - 10000 Employees
See all jobs

Job description

Orion Innovation is a premier, award-winning, global business and technology services firm.  Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity.  We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.

Data Scientist - Language Model Fine-Tuning Specialist 

Job Description: 

We are seeking a highly skilled Data Scientist with expertise in fine-tuning language models using proprietary company data. The ideal candidate will have a strong background in data preparation, model fine-tuning, and benchmarking, as well as staying updated on the latest advancements in the field. 

Key Responsibilities: 

  • Develop and prepare datasets for language model fine-tuning using proprietary data.
  • Fine-tune language models using techniques such as LoRA, QLoRA, and GRPO.
  • Benchmark model performance and analyze results to ensure optimal outcomes. 
  • Stay current with the latest research and innovations in language modeling, including relevant arXiv papers. 
  • Collaborate with cross-functional teams to integrate fine-tuned models into products and services. 

Requirements: 

  • Proven experience in fine-tuning language models with proprietary datasets. 
  • Proficiency in advanced fine-tuning techniques and methodologies.
  • Strong analytical skills for benchmarking model performance. 
  • Up-to-date knowledge of recent research papers and developments in the field. 
  • Excellent communication skills and ability to collaborate with technical and non-technical stakeholders. 

Required Tools and Technologies: 

  • Proficiency in Python programming.
  • Experience with machine learning frameworks such as TensorFlow or PyTorch. 
  • Familiarity with Hugging Face Transformers for language model manipulation. 
  • Knowledge of data processing and manipulation libraries, such as Pandas and NumPy. 
  • Experience with version control systems like Git. 

Preferred Qualifications: 

  • Experience with Relevance-Augmented Generation (RAG) and GraphRAG frameworks.
  • Experience with reasoning models and algorithm development. 
  • Familiarity with additional machine learning tools and libraries. 
  • Advanced degree in Computer Science, Data Science, or related field.

 

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.

 

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Analytical Skills

Data Scientist Related jobs