Logo for Xometry

Senior Data Scientist, LLM

Key Facts

Remote From: 
Category:  Data Scientist
Full time
Senior (5-10 years)
English

Other Skills

  • Analytical Skills
  • Problem Solving
  • Teamwork
  • Innovation

Roles & Responsibilities

  • A bachelor’s degree is required; an advanced degree (M.S. or PhD) in computer science, data science, machine learning, or a related field is highly preferred.
  • 5+ years of experience in data science and machine learning, with expertise in Visual Language Models or multimodal machine learning.
  • Strong experience with machine learning libraries and frameworks such as PyTorch, TensorFlow, or Hugging Face.
  • Proficiency in Python, including libraries like pandas, numpy, and scikit-learn.

Requirements:

  • Develop, fine-tune, and evaluate Visual Language Models (VLMs) to enhance document understanding, focusing on multimodal data such as text, images, and technical drawings.
  • Design and implement data preparation, cleaning, and augmentation processes tailored to multimodal model training, ensuring high-quality data pipelines for VLMs.
  • Leverage transfer learning and pre-trained models to accelerate model development and optimize performance on Xometry’s specific data.
  • Use cloud resources (e.g., Amazon Web Services) to scale training and fine-tuning processes for VLMs efficiently.

Job description

Xometry (NASDAQ: XMTR) powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry’s digital marketplace gives manufacturers the critical resources they need to grow their business while also making it easy for buyers at Fortune 1000 companies to tap into global manufacturing capacity.

Xometry is seeking a Senior Data Scientist to join our Generative AI team. The candidate will focus on training and fine-tuning Visual Language Models (VLMs) for multimodal document understanding. The ideal candidate will leverage their expertise in machine learning and computer vision to advance Xometry's capabilities in processing and extracting structured data from complex documents and images. This is a 1-year contract.  

Responsibilities:

  • Develop, fine-tune, and evaluate Visual Language Models (VLMs) to enhance document understanding, focusing on multimodal data such as text, images, and technical drawings.
  • Design and implement data preparation, cleaning, and augmentation processes tailored to multimodal model training, ensuring high-quality data pipelines for VLMs.
  • Leverage transfer learning and pre-trained models to accelerate model development and optimize performance on Xometry’s specific data.
  • Use cloud resources (e.g., Amazon Web Services) to scale training and fine-tuning processes for VLMs efficiently.
  • Collaborate with data engineering and machine learning operations (MLOps) teams to deploy VLMs into production and monitor their performance.
  • Interpret model outputs and improve model accuracy and robustness by applying data analysis and visualization tools (such as Python, Jupyter Notebooks, and SQL).
  • Experiment with and implement state-of-the-art model architectures, continuously optimizing VLM performance in a fast-paced, iterative environment.
  • Work within a team-oriented setting, participating in peer reviews, sharing insights, and contributing to an environment of continuous learning and improvement.

Qualifications:

  • A bachelor’s degree is required; an advanced degree (M.S. or PhD) in computer science, data science, machine learning, or a related field is highly preferred.
  • 5+ years of experience in data science and machine learning, with expertise in Visual Language Models or multimodal machine learning.
  • Strong experience with machine learning libraries and frameworks such as PyTorch, TensorFlow, or Hugging Face.
  • Proficiency in Python, including libraries like pandas, numpy, and scikit-learn.
  • Solid understanding of deep learning techniques and experience with transfer learning, fine-tuning, and model evaluation.
  • Experience with cloud platforms (e.g., AWS SageMaker) for model training and deployment.
  • Familiarity with data processing and visualization tools (SQL, Jupyter Notebooks, Looker, etc.) and basic database knowledge (e.g., Snowflake, MongoDB).
  • Excellent analytical and problem-solving skills, with a strong ability to work in an environment that values teamwork, innovation, and continuous learning.
  • Familiarity with computer vision tasks and frameworks, as well as experience with multimodal data, is a plus.

#LI-Remote

Xometry is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.

For US based roles: Xometry participates in E-Verify and after a job offer is accepted, will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.

Data Scientist Related jobs

Other jobs at Xometry

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.