Data Engineer

Work set-up: Full Remote
Contract: 
Experience: Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

  • Proven experience in data engineering and building data systems.
  • Strong programming skills in Java or Python.
  • Experience with data warehouses, data lakes, and data orchestration tools.
  • Educational background in a relevant technical field is preferred.

Key responsibilities:

  • Design and implement scalable data orchestration and lineage processes.
  • Optimize data storage and retrieval to improve efficiency and reduce costs.
  • Collaborate with the engineering team to build various parts of the Protege platform.
  • Ensure data validation, transformation, and compliance standards are met.

Protege http://www.withprotege.ai
2 - 10 Employees

Job description

Company Overview:

We are building Protege to solve the biggest unmet need in AI: getting access to the right training data. The process today is time-intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data.

Solving AI's data problem is a generational opportunity. The company that succeeds will be one of the largest in AI, and in tech.

We're hiring multiple data engineers, from senior to staff level, to join the team.

Key Responsibilities and Scope

  • Work with the engineering team to design and implement scalable, automatable and robust data orchestration and lineage processes, incorporating high standards of data validation, transformation, and compliance.

  • Optimize data storage and retrieval costs, making data operations more efficient and cost-effective.

  • Work as a generalist engineer to help build other areas of the Protege platform.

About You

  • You are curious, tenacious, and proactive.

  • You are not bothered by ambiguity and enjoy finding patterns in complex environments.

  • Proven experience in data engineering, with a solid track record of building data systems and processes.

  • Strong technical proficiency with data engineering tools, data warehouses and data lakes.

  • Strong programming skills in Java or Python.

  • Excellent problem-solving skills and adaptability in a dynamic and evolving tech landscape.

  • You are excited to work at a company that moves and transforms large volumes of data.

Bonus if you have these attributes

  • Experience with either Snowflake or Databricks

  • Experience with AWS

  • Prior startup experience

  • Familiarity with fine-tuning AI models

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Adaptability
  • Problem Solving
