Match score not available

Data Integration Specialist

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Bachelor's degree in computer science, data science, information systems, or related field., 1-3 years of experience in data transformation or ETL processes., Proficiency in Python and experience with data parsing libraries., Strong problem-solving abilities and attention to detail..

Key responsabilities:

  • Lead projects to design and implement data extraction processes from various sources.
  • Develop and maintain parsers for diverse data sources including APIs and databases.
  • Identify and extract valuable features from complex raw data sets.
  • Collaborate with cross-functional teams to integrate data pipelines into broader systems.

FirstPrinciples logo
FirstPrinciples Non-profit Organization - Charity TPE http://www.firstprinciples.com/
2 - 10 Employees
See all jobs

Job description

About FirstPrinciples:
FirstPrinciples is a non-profit foundation dedicated to advancing our understanding of the universe’s fundamental principles through technological innovation, data-driven strategies and powerful communication. We are building an AI-powered research ecosystem to revolutionize how scientific knowledge is discovered, analyzed, and applied. At the core of this effort is FirstPrinciples AI, an intelligence engine designed to help researchers analyze vast scientific literature, identify meaningful connections, and generate new insights across disciplines. This next-generation research platform will bridge the gap between AI and scientific inquiry, equipping scientists, institutions, and policymakers with the tools to accelerate breakthroughs and make informed, data-driven decisions that shape the future of discovery.

Job Description:
FirstPrinciples is seeking a skilled and detail-oriented Data Integration Specialist to play a crucial role in our data pipeline development. In this position, you will lead projects to design and implement data extraction processes from various structured and unstructured sources, create robust parsing mechanisms, and develop sophisticated logic to extract meaningful features from raw data. Working in an agile environment, you'll iteratively refine extraction methods based on on-going feedback.

Key Responsibilities:

Project Leadership:

  • Investigate and evaluate new data sources.
  • Create comprehensive extraction plans and strategies for each data source.
  • Lead the full lifecycle of data extraction projects from planning to implementation.
  • Work closely with peers and managers  to iterate quickly and refine various approaches. 
  • Progressively scale extraction processes from small test batches to full implementation.

Data Source Integration:

  • Develop and maintain parsers for diverse data sources including APIs, databases, web content, PDFs, and scientific literature.
  • Create reliable ETL processes to ensure data quality and consistency, including LLM-based extraction pipelines.
  • Design and refine prompts for LLMs to extract structured information from unstructured data sources, including text, images, and other multimodal inputs.
  • Implement error handling and logging systems to maintain data pipeline reliability.

Feature Engineering:

  • Identify and extract valuable features from complex raw data sets.
  • Develop logic and algorithms to transform unstructured information into structured, analyzable formats.
  • Create reproducible processes for data normalization and standardization.

Pipeline Architecture:

  • Design scalable data transformation workflows.
  • Optimize parsing procedures for performance and accuracy.
  • Document data lineage and transformation processes for transparency.

Collaboration:

  • Work closely with cross-functional  teams to understand feature requirements.
  • Coordinate with engineering team to integrate data pipelines into broader systems.
  • Communicate technical concepts clearly to non-technical stakeholders.
  • Engage directly with third party data vendors to obtain technical specifications and integration details.
  • Demonstrate ability to work effectively both as part of a collaborative team and independently on self-directed tasks.

Qualifications:

  • Educational Background: Bachelor's degree in computer science, data science, information systems, or related field.
  • Experience: 1-3 years of experience working with data transformation, ETL processes, or similar roles.
  • Project Management Skills:
    • Experience managing small to medium-sized data projects from conception to completion.
    • Demonstrated ability to create technical plans and roadmaps for data extraction.
    • Experience working in agile environments with iterative development cycles.
  • Technical Skills:
    • Proficiency in Python and/or similar languages for data processing.
    • Experience with data parsing libraries and frameworks.
    • Knowledge of data storage systems and formats (SQL, JSON, etc.)
    • Familiarity with regular expressions and text processing techniques.
    • Experience with prompt engineering for LLMs and AI-assisted data extraction.
  • Analytical Skills: Strong problem-solving abilities and attention to detail.
  • Communication: Ability to document processes clearly and communicate technical concepts.
  • Bonus Skills:
    • Experience with natural language processing.
    • Knowledge of scientific literature and research data structures.
    • Familiarity with cloud-based data processing.

Application Process:

  • Interested candidates are invited to submit their resume, a cover letter detailing their qualifications and vision for the role, and references. Please include "Data Integration Specialist" in the cover letter.

Join us at FirstPrinciples and be a part of a transformative journey where science drives progress and unlocks the potential of humanity.

 

Required profile

Experience

Industry :
Non-profit Organization - Charity
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Problem Solving

Integration Specialist Related jobs