Logo for RippedBoxStation

Healthcare Data Engineer (AA - 11112025 - PTHDE)

Roles & Responsibilities

  • Proven experience as a Data Engineer
  • Strong expertise in ETL/ELT processes
  • Hands-on experience with AWS or Google Cloud
  • Solid understanding of healthcare data standards
  • Proficiency in Python, PySpark, and SQL

Requirements:

  • Design and manage ETL/ELT data pipelines
  • Extract and consolidate data from various sources
  • Cleanse and standardize raw healthcare data
  • Utilize AWS or Google Cloud services
  • Implement data security and compliance protocols
  • Collaborate with data scientists and stakeholders

Job description

Position: Healthcare Data Engineer

Number of hours: 20 hours/week
Schedule: UK Time Zone - 9AM - 5PM


Key Responsibilities:

  • ETL Pipeline Development: Design, implement, and manage scalable and reliable ETL/ELT data pipelines to process diverse healthcare data from various sources.

  • Data Integration: Extract and consolidate data from disparate sources, including electronic health records (EHRs), real-world datasets, pharmacy sell-out data, and disease-specific surveys.

  • Data Transformation & Cleansing: Cleanse, validate, and standardize raw healthcare data. Map data to standard medical terminologies (e.g., ICD-10, SNOMED CT, LOINC), remove duplicates, and resolve inconsistencies to ensure high data quality.

  • Cloud Management: Utilize AWS or Google Cloud services to build, deploy, and monitor data processing solutions and storage infrastructure.

  • Compliance and Security: Implement and maintain strict data security, privacy, and governance protocols to ensure compliance with regulations such as HIPAA and GDPR, including encryption, access control, and audit trails.

  • Collaboration: Work closely with data scientists, analysts, and business stakeholders to understand data requirements and ensure the data architecture supports advanced analytics and business intelligence needs.

Qualifications:

  • Proven experience as a Data Engineer, preferably within healthcare or life sciences.

  • Strong expertise in designing and managing ETL/ELT processes and data pipelines.

  • Hands-on experience with at least one major cloud platform:
    • AWS: Proficiency with services such as AWS Glue, Amazon S3, AWS Lambda, Amazon Redshift, and AWS HealthLake.

    • Google Cloud: Proficiency with services like Cloud Healthcare API, BigQuery, Dataflow, and Dataproc.

  • Solid understanding of healthcare data standards (e.g. HL7, FHIR, DICOM) and data interoperability challenges.

  • Proficiency in programming languages such as Python and PySpark, and experience with SQL.

  • Knowledge of data security best practices and experience implementing measures to protect sensitive health information.

Data Engineer Related jobs

Other jobs at RippedBoxStation

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.