Requirements
• 3+ years in data engineering roles in a production environment
• Advanced proficiency in Python and SQL for data engineering
• Up-to-date knowledge of and 1+ years of experience using Databricks for Lakehouse management
• Deep understanding of data modeling, data architecture, and data integration best practices
• Strong hands-on experience with Apache Spark
• Familiarity with data governance, security, and privacy principles
• Comfort using git or equivalent to manage the software development life cycle
• Exceptional ability to learn and use new software development techniques and tools
• Ability to manage multiple projects simultaneously
• High energy, humble team player with “get it done” attitude, seeking collaboration with colleagues
Preferred Qualifications
• Experience with the Azure cloud ecosystem
• Experience developing production-ready, real-time machine learning model serving pipelines
• Comfort developing in the Apache Spark Structured Streaming paradigm
• Experience working in a private equity-backed services company
• Experience deploying machine learning models with MLFlow or equivalent
• Experience developing CI/CD pipelines
Marsh McLennan Agency
Nasstar
KMC Solutions
Prosegur
Atlassian