Logo for New York Blood Center

Senior Data Engineer

Roles & Responsibilities

  • Master's degree in Data Science, Computer Science, Information Technology, or a closely related field with 3 years of experience in data engineering or a related role; or Bachelor's degree in the above fields with 5 years of experience in the position offered or a related role.
  • At least 2 years of experience in data engineering, including designing, implementing, and optimizing databases and data pipelines; experience with SQL Server, Oracle, or other RDBMS.
  • Proficiency in SQL and Python for advanced data manipulation and analytics; experience in data modeling and architecture for analytics and transactional systems within large-scale environments.
  • Experience with at least one major cloud data platform (Azure, AWS, or Google Cloud) with practical application in data engineering projects; Spark and cloud ETL tools like Databricks focusing on scalability and real-time processing; developing data models for BI and analytics initiatives.

Requirements:

  • Design, implement, and optimize robust and scalable data pipelines using SQL, Python, and cloud-based ETL tools such as Databricks; ensure efficient data flow and processing to support large-scale data handling; develop and refine data models and integrate with the broader data architecture, including Big Data frameworks like Spark.
  • Lead and contribute to the data architecture strategy, guiding decisions on data storage, consumption, integration, and management across cloud environments (Azure, AWS, Google Cloud).
  • Lead Agile/SCRUM processes, participate in sprints and stand-ups, and collaborate with data scientists, BI teams, and other engineering groups to translate data requirements into actionable engineering solutions; mentor junior data engineers.
  • Maintain data quality and governance standards, monitor performance, identify bottlenecks, document data processes and architectures, and stay current with emerging data engineering and AI technologies; generate comprehensive documentation.

Job description

Responsibilities:

Design, implement, and optimize robust and scalable data pipelines using SQL, Python, and cloud-based ETL tools such as Databricks. Ensure efficient data flow and processing to support large-scale data handling. Develop and refine data models to accurately represent business processes, ensuring they're scalable and fully integrate with our extensive data architecture, including Big Data frameworks like Spark. Enhance our overarching data architecture strategy, assisting in decisions related to data storage, consumption, integration, and management within cloud environments (Azure, AWS, or Google Cloud). Lead and contribute within Agile/SCRUM frameworks to ensure timely and efficient project deliveries. Actively participate in sprints and stand-ups, applying these methodologies to streamline development. Partner with data scientists, BI teams, and other engineering teams to understand and translate complex data requirements into actionable engineering solutions. Guide and mentor junior data engineers, promoting best practices in SQL, Python, and cloud technologies, and fostering a culture of continuous learning and improvement. Uphold and champion data quality standards and governance policies, ensuring reliability and compliance in all data-related tasks. Monitor and enhance the performance of data infrastructure, proactively identifying and resolving bottlenecks or inefficiencies in cloud and Big Data environments. Stay abreast of emerging data engineering and AI technologies and methodologies, recommending and implementing innovative tools or practices as appropriate. Generate comprehensive documentation for data processes, pipelines, and architectures to ensure clarity and ease of maintenance for the team, including detailed descriptions of cloud and Big Data implementations. Telecommuting permitted.

Qualifications:

Master's degree in Data Science, Computer Science, Information Technology, or closely related field and 3 years of experience in the position offered or a related role; or a Bachelor’s degree in the above mentioned fields and 5 years of experience in the position offered or a related role. Requires 2 years of experience in all of the following: data engineering; designing, implementing, and optimizing databases and data pipelines; SQL Server, Oracle, or other relational database management systems (RDBMS); SQL and Python for advanced data manipulation and analytics; data modeling and architecture for both analytics and transactional systems within large-scale environments; at least one major cloud data platform (Azure, AWS, Google Cloud) with practical application in data engineering projects; Spark and Cloud ETL tools like Databricks, focusing on scalability and real-time processing capabilities; and developing data models for integration and analysis that support business intelligence and data analytics initiatives.

 

SALARY: $125,000.00/yr. - $150,000.00/yr.

 

LOCATION: 310 E. 67th Street, New York, NY 10065

                   601 Midland Ave, Rye, NY 10580

Overview:

Founded in 1964, New York Blood Center (NYBC) has served the tri-state area for more than 60 years, delivering 500,000 lifesaving blood products annually to 150+ hospitals, EMS and healthcare partners. NYBC is part of New York Blood Center Enterprises (NYBCe), which spans 17+ states and delivers one million blood products to 400+ U.S. hospitals annually. NYBCe additionally delivers cellular therapies, specialty pharmacy, and medical services to 200+ research, academic and biopharmaceutical organizations. NYBCe’s Lindsley F. Kimball Research Institute is a leader in hematology and transfusion medicine research, dedicated to the study, prevention, treatment and cure of bloodborne and blood-related diseases. NYBC serves as a vital community lifeline dedicated to helping patients and advancing global public health. To learn more, visit nybc.org. Connect with us on Facebook, X, Instagram, and LinkedIn.

Data Engineer Related jobs

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.