Key Facts

Remote From:

New York (USA)

Full time

Senior (5-10 years)

English

Hard Skills

Other Skills

•
Communication
•
Leadership
•
Time Management
•
Teamwork
•
Problem Solving

Roles & Responsibilities

Master's degree in Data Science, Computer Science, Information Technology, or a closely related field with 3 years of experience in data engineering or a related role; or Bachelor's degree in the above fields with 5 years of experience in the position offered or a related role.
At least 2 years of experience in data engineering, including designing, implementing, and optimizing databases and data pipelines; experience with SQL Server, Oracle, or other RDBMS.
Proficiency in SQL and Python for advanced data manipulation and analytics; experience in data modeling and architecture for analytics and transactional systems within large-scale environments.
Experience with at least one major cloud data platform (Azure, AWS, or Google Cloud) with practical application in data engineering projects; Spark and cloud ETL tools like Databricks focusing on scalability and real-time processing; developing data models for BI and analytics initiatives.

Requirements:

Design, implement, and optimize robust and scalable data pipelines using SQL, Python, and cloud-based ETL tools such as Databricks; ensure efficient data flow and processing to support large-scale data handling; develop and refine data models and integrate with the broader data architecture, including Big Data frameworks like Spark.
Lead and contribute to the data architecture strategy, guiding decisions on data storage, consumption, integration, and management across cloud environments (Azure, AWS, Google Cloud).
Lead Agile/SCRUM processes, participate in sprints and stand-ups, and collaborate with data scientists, BI teams, and other engineering groups to translate data requirements into actionable engineering solutions; mentor junior data engineers.
Maintain data quality and governance standards, monitor performance, identify bottlenecks, document data processes and architectures, and stay current with emerging data engineering and AI technologies; generate comprehensive documentation.

New York Blood Center

About New York Blood Center

Founded in 1964, New York Blood Center (NYBC) is a nonprofit organization that is one of the largest independent, community-based blood centers in the world. NYBC, along with its partner organizations Community Blood Center of Kansas City, Missouri (CBC), Innovative Blood Resources (IBR), Blood Bank of Delmarva (BBD), and Rhode Island Blood Center (RIBC), collect approximately 4,000 units of blood products each day and serve local communities of more than 45 million people in the Tri-State area (NY, NJ, CT), Mid Atlantic area (PA, DE, MD), the Kansas City metropolitan area, Minnesota, Nebraska, Rhode Island, and Southern New England. NYBC and its partners also provide a wide array of transfusion-related medical services, including Comprehensive Cell Solutions, the National Center for Blood Group Genomics, the National Cord Blood Program, and the Lindsley F. Kimball Research Institute, which — among other milestones — developed the Hepatitis B vaccine and a patented solvent detergent plasma process innovating blood-purification technology worldwide.

Company type: Large

Founded: 2018

Company size: 1001 - 5000

Website LinkedIn See all jobs →

Job description

Responsibilities:

Design, implement, and optimize robust and scalable data pipelines using SQL, Python, and cloud-based ETL tools such as Databricks. Ensure efficient data flow and processing to support large-scale data handling. Develop and refine data models to accurately represent business processes, ensuring they're scalable and fully integrate with our extensive data architecture, including Big Data frameworks like Spark. Enhance our overarching data architecture strategy, assisting in decisions related to data storage, consumption, integration, and management within cloud environments (Azure, AWS, or Google Cloud). Lead and contribute within Agile/SCRUM frameworks to ensure timely and efficient project deliveries. Actively participate in sprints and stand-ups, applying these methodologies to streamline development. Partner with data scientists, BI teams, and other engineering teams to understand and translate complex data requirements into actionable engineering solutions. Guide and mentor junior data engineers, promoting best practices in SQL, Python, and cloud technologies, and fostering a culture of continuous learning and improvement. Uphold and champion data quality standards and governance policies, ensuring reliability and compliance in all data-related tasks. Monitor and enhance the performance of data infrastructure, proactively identifying and resolving bottlenecks or inefficiencies in cloud and Big Data environments. Stay abreast of emerging data engineering and AI technologies and methodologies, recommending and implementing innovative tools or practices as appropriate. Generate comprehensive documentation for data processes, pipelines, and architectures to ensure clarity and ease of maintenance for the team, including detailed descriptions of cloud and Big Data implementations. Telecommuting permitted.

Qualifications:

Master's degree in Data Science, Computer Science, Information Technology, or closely related field and 3 years of experience in the position offered or a related role; or a Bachelor’s degree in the above mentioned fields and 5 years of experience in the position offered or a related role. Requires 2 years of experience in all of the following: data engineering; designing, implementing, and optimizing databases and data pipelines; SQL Server, Oracle, or other relational database management systems (RDBMS); SQL and Python for advanced data manipulation and analytics; data modeling and architecture for both analytics and transactional systems within large-scale environments; at least one major cloud data platform (Azure, AWS, Google Cloud) with practical application in data engineering projects; Spark and Cloud ETL tools like Databricks, focusing on scalability and real-time processing capabilities; and developing data models for integration and analysis that support business intelligence and data analytics initiatives.

SALARY: $125,000.00/yr. - $150,000.00/yr.

LOCATION: 310 E. 67th Street, New York, NY 10065

601 Midland Ave, Rye, NY 10580

Overview:

Founded in 1964, New York Blood Center (NYBC) has served the tri-state area for more than 60 years, delivering 500,000 lifesaving blood products annually to 150+ hospitals, EMS and healthcare partners. NYBC is part of New York Blood Center Enterprises (NYBCe), which spans 17+ states and delivers one million blood products to 400+ U.S. hospitals annually. NYBCe additionally delivers cellular therapies, specialty pharmacy, and medical services to 200+ research, academic and biopharmaceutical organizations. NYBCe’s Lindsley F. Kimball Research Institute is a leader in hematology and transfusion medicine research, dedicated to the study, prevention, treatment and cure of bloodborne and blood-related diseases. NYBC serves as a vital community lifeline dedicated to helping patients and advancing global public health. To learn more, visit nybc.org. Connect with us on Facebook, X, Instagram, and LinkedIn.