Codeworks is an IT Services firm headquartered in SE Wisconsin, known for our strong commitment to quality and for our direct client relationships.
Who We’re Looking For: A Data Infrastructure Engineer to partner with Data Scientists and Developers working on a cutting-edge, innovative team tasked with supporting clinical research, AI solutions, application development, and deployment of these solutions within a practice of over 2,500 physicians. 100% Remote, contract or permanent position!
Key Responsibilities:
Design, implement, and maintain data pipelines that collect, process, and store large volumes of data from various sources.
Ensure data quality, integrity, and consistency throughout the pipeline.
Create and manage scalable data storage solutions, such as data warehouses, data lakes, and databases.
Optimize data storage for efficient retrieval and processing.
Set up and manage computational environments and clusters, including cloud-based resources and on-premises hardware.
Ensure that data scientists have access to the necessary computational power for their analyses and model training.
Implement and enforce data security measures to protect sensitive information.
Ensure compliance with relevant regulations and industry standards (e.g., GDPR, HIPAA).
Develop infrastructure for deploying machine learning models into production environments.
Monitor the performance and health of deployed models, ensuring they remain accurate and efficient.
Provide tools and platforms (such as Jupyter Notebooks, version control systems, and collaborative workspaces) that enable data scientists to work efficiently and collaboratively.
Implement best practices for code management, experiment tracking, and reproducibility.
Continuously optimize data processing workflows for speed and efficiency.
Implement techniques to reduce computational costs and improve resource utilization.
Required Skills and Qualifications:
Strong knowledge of programming languages (such as Python, R, and SQL) and experience with big data technologies (such as Hadoop, Spark, and Kafka).
Expertise in cloud platforms like AWS, Google Cloud Platform, or Azure, including services for data storage, processing, and machine learning.
Proficiency in managing relational databases (like PostgreSQL, MySQL) and NoSQL databases (like MongoDB, Cassandra).
Skills in ETL (Extract, Transform, Load) processes, data warehousing, and building robust data pipelines.
Experience with DevOps practices, including CI/CD (Continuous Integration/Continuous Deployment) pipelines, containerization (using Docker), and orchestration tools (such as Kubernetes).
Understanding of data security practices, encryption methods, and regulatory compliance requirements.
Ability to analyze complex systems and workflows to identify bottlenecks and areas for improvement.
About Codeworks: Codeworks has over 25 years of experience serving Fortune 1000 companies in Wisconsin as well as our client's national locations. Our recruiting team excels at evaluating, advising, and connecting IT professionals with new opportunities that will satisfy their expectations regarding income and opportunity for growth. At Codeworks, we're committed to diversity, equity, and inclusion in our workforce and beyond. We believe in equal opportunities and value the unique perspectives that every individual brings to our team. Join us in creating an inclusive, innovative, and collaborative workplace where your talents can thrive.
Codeworks is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, age, disability, or national origin.