Experience managing cloud services (AWS and/or GCP) including provisioning, configuring resources, cost optimization, and leveraging cloud-native services for scalability and reliability
Strong experience designing, implementing, and maintaining CI/CD pipelines for automated build, test, and deployment
Experience implementing monitoring solutions to track health, performance, and availability of applications and infrastructure components
Knowledge of security best practices across the SDLC and ability to collaborate with security teams; incident response and on-call experience
Requirements:
Manage cloud services (AWS and GCP), including provisioning, configuring resources, cost optimization, and leveraging cloud-native services for scalability and reliability
Design, implement, and maintain CI/CD pipelines to automate building, testing, and deployment of applications
Implement monitoring solutions to track the health, performance, and availability of applications and infrastructure components
Collaborate with security teams to enforce security best practices across the SDLC, and participate in incident response (on-call rotations and post-incident reviews)
Job description:
Position: DevOps Lead Engineer
Location: Remote
Duration: Contract
Rate: DOE
· Manage cloud services (AWS, GCP) effectively, including provisioning and configuring cloud resources, optimizing costs, and leveraging cloud-native services to enhance scalability and reliability.
· Design, implement, and maintain CI/CD pipelines to automate the building, testing, and deployment of applications.
· Implement monitoring solutions to track the health, performance, and availability of applications and infrastructure components.
· Collaborate with security teams to implement security best practices throughout the SDLC.
· Foster a culture of collaboration and communication between development, operations, and other cross-functional teams.
· Analyze system performance metrics and conduct capacity planning to ensure that infrastructure resources meet current and future demand.
· Work with cross-functional teams to diagnose root causes of incidents, implement temporary fixes, and develop long-term solutions that prevent recurrence and avoid outages. Participate in on-call rotations and perform post-incident reviews to identify areas for improvement.
· Healthcare domain experience is a plus.