Proven experience managing cloud environments on AWS and GCP, including provisioning/configuring resources and cost optimization.
Strong expertise in designing, implementing, and maintaining CI/CD pipelines.
Experience implementing monitoring, logging, and alerting for applications and infrastructure.
Knowledge of security best practices across the SDLC; healthcare domain experience is a plus.
Requirements:
Manage cloud services (AWS, GCP): provisioning/configuring resources, optimizing costs, and leveraging cloud-native services to improve scalability and reliability.
Design, implement, and maintain CI/CD pipelines to automate building, testing, and deployment of applications.
Implement monitoring solutions to track health, performance, and availability of applications and infrastructure components.
Collaborate with security teams to implement security best practices throughout the SDLC and participate in on-call rotations with post-incident reviews to drive continuous improvement.
Job description
Position- Devops Lead Engineer Location- Remote Duration- Contract Rate- DOE
Responsible for managing cloud services (AWS, GCP) effectively. This includes provisioning and configuring cloud resources, optimizing costs, and leveraging cloud-native services to enhance scalability and reliability.
Design, implement, and maintain CI/CD pipelines to automate the building, testing, and deployment of applications.
Implementing monitoring solutions to track the health, performance, and availability of applications and infrastructure components.
Collaborate with security teams to implement security best practices throughout the SDLC.
Foster a culture of collaboration and communication between development, operations, and other cross-functional teams.
Analyze system performance metrics and conduct capacity planning to ensure that infrastructure resources meet current and future demand.
Work with cross-functional teams to diagnose root causes, implement temporary fixes, and develop long-term solutions to prevent recurrence of Incidents and avoid outages. Participate in on-call rotations and perform post-incident reviews to identify areas for improvement.