Experience managing cloud services (AWS and GCP) including provisioning, configuring resources, cost optimization, and leveraging cloud-native services
Strong experience designing, implementing, and maintaining CI/CD pipelines for automated building, testing, and deployment
Experience implementing monitoring/observability solutions for health, performance, and availability of applications and infrastructure
Knowledge of security best practices across the SDLC and ability to collaborate with security teams; experience participating in on-call rotations and incident post-mortems
Requirements:
Manage cloud services (AWS, GCP) including provisioning/configuration, cost optimization, and leveraging cloud-native services to enhance scalability and reliability
Design, implement, and maintain CI/CD pipelines for automated building, testing, and deployment of applications
Implement monitoring/observability solutions to track health, performance, and availability of applications and infrastructure; collaborate with security teams to enforce SDLC security practices
Analyze system performance, conduct capacity planning, and participate in incident management (on-call rotations, post-incident reviews) to prevent outages and drive improvements
Job description
Job Title - DevOps Lead Engineer
Location - Remote
Duration - 12+ Months
Rate - DOE
U.S. citizens and those authorized to work in the U.S. are encouraged to apply. We are unable to sponsor at this time. No H1B, CPT, OPT, or third-party C2C for this client.
Manage cloud services (AWS, GCP) effectively, including provisioning and configuring cloud resources, optimizing costs, and leveraging cloud-native services to enhance scalability and reliability.
Design, implement, and maintain CI/CD pipelines to automate the building, testing, and deployment of applications.
Implement monitoring solutions to track the health, performance, and availability of applications and infrastructure components.
Collaborate with security teams to implement security best practices throughout the SDLC.
Foster a culture of collaboration and communication between development, operations, and other cross-functional teams.
Analyze system performance metrics and conduct capacity planning to ensure that infrastructure resources meet current and future demand.
Work with cross-functional teams to diagnose root causes, implement temporary fixes, and develop long-term solutions that prevent incidents from recurring and avoid outages. Participate in on-call rotations and perform post-incident reviews to identify areas for improvement.