[Job23244] Senior DevOps/SRE Engineer, Brazil

Work set-up: 
Full Remote
Experience: 
Senior (5-10 years)

Offer summary

Qualifications:

  • Proven experience in DevOps or SRE roles with incident response and automation skills.
  • Proficiency in programming languages like Python, Go, or Java for scripting tasks.
  • Experience with cloud platforms such as Azure and containerization tools like Docker and Kubernetes.
  • Excellent communication skills in English, both written and verbal.

Key responsibilities:

  • Lead and participate in incident response efforts to resolve critical issues.
  • Optimize system performance and collaborate on capacity planning.
  • Develop and maintain infrastructure as code using tools like Terraform.
  • Implement and improve monitoring, alerting, and logging systems.

CI&T
5001 - 10000 Employees

Job description

We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions.
With over 7,400 CI&Ters around the world, we’ve built partnerships with more than 1,000 clients over our 30-year history. Artificial Intelligence is our reality.

Job Description:
We are seeking an experienced Senior DevOps Engineer to join our dynamic and innovative team. As a DevOps Engineer, you will play a key role in ensuring the reliability, availability, and performance of our systems and services. You will work closely with cross-functional teams to build and maintain a robust and scalable infrastructure while championing best practices for reliability, automation, performance optimization, monitoring, and alerting.
You need advanced and/or fluent proficiency in English to communicate with these different teams and clients during the workday.

Key Responsibilities:

Incident Response: Lead and participate in incident response efforts, managing critical incidents to resolution, conducting post-incident analyses, and implementing preventive measures.

Performance Optimization: Identify and address performance bottlenecks, and work with the team to optimize system performance to meet service-level objectives (SLOs).

Capacity Planning: Collaborate on capacity planning efforts, ensuring that systems can handle current and future growth, and participate in capacity forecasting and resource allocation.

Automation: Develop and maintain infrastructure as code (IaC) using tools like Terraform and automate routine operational tasks to improve efficiency and reduce manual intervention.

Monitoring and Alerting: Implement and enhance monitoring, alerting, and logging systems to proactively detect issues, conduct root cause analysis, and ensure system health.

Collaboration: Collaborate with development, operations, and other teams to bridge the gap between development and production environments, and promote a culture of collaboration to improve automation, efficiency, delivery, and software quality.

Documentation: Maintain detailed documentation of systems, processes, and configurations, and contribute to knowledge sharing within the team.

Must-Have Skills:
Excellent written and verbal communication skills in English;
Proven experience in a similar DevOps or SRE role, with a strong focus on incident response, performance optimization and automation;
Proficiency in at least one programming language (e.g., Python, Go, Java) for scripting and automation tasks;
Experience with cloud computing platforms (the client uses Azure) and containerization technologies (e.g., Docker, Kubernetes);
In-depth knowledge of infrastructure as code (IaC) principles and tools;
Strong expertise in implementing and managing monitoring and alerting solutions (e.g., Prometheus, Grafana, Datadog, ELK Stack);
Excellent problem-solving and troubleshooting skills, with a deep understanding of system and network fundamentals;
Experience with GitLab and/or Bitbucket and with continuous integration/continuous deployment (CI/CD) pipelines (Jenkins + Groovy).

Desirable Skills:
Relevant certifications (e.g., Azure or AWS);
Familiarity with microservices architecture and service mesh technologies;
Experience with configuration management tools (e.g., Ansible, Puppet, Chef);
Knowledge of database administration and optimization;
Knowledge of security best practices and experience with security tools and compliance;
Strong communication skills and the ability to work collaboratively in a cross-functional environment;
Prior experience mentoring or leading junior DevOps engineers or SRE team members.

Our benefits:

Health and dental insurance
Meal and food allowance
Childcare assistance
Extended paternity leave
Partnership with gyms and health and wellness professionals via Wellhub (Gympass) / TotalPass
Profit Sharing and Results Participation (PLR)
Life insurance
Continuous learning platform (CI&T University)

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English

Other Skills

  • Troubleshooting (Problem Solving)
  • Problem Solving
  • Collaboration
  • Communication
