Match score not available

Incident Site Reliability Engineer (SRE) IRC222117

fully flexible
Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Master’s degree in Computer Science or Engineering preferred, 2+ years of experience as SRE, Advanced proficiency with Microsoft Azure services, Strong knowledge of Kubernetes and Helm, Proficiency in Bash and Python.

Key responsabilities:

  • Adhere to incident response protocols
  • Perform detailed post-incident assessments
  • Document findings and propose preventive measures
  • Evaluate and deploy automation solutions
  • Maintain communication with stakeholders
GlobalLogic logo
GlobalLogic https://www.globallogic.com/
10001 Employees
See more GlobalLogic offers

Job description

Description:

Our client is a world-leading provider of telecom equipment, solutions and services to mobile and fixed network operators and telecom providers all over the world. The project aim is to empower developers by providing access to advanced 5G network functionalities through open APIs and is a cutting-edge platform that brings together over 70 microservices for robust and scalable digital solutions.

Requirements:


  • Master’s degree in Computer Science or Engineering (IT, Telecom) preferred.
  • 2+ years of experience in a Site Reliability Engineer (SRE) role or similar position.
  • Advanced proficiency with Microsoft Azure services such as AKS, NSG, and Storage.
  • Strong practical knowledge of Kubernetes, Helm, and FluxCD.
  • Hands-on experience in creating and maintaining Terraform configurations.
  • Demonstrated troubleshooting skills for distributed cloud-native applications using tools like kubectl, k8s, Lens Pro, and metrics within the ELK stack.
  • Solid understanding of DevOps principles and proficiency in GitOps automation.
  • Proficiency in Bash and Python programming languages.
  • Good familiarity with GitLab CI/CD processes.
  • Effective interpersonal communication skills in a highly collaborative team environment
  • Advanced user level proficiency in Jira.


Job Responsibilities:


  • Adhere to incident response protocols and facilitate collaboration among teams to address issues promptly.
  • Perform detailed post-incident assessments to ascertain underlying causes of issues.
  • Document findings, propose preventive measures, and actively contribute to refining incident response protocols.
  • Evaluate automated remediation tools for addressing known issues or recurring incident scenarios.
  • Deploy automation solutions to minimize manual intervention during incident response procedures.
  • Maintain effective communication with stakeholders, including technical teams, management, and customers, ensuring timely updates on incident status and resolution progress


What We Offer

Empowering Projects: With 500+ clients spanning diverse industries and domains, we provide an exciting opportunity to contribute to groundbreaking projects that leverage cutting-edge technologies. As a team, we engineer digital products that positively impact people’s lives.

Empowering Growth: We foster a culture of continuous learning and professional development. Our dedication is to provide timely and comprehensive assistance for every consultant through our dedicated Learning & Development team, ensuring their continuous growth and success.

DE&I Matters: At GlobalLogic, we deeply value and embrace diversity. We are dedicated to providing equal opportunities for all individuals, fostering an inclusive and empowering work environment.

Career Development: Our corporate culture places a strong emphasis on career development, offering abundant opportunities for growth. Regular interactions with our teams ensure their engagement, motivation, and recognition. We empower our team members to pursue their career goals with confidence and enthusiasm.

Comprehensive Benefits: In addition to equitable compensation, we provide a comprehensive benefits package that prioritizes the overall well-being of our consultants. We genuinely care about their health and strive to create a positive work environment.

Flexible Opportunities: At GlobalLogic, we prioritize work-life balance by offering flexible opportunities tailored to your lifestyle. Explore relocation and rotation options for diverse cultural and professional experiences in different countries with our company.

About GlobalLogic GlobalLogic is a leader in digital engineering. We help brands across the globe design and build innovative products, platforms, and digital experiences for the modern world. By integrating experience design, complex engineering, and data expertise—we help our clients imagine what’s possible, and accelerate their transition into tomorrow’s digital businesses. Headquartered in Silicon Valley, GlobalLogic operates design studios and engineering centers around the world, extending our deep expertise to customers in the automotive, communications, financial services, healthcare and life sciences, manufacturing, media and entertainment, semiconductor, and technology industries. GlobalLogic is a Hitachi Group Company operating under Hitachi, Ltd. (TSE: 6501) which contributes to a sustainable society with a higher quality of life by driving innovation through data and technology as the Social Innovation Business.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Interpersonal Communications
  • Troubleshooting (Problem Solving)

Site Reliability Engineer (SRE) Related jobs