Match score not available

Sr. Software Engineer- Observability Platform

Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

5-8 yrs experience in Observability space, Hands-on knowledge of Linux, Chef, GitHub, Experience with Prometheus, OTEL, Grafana, Knowledge of Open Telemetry API and SDKs, Exposure to public Cloud solutions.

Key responsabilities:

  • Configure, maintain Observability tools
  • Collaborate with product teams for solutions
  • Monitor system performance and troubleshoot
  • Automate deployment using Chef and GitHub
  • Implement logging, metrics, traces using OTEL
Gap Inc. logo
Gap Inc. Retail (Super / Hypermarket) XLarge https://www.gapinc.com/
10001 Employees
See more Gap Inc. offers

Job description

Logo Jobgether

Your missions

About the Role
In this role you will work with multiple cross-functional teams to develop, maintain & migrate various observability tools. You will play a significant role in implementing our NextGen Observability platform with logs, metrics & traces. You will be responsible for ensuring the reliability and availability of our tools through monitoring best practices. Your contributions will impact our ability to monitor, analyze, and optimize our systems for peak performance. .
What You'll Do
  • Configure and maintain Observability solutions like NewRelic, Grafana, Prometheus & GCP Logging.
  • Collaborate with multiple product teams and respective owners to design observability solutions as needed. 
  • Monitor system performance and troubleshoot issues.
  • Automate deployment when needed using Chef and GitHub.
  • Implement solutions for logging, metrics and traces using Open Telemetry (OTEL).
  • Participate in on-call rotations and Incident response for observability platform.
  • Mentor more Jr team members when needed and able to collaborate efficiently.
Who You Are
  • 5-8 yrs of relevant experience in Observability space.
  • Strong hands-on admin knowledge of Linux, Chef & GitHub.
  • Strong understanding of Prometheus, OTEL & Grafana.
  • Demonstrated experience with implementing OTEL instrumentation, configuring OTEL collectors & node exporters to scrape telemetry data.
  • Experience with Open Telemetry API and SDKs.
  • Fair understanding of Application & Infra monitoring.
  • Ability to adapt and learn quickly in a fast-paced environment with excellent communication skills to collaborate cross-functionally.
  • Exposure to public Cloud solutions (preferably Azure & GCP).
  • Good exposure of Kubernetes/Docker/Containerization is preferred

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Retail (Super / Hypermarket)
Spoken language(s):
Check out the description to know which languages are mandatory.

Software Engineer Related jobs