Senior Platform Engineer

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Proficiency with Amazon Web Services (ECS, VPC, Route53, CloudFront, Lambda)., Solid programming skills in Java or Python with a deep understanding of object-oriented techniques., Hands-on experience with infrastructure-as-code tools, preferably Terraform., Familiarity with DevOps culture and the full Software Development life-cycle..

Key responsibilities:

  • Drive the design and evolution of observability platforms across various domains.
  • Lead architectural initiatives to ensure scalable and resilient observability tooling.
  • Mentor engineering teams in embedding observability in their processes.
  • Monitor platform health and optimize for scaling and innovation.

Ocado Intelligent Automation logo
Ocado Intelligent Automation http://ocadointelligentautomation.com
5001 - 10000 Employees
See all jobs

Job description

Ocado Technology is building the next-generation grocery e-commerce suite that’s changing the way the world shops.

This role is critical to scaling our OTP logging, tracing, and emerging observability domains—such as profiling and AI/ML-powered diagnostics. You will act as a technical leader, driving architectural decisions, reducing technical debt, and enabling engineering teams to operate with deep insight, resilience, and autonomy.

Key Responsibilities:

  • Drive the design, development, and evolution of observability platforms across logs, traces, profiling, and advanced analytics domains.
  • Lead architectural initiatives, ensuring observability tooling is scalable, resilient, and aligned with broader platform strategy.
  • Mentor and support engineering teams, helping them embed observability in design, development, and operations through hands-on guidance and training.
  • Reduce long-standing technical debt in our observability stack by modernizing and refactoring legacy systems.
  • Establish best practices and standards for observability instrumentation, telemetry collection, and diagnostics.
  • Build and maintain internal tooling, with automation and Infrastructure as Code (Terraform) at the core of deployment strategies.
  • Collaborate with cross-functional stakeholders, influencing product direction and platform priorities through data-driven insights.
  • Monitor platform usage and health, identifying areas for optimization, scaling, and innovation.
  • Participate in the full lifecycle of observability products, from discovery to delivery, including on-call responsibilities where appropriate.
  • Stay ahead of industry trends, championing adoption of emerging technologies such as Grafana innovations, OpenTelemetry, and AI/ML-based observability.

Knowledge, Skills and Experience

ESSENTIAL  

  • Proficiency with cloud service providers, primarily Amazon Web Services ( ECS, VPC, Route53, CloudFront, Lambda).
  • Solid programming skills in any object-oriented programming language, with a deep understanding of the underlying techniques (Java & Python preferred).
  • Hands-on experience with infrastructure-as-a-code tools (Terraform preferred).
  • Comprehensive experience in the full Software Development life-cycle, from design to deployment.
  • Understanding of operating systems, orchestration and deployment automation. 
  • Familiarity with DevOps culture, including its fundamental tools and concepts, such as source control management, CI/CD, and deployment strategies.

DESIRABLE

  • Past experience with Observability tools (Grafana preferred)  and performance tuning
  • Knowledge in code review and change control.
  • Experience in ad hoc reporting and analysis.
  • Exposure to research, development, and optimization tasks.

REQUIRED COMPETENCIES 

  • Technical Excellence: demonstrates intellectual rigor, possesses relevant abilities & is able to pick up new skills quickly
  • Innovation & Problem Solving: able to solve complex problems, participates in continuous improvement, adapts the ideas of others
  • Productivity, Drive & Achievement: proactive approach, gets things done, demonstrates accountability & ownership, prioritizing own workload
  • Business awareness: ability to apply learned skills, awareness beyond immediate area/role
  • Adaptability: working under pressure, flexible, positive & focused during times of change
  • Communication & Impact: strong verbal and written communication in English. Robust interaction with internal clients
  • Teamwork: works well with others & actively contributes towards team objectives

Please let us know in your application if you need any special adaptations for the selection process. At Ocado Barcelona, we adapt our selection processes to our candidates.

Be bold, be unique, be brilliant, be you. We are looking for individuality and we value diversity above gender, sexual orientation, race, nationality, ethnicity, religion, age, disability or union participation. We are an equal opportunities employer and we are committed to treating all applicants and employees fairly and equally.

#LI-REMOTE

#LI-OT

#LI-KS1

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Adaptability
  • Teamwork
  • Communication
  • Problem Solving

Platform Engineer Related jobs