Key Responsibilities
\n\n
\n- Act as a senior member of the SRE team, supporting activities including the backlog and workload of the team, scoping requirements, peer review of code, providing feedback to the rest of the team.
- Represent the team in management and stakeholder meetings. Ensure best practices are kept, and suggest improvements to our development processes where you see gaps.
- Investigate, test, and resolve technical problems, working closely with other engineers to deliver core product functionality.
- Defining SLOs, SLIs, and SLAs for key metrics that indicate the health, security, stability and uptime of production, staging and development environments
- Monitoring the above environments and reacting to alerts and issues that may arise in day-to-day operation of their product line.
- Participate in an on-call rota for priority-1 level alarms with the rest of the Platform teams
- Ongoing upgrades and improvements to operational processes to optimise performance, stability and cost.
- Working with the platform engineering team to contribute to the planning of how we carry application/infrastructure releases and configuration changes.
- Interact with internal teams and external 3rd party vendors to troubleshoot and resolve complex problems