Site Reliability Engineer (SRE) - Azure Cloud

Remote: Full Remote

Offer summary

Qualifications:

  • Proficiency in Azure, particularly with Sitecore.
  • Experience in advanced monitoring and analytics.
  • Strong scripting and automation skills, preferably in Bash or Python.
  • Knowledge of incident management protocols and SLAs.

Key responsibilities:

  • Ensure the reliability and performance of over 400 WSF websites.
  • Develop and implement advanced monitoring solutions for system health.
  • Lead incident resolution and conduct post-incident reviews.
  • Mentor the Cloud Ops team on SRE best practices.

Brixio Singapore Scaleup
51 - 200 Employees

Job description

#RemoteWork Opportunity: AZURE Cloud: Site Reliability Engineer (SRE)

*MUST BE RESIDING IN THE PHILIPPINES*

Position: Site Reliability Engineer (SRE)

Location: Philippines (Remote)

About the Project:

Join us in supporting the Website Factory (WSF) project for a global cosmetics company. The project manages over 400 brand websites and aims to deliver a seamless, reliable, high-performance digital experience. It is hosted on Azure PaaS and uses Sitecore for content management.

Role Summary:

As an SRE, you'll ensure the WSF project's reliability, availability, and performance. Collaborating with Cloud Ops and NOC teams, your focus will be on continuous improvement, system health, and SLA compliance. You'll play a key role in incident resolution and mentorship.

Key Objectives:

1. Ensure WSF project infrastructure reliability and performance.

2. Develop advanced monitoring solutions for performance metrics.

3. Lead root-cause analysis for high-severity issues.

4. Mentor Cloud Ops team on SRE best practices.

Key Responsibilities:

- Improve reliability and performance of 400+ WSF websites.

- Monitor system health and performance, identifying areas for improvement.

- Develop automation scripts for system management.

- Lead high-severity incident resolution and post-incident reviews.

- Maintain robust monitoring, alerting, and logging frameworks.

- Collaborate with project stakeholders to meet objectives and SLAs.

- Mentor Cloud Ops team in SRE methodologies.

Required Skills:

**Technical Skills**:

- Azure Proficiency is a MUST, especially in relation to Sitecore.

- Advanced Monitoring & Analytics expertise.

- Scripting & Automation skills (e.g., Bash, Python).

- Incident Management protocols.

- SLA & Metrics understanding.

- Capacity Planning expertise.

- Security best practices.

- Database Knowledge (MSSQL, cache, replication).

- CI/CD Pipelines (Azure DevOps, Jenkins, GitLab).

**Soft Skills**:

- Excellent Communication (verbal & written).

- Strong Problem-solving.

- Leadership & Mentorship.

- Analytical Thinking.

- Collaboration across teams.

- Process Management.

**Certifications**:

- Microsoft Certified: Azure Solutions Architect Expert is a PLUS.

- Site Reliability Engineering (SRE) Foundation.

- ITIL 4 Specialist: High-Velocity IT.

- Certified Information Systems Security Professional (CISSP).

Join us in this exciting project, ensuring top-notch digital experiences for global cosmetic brands!

#SRE #SiteReliabilityEngineer #RemoteWork #TechJobs #Azure

Required profile

Spoken language(s):
English

Other Skills

  • Collaboration
  • Communication
  • Leadership
  • Analytical Thinking
  • Mentorship
  • Problem Solving