Site Reliability Engineer

Remote: Full Remote
Experience: Mid-level (2-5 years)

Offer summary

Qualifications:

  • Minimum of 3 years of experience in SRE
  • Proficiency in Azure and SQL databases
  • Experience in cost optimization and resource budget management
  • Strong collaboration skills with cross-functional teams
  • Microsoft Certified: Azure Administrator Associate required

Key responsibilities:

  • Ensure performance, availability, and reliability of systems
  • Manage backup solutions and disaster recovery processes
  • Implement strategies for cost optimization
  • Collaborate to manage resource budget and track usage
  • Leverage Azure for deploying and managing services
Spektra Systems https://spektrasystems.com/
201 - 500 Employees

Job description

This is a remote position.

We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a minimum of 3 years of experience in site performance, availability, and backups, along with a strong background in cost optimization, resource budget management, and SQL database management. This role requires close collaboration with the Operations Manager and a solid understanding of Azure and SaaS products.

Key Responsibilities:

Site Performance & Availability:
  • Ensure the performance, availability, and reliability of our systems and services.
  • Monitor, diagnose, and resolve performance issues to maintain optimal service levels.
  • Understand how applications are deployed, which metrics to monitor, and which errors to watch for.
Backups & Recovery:
  • Manage backup solutions and disaster recovery processes to ensure data integrity and availability.
  • Regularly test and validate backup procedures.
  • Develop and maintain disaster recovery (DR) execution plans and playbooks.
Cost Optimization:
  • Implement and oversee strategies for cost optimization, including resource usage monitoring and cost-effective solutions.
  • Work to balance performance with cost efficiency.
Resource Budget Management:
  • Collaborate with the Operations Manager to manage and optimize the resource budget.
  • Track and report on resource usage and costs to ensure alignment with budgetary constraints.
Azure Expertise:
  • Leverage your knowledge of Azure to deploy, manage, and monitor cloud-based applications and services.
  • Ensure the efficient use of Azure resources and services.
Database Administration:
  • Manage and optimize SQL databases, including performance tuning, backup, and recovery.
  • Set up and maintain dashboards/alerts to detect database issues proactively. 
Performance Measurement Systems:
  • Set up performance measurement systems and track key metrics to ensure system reliability and efficiency.
Collaboration:
  • Work closely with the Operations Manager and other teams to align on operational goals and strategies.
  • Provide insights and recommendations for improving system reliability and efficiency.


Requirements
  • Experience: Minimum of 3 years of experience as a Site Reliability Engineer or in a similar role focusing on site performance, availability, backups, cost optimization, and SQL database management.
  • Technical Skills: Proficiency in managing and optimizing systems with hands-on experience in Azure and SQL databases. Knowledge of SaaS product environments is essential.
  • Cost Optimization: Proven experience in cost management and resource budget optimization.
  • Collaboration: Strong ability to work closely with Operations Managers and cross-functional teams to drive reliability and efficiency improvements.
  • Communication: Excellent verbal and written communication skills to convey technical concepts effectively and collaborate with various stakeholders.
  • SQL Skills: Strong understanding of SQL database management, performance tuning, and administration.

Certifications:
  • Required: Microsoft Certified: Azure Administrator Associate (Exam AZ-104)
  • Preferred: Microsoft Certified: Azure Solutions Architect Expert (Exam AZ-305)

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s):
English

Other Skills

  • Collaboration
  • Communication
