Match score not available

Site Reliability Engineer lll

85% Flex
EXTRA HOLIDAYS - EXTRA PARENTAL LEAVE - FULLY FLEXIBLE - 4 DAY WEEK
Remote: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Bachelor’s degree in Computer Science or STEM field, Minimum 7 years’ experience in an engineering role.

Key responsabilities:

  • Develop and integrate internal systems
  • Monitor server health and stability
  • Meet KPIs, SLAs and error budgets
  • Coding, automation on Cloud Platforms
  • Provide feedback and mentorship
Emburse  logo
Emburse Information Technology & Services Scaleup https://www.emburse.com/
501 - 1000 Employees
HQ: Los Angeles
See more Emburse offers

Job description

Logo Jobgether

Your missions

Develop software and software fixes to integrate internal systems. Ensure code quality, test and distribute code updates, and monitor the health and stability of the servers.

What you'll do:
  • Meet and beat Key Performance Indicators, SLAs, maintain an error budget and adhere to it.
  • Identify, evaluate, and execute preventative measures to minimize and avoid impact to the customer experience
  • Employ deep troubleshooting skills to improve the availability, performance, and security for CR and Emburse, ensure services are designed with 24/7 availability and operational readiness and rigor
  • Coding and Automation of Applications on Cloud Platforms
  • Work with Engineering leadership to build shared services that meet the requirements and need of the platform and application teams
  • Work with Cloud Platform and Operations leaders to develop narratives, backlog grooming, epic planning and overall sprint planning processes
  • Ensure the platform holds a high degree of reliability, at least four 9s.
  • Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
  • Own technically intricate issues that cross between DevOps, Databases, Networking, Code, Infrastructure and people; drive them to satisfactory completion.
  • Work closely with product different stakeholders to align Operational priorities and planning with the product and engineering roadmap
  • Prepare and present engineering related documents to key stakeholders
  • Provide recommendations and feedback in review sessions, design reviews and review sessions.
  • Mentor SRE I and II’s
  • Assist guiding more junior engineers in best practices
  • Conduct and assist with investigation, test and deployment activities, identify and mitigate risks in development activities

  • What we're looking for:
  • Bachelor’s degree in Computer Science or a STEM field required
  • Minimum of 7 years’ experience in an engineering role required
  • Deep understanding of infrastructure as code, scripting, self-healing, containers, DevOps tooling, distributed systems higly desired
  • Experience working with Ansible and Terraform tools hightly desirable
  • Excellent written and verbal communication skills, in English
  • Experience with full lifecycle of SaaS implementations as well as Infrastructure as code
  • Excellent follow-up and project management skills
  • Proven ability to create and maintain new tools
  • Excellent troubleshooting skills
  • Excellent technical skills. Up to 70% of the job is hands on in a distributed Linux environment
  • Strong scripting skills. OOP is a plus
  • Liaise between other teams to help prioritize and align priorities
  • Experience working with an off shore team
  • Required profile

    Experience

    Level of experience: Senior (5-10 years)
    Industry :
    Information Technology & Services
    Spoken language(s):
    EnglishEnglish
    Check out the description to know which languages are mandatory.

    Soft Skills

    • troubleshooting-problem-solving
    • technical-acumen
    • collaboration
    • Mentorship
    • Problem Solving
    • verbal-communication-skills

    Site Reliability Engineer Related jobs