Match score not available

Snr. Site Reliability Engineer (Position located in Sheffield, United Kingdom)

Remote: 
Full Remote
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Bachelor's degree in related field., 5+ years of SRE or DevOps experience., Expertise in CI/CD workflows., Proficiency in AWS or Azure..

Key responsabilities:

  • Manage and maintain environments securely.
  • Design and implement CI/CD pipelines.
  • Monitor and troubleshoot system performance.
  • Collaborate with teams to meet project needs.
KnowBe4 logo
KnowBe4 Computer Hardware & Networking Large https://www.knowbe4.com/
1001 - 5000 Employees
See more KnowBe4 offers

Job description

About KnowBe4

KnowBe4, the provider of the world's largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to manage the ongoing problem of social engineering by helping them train employees to make smarter security decisions, every day.

Fortune has ranked us as a best place to work for women, for millennials, and in technology for four years in a row! We have been certified as a "Great Place To Work" in 8 countries, plus we've earned numerous other prestigious awards, including Glassdoor's Best Places To Work.

Our team values radical transparency, extreme ownership, and continuous professional development in a welcoming workplace that encourages all employees to be themselves. Whether working remotely or in-person, we strive to make every day fun and engaging; from team lunches to trivia competitions to local outings, there is always something exciting happening at KnowBe4.

To learn more about our team and office culture in England (UK), visit the following links. 
Careers Page: https://www.knowbe4.com/careers/locations/york
Glassdoor: https://www.glassdoor.com/Location/KnowBe4-York-Location-EI_IE969384.0,7_IL.8,12_IC3297365.htm
LinkedInhttps://www.linkedin.com/company/knowbe4/life/uk/

The team’s role is to deploy and maintain the infrastructure where our Products live. The main purpose is to get product features from code developed by the Development teams and transform it into infrastructure so our customers can ultimately use it, while keeping this infrastructure secure and updated. They play a pivotal role in providing deployment and support for infrastructure hosted in Azure and AWS cloud. This ranges from creating infrastructure diagrams to writing infrastructure as code and deploying pipelines for applications. Additionally they provide 24/7/365 support managing single servers to huge clusters of servers for a wide range of customers dealing with practically every team in the business to get a customer’s dream out to production.

Responsibilities:

  • Manage and maintain environments to ensure high availability and security.
  • Design and implement CI/CD pipelines to automate software delivery.
  • Monitor and troubleshoot system performance issues, using observability tools like Prometheus, Grafana, or Datadog.
  • Collaborate with development teams to align infrastructure efforts with project needs and timelines.
  • Build and maintain infrastructure as code (IaC) solutions using tools like Terraform
  • Manage AWS/Azure services, including ECS/Container Apps, S3/blob storage etc
  • Participate in incident response, conducting root cause analysis and post-incident reviews.
  • Automate manual tasks to improve operational efficiency and reduce technical debt.
Minimum Qualifications:
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5+ years equivalent work experience in SRE, DevOps, or infrastructure management may substitute for formal education.
  • CI/CD Workflows: Expertise in designing and maintaining automated pipelines for continuous delivery.
  • AWS or Azure Cloud Expertise: Strong knowledge of AWS/Azure services,
  • Infrastructure-as-Code: Proficiency in Terraform, Ansible, or similar tools.
  • Monitoring and Observability: Experience with Prometheus, Grafana, Datadog, or other observability platforms.
  • Automation and Scripting: Proficiency in Python, Bash, or other scripting languages to automate tasks.
  • Incident Management: Ability to lead incident response efforts and conduct root cause analysis.
  • Collaboration and Communication: Strong interpersonal skills to work effectively across teams and with stakeholders.

Our Fantastic Benefits

We offer company-wide bonuses based on monthly sales targets, employee referral bonuses, adoption assistance, tuition reimbursement, certification reimbursement, certification completion bonuses, and a relaxed dress code - all in a modern, high-tech, and fun work environment. For more details about our benefits in each office location, please visit www.knowbe4.com/careers/benefits.

Note: An applicant assessment and background check may be part of your hiring procedure.

Individuals seeking employment at KnowBe4 are considered without prejudice to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, sexual orientation or any other characteristic protected under applicable federal, state, or local law. If you require reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please visit www.knowbe4.com/careers/request-accommodation.

No recruitment agencies, please.

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Computer Hardware & Networking
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication

Site Reliability Engineer (SRE) Related jobs