Match score not available

Senior Site Reliability Engineer

75% Flex
EXTRA PARENTAL LEAVE
Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

3+ years of AWS and Kubernetes experience, Familiarity with DevOps/SRE environments, Experience with Telemetry and automation tools.

Key responsabilities:

  • Design, deploy, and manage Kubernetes at scale
  • Collaborate in Agile teams on highly reliable solutions
Everbridge  logo
Everbridge Large https://www.everbridge.com/
1001 - 5000 Employees
See more Everbridge offers

Job description

Logo Jobgether

Your missions

Are you motivated by an incredible sense of purpose in doing work that helps keep people safe? Are you passionate about innovating on cutting edge technology to develop robust architecture principles, operability guidelines, progressive scaling methodologies, and implementing other sophisticated techniques to reliably operate infrastructure at scale? Do you have an appetite for securing systems, streamlining efficiency, automating away toil, and proactively eliminating problems before they occur? If so, this position is a perfect opportunity for you to join the Everbridge Kubernetes Platform team.

As part of the Everbridge Federal Kubernetes Platform team, you will play a critical role in ensuring the overall service quality and availability of Everbridge's solutions. This includes designing, deploying, managing Kubernetes at scale, evangelizing both Kubernetes and SRE best practices, and helping to push the boundaries of the latest technology. The platforms that you will support are critical to the delivery of time sensitive information to help keep people safe and businesses running. We are dedicated, passionate people who are committed to customer service and doing the right thing.

What You'll Do:
  • Keep people safe and businesses running.
  • Be an integral member of the team implementing our platform in a DoD IL4 cloud environment.
  • Own and maintain the Kubernetes infrastructure from conception to completion within AWS. Including services such as VPCs, EC2, Transit Gateways, IAM roles and policies, Route53, S3, SGs, NACLs
  • Build upon the operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's Kubernetes solutions.
  • Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other engineers on designing and implementing highly reliable solutions.
  • Research and implement SRE and Kubernetes best practices and by creating automation, cross-functional collaboration, and data-driven decisions to reinforce the integrity and reliability of our systems.
  • Participate in a rotating on-call rotation to resolve production escalations

  • What You'll Bring:
  • 3+ years of technical AWS experience, managing and owning systems in a production environment
  • 2+ years of Kubernetes experience (EKS, AKS, GKE, Self managed)
  • 3+ years of Terraform or similar IaC experience
  • Experience with the following tooling: GitLab CICD, Packer, Docker, EKS, Kubernetes, Spinnaker, Helm, Argo, Jenkins
  • Experience with Telemetry tools such as Datadog, SumoLogic, Grafana, Prometheus
  • Experience writing automation in languages such as Python, Go, Bash, Java
  • Experience with configuration management tools such as Salt, Ansible, AWS user_data
  • Experience with a DevOps/SRE production environment
  • Experience with Agile practices
  • Large scale production UNIX/Linux experience
  • Experience working on DoD IL4 programs
  • Currently hold a Secret Clearance or a be a US citizen with the ability to obtain a Secret Clearance
  • Must have or be able to obtain and maintain DoD 8140 “Intermediate” level or higher certification (formally DoD 8170 IAM Level II)
  • #LI-JS1
    #LI-Remote

    Required profile

    Experience

    Level of experience: Mid-level (2-5 years)
    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Soft Skills

    • Proactive Mindset
    • Collaborative

    Go Premium: Access the World's Largest Selection of Remote Jobs!

    • Largest Inventory: Dive into the world's largest remote job inventory. More than half of these opportunities can't be found on standard platforms.
    • Personalized Matches: Our AI-driven algorithms ensure you find job listings perfectly matched to your skills and preferences.
    • Application fast-lane: Discover positions where you rank in the TOP 5% of applicants, and get personally introduced to recruiters with Jobgether.
    • Try out our Premium Benefits with a 7-Day FREE TRIAL.
      No obligations. Cancel anytime.
    Upgrade to Premium

    Find more Site Reliability Engineer jobs