Site Reliability Engineer

extra parental leave
Work set-up: 
Full Remote
Contract: 
Salary: 
84.4K - 112.5K yearly
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

  • Minimum of 2 years of technical AWS experience managing production systems.
  • At least 1 year of Kubernetes experience (EKS, AKS, GKE, or self-managed).
  • 2+ years of experience with Terraform or similar Infrastructure as Code tools.
  • Experience with UNIX/Linux systems and working in a DevOps/SRE environment.

Key responsibilities:

  • Implement and maintain scalable, reliable cloud infrastructure in AWS.
  • Collaborate with cross-functional teams to design and deploy high-availability solutions.
  • Participate in on-call rotations to resolve production issues.
  • Research and apply SRE best practices to enhance system reliability and automation.

Everbridge Large https://www.everbridge.com
1001 - 5000 Employees

Job description

Are you motivated by an incredible sense of purpose in doing work that helps keep people safe? Are you passionate about innovating on cutting-edge technology to develop robust architecture principles, operability guidelines, and progressive scaling methodologies, and about implementing other sophisticated techniques to reliably operate infrastructure at scale? Do you have an appetite for securing systems, streamlining efficiency, automating away toil, and proactively eliminating problems before they occur? If so, this position is a perfect opportunity for you to join the Everbridge Federal Platform team.
As part of the Everbridge Federal Platform team, you will play a critical role in ensuring the overall service quality and availability of Everbridge's solutions. This includes designing, deploying, and managing services at scale, evangelizing SRE best practices, and helping to push the boundaries of the latest technology. The platforms that you will support are critical to the delivery of time-sensitive information to help keep people safe and businesses running. We are dedicated, passionate people who are committed to customer service and doing the right thing.

What You'll Do:
  • Keep people safe and businesses running.
  • Be an integral member of the team implementing our platform in a DoD IL4 cloud environment.
  • Maintain infrastructure from conception to completion within AWS, including services such as VPCs, EC2, Transit Gateways, IAM roles and policies, Route53, S3, SGs, and NACLs.
  • Build upon the operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's solutions.
  • Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other engineers on designing and implementing highly reliable solutions.
  • Research and implement SRE best practices by creating automation, fostering cross-functional collaboration, and making data-driven decisions to reinforce the integrity and reliability of our systems.
  • Participate in an on-call rotation to resolve production escalations.

What You'll Bring:
  • 2+ years of technical AWS experience, managing and owning systems in a production environment
  • 1+ years of Kubernetes experience (EKS, AKS, GKE, self-managed)
  • 2+ years of Terraform or similar IaC experience
  • 2+ years of experience with MongoDB or Elasticsearch/ELK administration
  • 2+ years of experience with application development or writing automation in Java
  • Experience with the following tooling: GitLab CI/CD, Packer, Docker, EKS, Kubernetes, Spinnaker, Helm, Argo, Jenkins
  • Experience with Telemetry tools such as Datadog, SumoLogic, Grafana, Prometheus
  • Experience with configuration management tools such as Salt, Ansible, AWS user_data
  • Experience with a DevOps/SRE production environment
  • Experience with Agile practices
  • UNIX/Linux experience
  • Experience working on DoD programs
  • Currently hold a Secret Clearance or be a US citizen with the ability to obtain a Secret Clearance
  • Must have or be able to obtain and maintain DoD 8140 “Intermediate” level or higher certification (formerly DoD 8570 IAM Level II)
The reasonably estimated salary for this role at Everbridge ranges from $84,400 to $112,500 and may also include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Everbridge offers a wide range of best-in-class, comprehensive, and inclusive employee benefits for this role, including healthcare, dental, parental planning, and mental health benefits, disability income benefits, life and AD&D insurance, a 401(k) plan and match, paid time off, and fitness reimbursements.

Required profile

    Experience

    Level of experience: Mid-level (2-5 years)
    Spoken language(s):
    English

    Other Skills

    • Collaboration
    • Problem Solving
