Match score not available

Site Reliability Engineer, Senior

Remote: 
Full Remote
Contract: 
Salary: 
85 - 193K yearly
Experience: 
Senior (5-10 years)
Work from: 
Virginia (USA), United States

Offer summary

Qualifications:

5+ years as Software Engineer, Developer, or DevOps Engineer, 4+ years managing large scale AWS environments, 2+ years using monitoring tools like Prometheus or Grafana, Experience with SRE practices including monitoring instrumentation, Ability to obtain a Secret clearance.

Key responsabilities:

  • Enhance system resilience and efficiency
  • Build resilient infrastructure with redundancy and monitoring tools
  • Automate tasks and script routine processes
  • Assist junior engineers while expanding technical knowledge
  • Evaluate production readiness and post-mortems
Booz Allen Hamilton logo
Booz Allen Hamilton Information Technology & Services XLarge https://www.boozallen.com/
10001 Employees
See more Booz Allen Hamilton offers

Job description

Site Reliability Engineer, Senior

The Opportunity:

Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or software development—if you have a passion for making systems better, we need you! 

As a site reliability engineer (SRE) on our team, you’ll work with the government and its affiliates on the development of more robust systems by building a resilient infrastructure. You’ll build in redundancy, implement monitoring tools, and automate wherever possible. You’ll reduce toil by scripting routine tasks and automating self-repair. This is your chance to leverage your expertise in monitoring and observability while assisting junior engineers and broadening your knowledge base.
 
Join us. The world can’t wait. 

You Have: 

  • 5+ years of experience working as a Software Engineer, Developer, or DevOps Engineer

  • 4+ years of experience provisioning, operating, and maintaining large scale production AWS environments

  • 2+ years of experience monitoring disparate applications or infrastructure using tools such as Prometheus, Grafana, Splunk, Dynatrace, DataDog, CloudWatch, or OpenSearch

  • Experience applying fundamental SRE practices and principles, including monitoring instrumentation, SLI, SLO, and supporting Error Budget development, evaluating production readiness, post-mortems, and reducing toil

  • Experience with infrastructure automation tools, including Terraform or CloudFormation

  • Experience with Git repositories, CI/CD concepts, and leveraging Infrastructure as Code (IaC) to configure AWS Cloud environments

  • Experience with programming or scripting languages, including Bash, Python, or Go for automation purposes and building a scalable infrastructure in AWS

  • Experience with Agile methodologies, SDLC, and working in an Agile development environment

  • Ability to obtain a Secret clearance

  • HS diploma or GED

Nice If You Have: 

  • Experience with making systems fully observable, manipulating and transforming telemetry, and monitoring distributed complex architected systems across separate regions

  • Experience applying advanced SRE practices and principles, including capacity planning, cost optimization, chaos engineering, self-healing architecture, and advanced alerting techniques

  • Experience integrating monitoring to ITSM tooling, including Service Now, Jira Service Desk, Pager Duty, VictorOps, OpsGenie, or Everbridge

  • Experience with Gitlab CI for CI/CD deployments

  • Experience with serverless computing and container orchestration technologies, including AWS Lambda, Amazon ECS, and EKS Clusters

  • Experience with cloud computing concepts and AWS services, including network and security concepts and services, such as VPCs, Security Groups, VPNs, Firewalls, WAF, or TLS

  • Ability to design, plan, and implement scalable and resilient systems and troubleshoot complex technical issues

  • Possession of excellent documentation, problem-solving, and collaboration skills

  • Possession of excellent oral and written communication skills

  • AWS Certified Solutions Architect - Associate, AWS Certified Developer - Associate, or other AWS Certification

​ 

Clearance: 

Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information.

Compensation

At Booz Allen, we celebrate your contributions, provide you with opportunities and choices, and support your total well-being. Our offerings include health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care. Our recognition awards program acknowledges employees for exceptional performance and superior demonstration of our values. Full-time and part-time employees working at least 20 hours a week on a regular basis are eligible to participate in Booz Allen’s benefit programs. Individuals that do not meet the threshold are only eligible for select offerings, not inclusive of health benefits. We encourage you to learn more about our total benefits by visiting the Resource page on our Careers site and reviewing Our Employee Benefits page.

Salary at Booz Allen is determined by various factors, including but not limited to location, the individual’s particular combination of education, knowledge, skills, competencies, and experience, as well as contract-specific affordability and organizational requirements. The projected compensation range for this position is $84,600.00 to $193,000.00 (annualized USD). The estimate displayed represents the typical salary range for this position and is just one component of Booz Allen’s total compensation package for employees. This posting will close within 90 days from the Posting Date.

Identity Statement

As part of the application process, you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud.

Work Model
Our people-first culture prioritizes the benefits of flexibility and collaboration, whether that happens in person or remotely.

  • If this position is listed as remote or hybrid, you’ll periodically work from a Booz Allen or client site facility.
  • If this position is listed as onsite, you’ll work with colleagues and clients in person, as needed for the specific role.

EEO Commitment

We’re an equal employment opportunity/affirmative action employer that empowers our people to fearlessly drive change – no matter their race, color, ethnicity, religion, sex (including pregnancy, childbirth, lactation, or related medical conditions), national origin, ancestry, age, marital status, sexual orientation, gender identity and expression, disability, veteran status, military or uniformed service member status, genetic information, or any other status protected by applicable federal, state, local, or international law.

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Problem Solving
  • Non-Verbal Communication

Site Reliability Engineer Related jobs