Match score not available

Technical Senior Manager of Site Reliability Engineering

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

9+ years in Systems Engineering and Architecture, including requirements definition and systems integration., Extensive experience in Cloud Computing with AWS, Azure, or GCP, and proficiency in Infrastructure-as-Code using Terraform and Ansible., Demonstrated success in team leadership, managing 6-8 contributors, and preparing teams for compliance audits., Strong interpersonal and problem-solving skills, with the ability to create technical documentation and diagrams..

Key responsabilities:

  • Allocate 70% of time to hands-on engineering tasks, including developing deployments and automation scripts.
  • Dedicate 30% of time to leadership duties, mentoring junior engineers, and managing escalations.
  • Act as the primary escalation contact for complex technical issues, ensuring high client satisfaction.
  • Coordinate day-to-day engineering activities, tracking progress and adjusting resources to meet project goals.

Coalfire logo
Coalfire Computer Hardware & Networking Large https://www.coalfire.com/
1001 - 5000 Employees
See all jobs

Job description

About Coalfire

Coalfire is on a mission to make the world a safer place by solving our clients’ hardest cybersecurity challenges. We work at the cutting edge of technology to advise, assess, automate, and ultimately help companies navigate the ever-changing cybersecurity landscape. We are headquartered in Denver, Colorado with offices across the U.S. and U.K., and we support clients around the world.

But that’s not who we are – that’s just what we do.
 
We are thought leaders, consultants, and cybersecurity experts, but above all else, we are a team of passionate problem-solvers who are hungry to learn, grow, and make a difference.

Position Summary

We’re looking for a Technical Senior Manager of SRE to play a central role in the implementation, and maintenance of scalable, secure, and high-performing systems—ensuring our clients’ mission-critical infrastructures remain stable and resilient. If you’re driven by a desire to innovate, excel at operational excellence, and thrive in a collaborative environment, come be part of a team committed to making the world a safer place.

What You'll Do
  • Allocate approximately 70% of time to hands-on engineering tasks, such as developing new deployments, tooling, and automation scripts to address client needs
  • Dedicate around 30% of time to leadership duties, including mentoring junior engineers, ensuring quality deliverables, and managing escalations
  • Act as the primary escalation contact for complex technical issues, resolving them promptly to maintain high levels of client satisfaction
  • Monitor and uphold quality standards for engineering work, confirming alignment with internal protocols, compliance regulations, and project milestones
  • Identify and mitigate risks in partnership with consulting and solutions architecture teams, ensuring regulatory requirements and client expectations are fully addressed
  • Coordinate day-to-day engineering activities, tracking progress and adjusting resources to meet project goals on schedule
  • Help create and implement solutions that improve the practice 

  • What You'll Bring
  • 9+ years in Systems Engineering and Architecture: Involving requirements definition, architecture development, systems integration, and testing.
  • 9+ years in Cloud Computing: Designing, implementing, operating, and automating environments within AWS, Azure, or GCP
  • 9+ years with Infrastructure-as-Code: Hands-on proficiency in Terraform and Ansible for orchestration and automation
  • SLA and Issue Management: Proven track record of meeting SLAs—particularly regarding availability, response times, and service posture—through effective collaboration and escalation processes
  • Operational Excellence: Demonstrated success driving continuous improvement via KPIs and best practices for operational support
  • Governance and Compliance: Experience guiding the creation of Infrastructure-as-Code solutions, governance models, and alignment with standards such as FedRAMP or other security frameworks
  • Team Leadership: Proven track record of managing teams (6–8 contributors), focusing on career development, goal setting, project oversight, and daily guidance
  • Regulatory Audit Prep: Prepared and coached teams for client-facing compliance audits with third-party auditors
  • Project Definition and Documentation: Lead efforts of defining, planning, and documenting key Managed Services projects and initiatives; tracked outcomes against established goals
  • Managed Services Expertise: Familiarity with ticket management systems and meeting SLA requirements in a managed services environment
  • Cloud & Automation: Extensive experience with AWS, Azure, or GCP; deep knowledge of Terraform, Ansible, GitLab, and CI/CD technologies
  • Technical Collaboration: Proven ability to collaborate with Site Reliability Engineers and cross-functional teams, facilitating team problem-solving and performance improvements
  • Soft Skills: Strong interpersonal, organizational, and problem-solving skills; effective at building client trust
  • Documentation & Communication: Capable of creating technical diagrams and comprehensive written documentation; able to convey complex ideas clearly
  • Professionalism & Autonomy: Demonstrated ability to work both independently and as part of a team with a professional attitude and demeanor
  • Security Mindset: Critical thinker capable of balancing stringent security and compliance requirements with mission objectives

  • Bonus Points
  • Consulting Experience: Previous roles in technical consulting for external clients
  • High-Availability Environments: Exposure to 24x7 operational settings or large-scale and high-availability system support
  • Encryption and Hardening: Demonstrated expertise implementing SSL, PKI, FIPS 140-2, and enforcing security baselines such as CIS Benchmarks and DISA STIG
  • Further Cloud and Security Specialization: Additional hands-on work with container orchestration (Kubernetes), advanced threat detection, or enterprise endpoint security
  • Why You’ll Want to Join Us

    At Coalfire, you’ll find the support you need to thrive personally and professionally. In many cases, we provide a flexible work model that empowers you to choose when and where you’ll work most effectively – whether you’re at home or an office.

    Regardless of location, you’ll experience a company that prioritizes connection and wellbeing and be part of a team where people care about each other and our communities. You’ll have opportunities to join employee resource groups, participate in in-person and virtual events, and more. And you’ll enjoy competitive perks and benefits to support you and your family, like paid parental leave, flexible time off, certification and training reimbursement, digital mental health and wellbeing support membership, and comprehensive insurance options.

    At Coalfire, equal opportunity and pay equity is integral to the way we do business. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. Coalfire is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities. To request reasonable accommodation to participate in the job application or interview process, our Human Resources team at HumanResourcesMB@coalfire.com.

    Required profile

    Experience

    Industry :
    Computer Hardware & Networking
    Spoken language(s):
    English
    Check out the description to know which languages are mandatory.

    Other Skills

    • Professionalism
    • Communication
    • Organizational Skills
    • Social Skills
    • Problem Solving

    Site Reliability Engineer (SRE) Related jobs