Career Opportunities: Senior Site Reliability Engineer - Network - Remote (4218)

Remote: Full Remote
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • BS in Computer Science or equivalent experience
  • 5+ years of Azure network design experience
  • Thorough understanding of networking protocols
  • Experience monitoring SaaS network topologies
  • Knowledge of Infrastructure as Code and scripting

Key responsibilities:

  • Implement a culture of SRE for network reliability
  • Design secure and fault-tolerant networks
  • Lead network monitoring and alerting initiatives
  • Conduct Root Cause Analysis on network issues
  • Automate system operations and share best practices
Donnelley Financial Solutions (DFIN)
1001 - 5000 Employees

Job description

Your missions

 

Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We’re here to help you make smarter decisions with insightful technology, industry expertise and data insights at every stage of your business and investment lifecycles. As markets fluctuate, regulations evolve and technology advances, we’re there. And through it all, we deliver confidence with the right solutions in moments that matter. 

Summary:

We are looking for technical team members at all levels who want to push themselves to deliver best-in-market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.

The Senior Site Reliability Engineer – Network is responsible for ensuring the networks in our SaaS products are fast, stable and optimized for our customers. SREs at DFIN take on availability, performance, change management, monitoring, and response, and are guardians of non-functional requirements.

You have either a network infrastructure background with a programmatic, automation-focused mindset or a software engineering background with extensive network infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate the manual work needed to keep our products up, running, and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and who can also operate independently to deliver solutions.

Responsibilities:
  • Champion and implement a culture of SRE to maintain a reliable and performant network infrastructure in DFIN SaaS products 
  • Design and implement secure, redundant, fault-tolerant networks in DFIN SaaS products, drawing on an understanding of networking protocols and network elements and how they integrate to create resilient networks
  • Choose and configure common network elements in SaaS product network topologies including load balancers, routers, DNS, etc.; provision route tables and routing paths in DFIN SaaS products so development teams do not have to 
  • Define, lead the implementation of, and maintain SaaS product network monitoring and alerting to prevent client-impacting issues and ensure the network availability, performance and scalability needed to meet SLOs and SLAs
  • Identify and remediate issues in SaaS product network infrastructure (high latency, timeouts, dropped connections, etc.) using diagnostic tooling and network traces; perform thorough Root Cause Analysis (RCA); drive vendor partners (Microsoft) to provide quality assurances by requiring immediate defect fixes, software updates, etc., as necessary to ensure an ideal customer experience 
  • Serve as a senior escalation point for SaaS product network issues and collaborate with DFIN IT to integrate SaaS products into broader DFIN network topologies 
  • Automate everything including system operational runbooks  
  • Dive deep into technology and stay on the forefront of the latest network analysis tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes    
  • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly  
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable to expectations   
  • Learn continuously and apply lessons learned   
  • Evangelize best practices, eliminate bottlenecks, and improve process 
  • Participate in on-call duties 24/7/365 and lead the triage and RCA of production incidents
Qualifications:
  • BS in Computer Science or equivalent work experience.  
  • Thorough understanding of common networking protocols including IP, TCP/IP, ICMP, DNS, DHCP, ARP, SSL, TLS and how to diagnose network issues by isolating problems at the protocol layer within specific network elements 
  • 5+ years of experience with Azure network design and network element configuration, including provisioning of routing tables
  • 5+ years of experience monitoring and preventing issues in SaaS network topologies in Azure
  • 5+ years of experience implementing network performance, availability, and scalability monitoring and alerting using tooling such as SolarWinds
  • 5+ years of experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
  • 5+ years of experience as a global admin of Azure, including cloud cost management
  • 5+ years of experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows or Linux environments
  • 5+ years of experience supporting public, client-facing, revenue-generating systems
  • Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology  
  • Experience planning, coordinating, developing and executing all stages of post-deployment verification test scripts
  • Experience securing Windows or Linux systems in a 24x7 production environment
  • Experience with containerization and managing Kubernetes clusters (AKS or EKS) 

It is the policy of Donnelley Financial Solutions to select, place and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran's status, actual or perceived sexual orientation, genetic information or any other protected status. 

If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access jobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to Accommodations@dfinsolutions.com.

Required profile

Experience

Level of experience: Senior (5-10 years)

Soft Skills

  • communication
  • collaboration
  • problem solving
