Logo for DraftKings

Lead Site Reliability Engineer

Job description

At DraftKings, AI is becoming an integral part of both our present and future, powering how work gets done today, guiding smarter decisions, and sparking bold ideas. It’s transforming how we enhance customer experiences, streamline operations, and unlock new possibilities. Our teams are energized by innovation and readily embrace emerging technology. We’re not waiting for the future to arrive. We’re shaping it, one bold step at a time. To those who see AI as a driver of progress, come build the future together.

The Crown Is Yours

As a Lead Site Reliability Engineer, you will drive key initiatives to enhance the reliability, scalability, and efficiency of our infrastructure. You'll collaborate across teams to architect infrastructure automation while mentoring other Engineers to foster a culture of continuous learning and innovation. In this role, you will shape deployment strategies, performance tuning, and monitoring frameworks to support our rapid growth.

What You'll Do

  • Lead SRE initiatives across multiple projects and products, collaborating with cross-functional teams to shape platform and infrastructure engineering efforts across the organization.

  • Drive technical excellence by mentoring and guiding engineers, fostering a culture of continuous learning and innovation.

  • Architect and automate self-healing, fault-tolerant infrastructure with declarative configurations, GitOps, and event-driven automation for scalable deployments across public clouds and on-premise.

  • Design, develop, and maintain software-driven infrastructure automation to build internal tools and eliminate repetitive operational tasks.

  • Own and drive decisions on product deployment, performance tuning, monitoring, and alerting to ensure high availability and system efficiency in production.

  • Define key metrics and SLAs around new web services being created to support our rapid traffic growth.

  • Design and implement monitoring and alerting strategies to enforce application SLAs.

What You'll Bring

  • At least 6 years of experience managing distributed cloud environments (GCP, AWS, vSphere, Nutanix) and platform automation at scale.

  • Deep expertise in container orchestration (Kubernetes) and container runtimes (Docker, containers), with the ability to design, scale, and troubleshoot complex workloads.

  • Expert-level understanding of networking and web concepts, with the ability to debug issues down to the packet level.

  • Strong experience developing software for automation and infrastructure tooling (Go, Python).

  • Strong understanding of Linux-based operating systems, including performance tuning, bootloaders, storage, partitioning, kernel debugging, and low-level system optimizations.

  • Experience with Infrastructure as Code (IaC) and configuration management tools (Terraform, Ansible, Chef, etc.), ensuring scalable and repeatable infrastructure provisioning.

  • Understanding of applications written in various programming languages (C#/.NET, Java, Elixir, Ruby, etc).

  • Experience in AWS Greengrass IoT management and A/B booting. 

Join Our Team

We’re a publicly traded (NASDAQ: DKNG) technology company headquartered in Boston. As a regulated gaming company, you may be required to obtain a gaming license issued by the appropriate state agency as a condition of employment. Don’t worry, we’ll guide you through the process if this is relevant to your role.

The US base salary range for this full-time position is 148,000.00 USD - 185,000.00 USD, plus bonus, equity, and benefits as applicable. Our ranges are determined by role, level, and location. The compensation information displayed on each job posting reflects the range for new hire pay rates for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific pay range and how that was determined during the hiring process. It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Site Reliability Engineer (SRE) Related jobs

Other jobs at DraftKings

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.