Site Reliability Engineer

Offer summary

Qualifications:

  • 5+ years experience in DevOps, SRE or related roles
  • Expertise with cloud services, specifically Google Cloud Platform
  • Strong advocacy for SRE principles and practices
  • Proven leadership in adopting and scaling SRE principles across organizations

Key responsibilities:

  • Creating and managing service level objectives (SLOs)
  • Monitoring application performance and incident management
  • Installing and maintaining reliable alerting and observability dashboards
  • Collaborating across teams on SLO-driven OKRs

PDQ Computer Software / SaaS SME https://www.pdq.com/
51 - 200 Employees

Job description

About PDQ

PDQ, founded in Salt Lake City, UT, makes device management simple, secure, and Pretty Damn Quick through our products Deploy, Inventory, Connect, Detect, SimpleMDM and SmartDeploy. IT teams use our products to reduce complexity, improve efficiency, and enhance control in their unique environments. We are backed by TA Associates and Berkshire Partners, top-tier global private equity firms.

PDQ's Core Values: Honesty, Ownership, Collaboration and Improvement

Job Description

  • At this time, qualified candidates for this role may reside in any of the following US states: AR, AZ, CO, CT, FL, GA, ID, IL, IN, KY, MD, MI, MN, MO, NC, NH, OK, OR, TN, TX, UT, VA, WA, WI.

As the first dedicated Site Reliability Engineer at PDQ, you will build and shape the foundation for the reliability, availability, performance, and scalability of PDQ's systems and services, emphasizing automation, proactive system management, and efficient operations.

What You'll Be Doing

  • Creating and managing service level objectives (SLOs)
  • Collaborating across teams on SLO-driven OKRs
  • Installing and maintaining reliable alerting and observability dashboards
  • Ensuring new & existing features have monitoring and alerting
  • Monitoring application performance
  • Incident management and response
  • Reporting on stability and performance
  • Assisting with load testing and synthetic testing
  • Sharing knowledge and advocating for performance and reliability work across the engineering org

We're Looking For People Who Have

  • 5+ years experience in DevOps, SRE or related roles
  • Proven leadership in adopting and scaling SRE principles across organizations
  • Experience working with large-scale systems
  • Expertise with:
    • Cloud services, Google Cloud Platform specifically
    • PromQL or an equivalent query language
    • Monitoring and observability (Bonus: Prometheus, Grafana, and GroundCover knowledge)
  • Proven ability to plan and implement strategies for scaling infrastructure to meet future demands
  • Strong advocacy for SRE principles, such as reducing toil, building reliable systems, and incorporating observability early in the development lifecycle
  • Strong sense of responsibility for system uptime and reliability

Who You Are

  • Ownership: You take responsibility for projects, drive results, and deliver on commitments
  • Honesty: You demonstrate integrity, transparency, and ethical behavior in all interactions
  • Collaboration: You work effectively with cross-functional teams and foster a culture of teamwork
  • Improvement: You continuously seek opportunities for growth, innovation, and personal development
  • Willing to take full ownership of launching and driving all SRE-related initiatives, ensuring our systems are scalable, reliable, and efficient as we grow

Tools We Use

  • Prometheus & Grafana
  • GroundCover
  • Sentry
  • Kubernetes
  • Google Cloud Pub/Sub
  • GitHub Actions
  • Cloudflare
  • Infrastructure as Code: Terraform
  • Slack

PDQ Perks & Benefits

PDQ offers all of the great perks and benefits you'd expect from working at a very cool tech company, and even some you might not expect, including:

  • 4-Day Work Week
  • Managers who champion professional development
  • 100% Premium Coverage for medical, dental and vision for you and your dependents
  • 100% Premium Coverage for Short Term Disability, Long Term Disability, Life, and AD&D Insurance
  • Company Match of the first 6% of your employee deferrals
  • Flexible Paid Time Off Policy that treats you like the adult that you are
  • Health Savings Account (HSA) and wellness incentives
  • Quarterly Company Values Award (team member nominated)

PDQ is proud to be an equal opportunity workplace and does not discriminate on the basis of sex, race, color, age, pregnancy, sexual orientation, gender identity or expression, religion, national origin, ancestry, citizenship, marital status, military or veteran status, genetic information, disability status, or any other characteristic protected by federal, provincial, state, or local law. If you would like to request reasonable accommodation for a medical condition or disability during any part of the application process, please contact hr@pdq.com.

The majority of PDQ's full-time roles do not qualify for sponsorship of employment visas such as the H-1B visa. This applies to scenarios where a candidate might possess temporary work authorization during their schooling or after graduation (e.g., CPT, OPT), but would require H-1B visa sponsorship within a few years of employment to retain eligibility for employment.

Required profile

Experience

Industry: Computer Software / SaaS
Spoken language(s): English

Other Skills

  • Collaboration
  • Honesty
