Match score not available

Senior Site Reliability Engineer

extra holidays - extra parental leave - fully flexible
Remote: 
Full Remote
Contract: 
Salary: 
115 - 115K yearly
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

5+ years in DevOps or SRE, Deep knowledge of cloud environments (AWS or GCP), Expertise in containerized environments (Docker and Kubernetes), Proficiency in scripting languages like Python or Bash, Experience with CI/CD pipelines.

Key responsabilities:

  • Design scalable infrastructure for cloud services
  • Monitor and optimize system performance
  • Automate deployment and management processes
  • Ensure high availability and disaster recovery solutions
  • Collaborate across teams for application reliability
Tekmetric logo
Tekmetric Scaleup https://linktr.ee/
51 - 200 Employees
See more Tekmetric offers

Job description

About Tekmetric

Tekmetric is a cloud-based auto-repair shop management system with an easy-to-use workflow and a modern approach to customer care. Tekmetric champions transparency, integrity, innovation—and above all—a service-mentality that puts customers first.

Founded in Houston, Texas in 2015, Tekmetric has been providing reliable, fast customer service to shops from day one. Our team is growing quickly, but our service philosophy of listening to customers still remains a core value.

We’re looking for hungry candidates to help us strengthen our industry-leading team. We’re building our software to be more intuitive. We’re building more integrations that make our customers’ lives easier. We’re building better internal processes to make our globally distributed organization run more smoothly. We’re building stronger relationships with the best and brightest partners in our industry.

Our customers love our products. We love serving them. And we love the journey we’re on together.

Come build with us!

What You’ll Do

  • Design and implement scalable infrastructure: Architect and maintain reliable, scalable, and secure cloud infrastructure that supports positive user experiences and measurable business growth.
  • Monitor and optimize system performance: Develop and maintain monitoring, alerting, and incident response practices to ensure system reliability and performance at scale.
  • Automate everything: Create automated pipelines for deployment, testing, and infrastructure management to improve speed, consistency, and reliability across the organization.
  • Ensure high availability and disaster recovery: Implement and manage solutions for backup, disaster recovery, and failover processes to ensure business continuity.
  • Security and compliance: Apply best practices in security, monitoring, and compliance, ensuring that systems meet necessary requirements and regulations.
  • Collaboration: Work cross-functionally with development, data, product, and QA teams to improve application reliability and scalability.
  • Leadership and mentorship: Provide technical leadership, mentorship, and guidance to junior DevOps team members, fostering a culture of continuous learning and improvement.

What You’ll Bring

  • Experience: 5+ years of experience in DevOps, Site Reliability Engineering (SRE), or a related field, with deep knowledge of cloud environments (preferably AWS or GCP.).
  • Cloud Infrastructure: Hands-on experience with AWS (or similar cloud providers) and infrastructure as code (Terraform, etc.).
  • Automation: Strong experience in automation tools
  • Containerization: Expertise in working with containerized environments like Docker and orchestration tools such as Kubernetes.
  • Monitoring and Logging: Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack).
  • Scripting: Proficiency in scripting languages like Python, Bash, or similar.
  • CI/CD pipelines: Experience with designing and optimizing Continuous Integration and Continuous Deployment (CI/CD) pipelines.
  • Collaboration and Communication: Strong communication skills and ability to work cross-functionally, solving complex technical challenges in a collaborative manner.
  • Problem-solving mindset: Ability to troubleshoot and resolve critical issues in high-pressure environments, maintaining composure and professionalism.
Bonus Points:
  • Experience with Infrastructure as Code tools like Terraform.
  • Familiarity with monitoring tools like Prometheus, Grafana, or the ELK stack.
  • Exposure to compliance and security best practices in cloud environments.
  • Experience coding in one or multiple programming languages such as Go, Java, Javascript.

Who You Are

Successful candidates will also demonstrate many of the characteristics that our core values represent:

  • Build things that matter
    • You have a love of building something new or improving on current processes and care about making a positive difference.
  • We’re all entrepreneurs
    • You love learning new things and are comfortable working in a startup-like, dynamic environment -- moving quickly, even in the face of ambiguity. You are a self-directed leader who can jump in, structure their own work, and figure out how best to execute a plan yourself and with others. At Tekmetric our leaders are all players and coaches.
  • Yes before no
    • You keep an open mind and are excited about new ideas and helping others actualize their ideas. You are intellectually curious and analytical in a strategic context.
  • We matter to each other
    • You care about people and see the success of one is success for us all. You are a highly ethical individual with unquestioned integrity and the experience, confidence, and stature to protect confidential information in a growing company.

 

What We Offer:

Healthcare Insurance and Leave:

  • Flexible and remote work opportunities
  • Generous PTO
  • Exceptional leave programs for all of life’s moments: maternity, paternity and parental bonding, as well as medical leave to care for yourself or loved ones
  • Excellent Medical, Dental, Vision and Prescription Drug Coverage

Financial Benefits:

  • 401(k) Retirement Savings Plan with a 6% Match
  • Employer covered STD, LTD, Life and AD&D Insurance Programs
  • Up to $60 monthly for wellness expenses and activities
  • Education Assistance- includes undergraduate/graduate courses and continuing education

Most importantly, we have a stellar team of coworkers, a really cool office, and lots of fun activities!

 

Tekmetric is an equal opportunity employer. We hire hard working individuals, regardless of gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, or veteran status. We know that when our employees feel appreciated and included, they can be more creative, innovative and successful.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Leadership
  • Mentorship
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs