Site Reliability Engineer

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Strong understanding of Linux internals and core network principles., Familiarity with relational and NoSQL databases like PostgreSQL and MongoDB., Proficient in container orchestration tools such as Kubernetes and Docker., Experience with configuration management tools like Puppet and Ansible..

Key responsabilities:

  • Manage distributed infrastructure using open-source technologies across multiple datacenters.
  • Ensure product SLAs and perform capacity planning in a 24/7 on-call rotation.
  • Implement innovative platforms to enhance the efficiency of SRE teams.
  • Utilize data and metrics for decision-making with a focus on security and automation.

Newfold Digital logo
Newfold Digital Large https://newfold.com
1001 - 5000 Employees
See all jobs

Job description

Who we are.

Newfold Digital is a leading web technology company serving millions of customers globally. Our customers know us through our robust portfolio of brands. We have some of the industry's most prominent and storied go-to-market brands, including Bluehost, HostGator, Domain.com, Network Solutions, Register.com and Web.com. We help customers of all sizes build a digital presence that delivers results. With our extensive product offerings and personalized support, we take pride in collaborating with our customers to serve their online presence needs. The strength of our company livesin the intersection of our people, our customers, and our brands.

What you'll do & how you'll make your mark.

  • Manage distributed infrastructure with open-source technologies across multiple datacenters.
  • Ensure product SLAs, perform capacity planning, and address critical issues in a 24/7 on-call rotation.
  • Explore and implement innovative platforms As a service solution to support and enhance the efficiency of technical SRE teams.
  • Utilize data and metrics for decision-making, focusing on security and best practices.
  • Prioritize robust automation and scripting to reduce dependence on manual procedures.

Who you are & what you'll need to succeed.

  • Strong understanding of Linux internals, OS fundamentals, and core network principles.
  • Basic familiarity with relational databases (PostgreSQL, MySQL) and NoSQL databases (Redis, MongoDB).
  • Proficient in container orchestration tools like OpenShift, Kubernetes, Docker Swarm, or Apache Mesos.
  • Experienced in administering and troubleshooting configuration management tools such as Puppet, Ansible Tower (AWX), or Chef.
  • Hands-on experience in load balancer administration (HAProxy, Nginx, and F5).
  • Hands-on experience with caching technologies such as Redis, Nginx+, Varnish, or Memcached.
  • Skilled in monitoring and logging stacks such as Grafana, InfluxDB, Graphite, Prometheus, ELK, and Graylog.
  • Hands-on experience with web servers like Nginx, Apache, or Tomcat.
  • Skilled in at least one scripting language such as Python, Golang or Perl.

Not Applicable

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Decision Making
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs