Logo for Unit Group

Site Reliability Engineer

Roles & Responsibilities

  • 4+ years of relevant work experience
  • Strong experience with Kubernetes, Docker, Git, Linux, Bash, Terraform
  • Proficiency with AWS or other cloud-hosted services
  • Software development skills and proficient in at least one language

Requirements:

  • Design, implement, and maintain our infrastructure using best practices
  • Create and support CI/CD pipelines
  • Deploy enterprise-scale projects on AWS and Hetzner
  • Automate key processes, including build, release, and monitoring (alerting and observability), in the development and deployment of both infrastructure and products

Job description

This is a remote position.

As an SRE you’ll join our team in building the infrastructure needed to support the rest of our engineering department. You’ll help to create a stable foundation for our engineers to build off of and tools that are highly available, cost-efficient and extensible. As we continue to scale and embrace the DevOps culture, this team will be looked to for guidance and mentorship.

We’re looking for someone who is excited to learn, a great team player, and strives for doing the right things the first time around - knowing that it may take longer but understands that there’s a balance to be achieved and the importance of quality.

What You’ll Do
  • Design, implement, and maintain our infrastructure using best practices
  • Create and support CI/CD pipelines
  • Deploy enterprise-scale projects on AWS and Hetzner
  • Work with the latest technology like Kubernetes
  • Automate key processes, including build, release, and monitoring (alerting and observability), in the development and deployment of both infrastructure and products
  • Design and execute technical solutions that improve speed and quality
  • Monitor system performance and troubleshoot issues
  • Participate in the on-call rotation to support our applications
  • Collaborate with team members and other staff to further develop a DevOps culture
  • Ensure security and compliance requirements are met


Requirements

  • 4+ years of relevant work experience
  • Strong experience with Kubernetes, Docker, Git, Linux, Bash, Terraform is a must
  • Experience with Helm, GitHub Actions, Ansible will be a plus
  • Confidence working with monitoring and alerting systems such as Prometheus
  • Experience with Datadog will be a plus
  • Experience in the testing and deployment of complex software solutions in a fast-paced, cloud environment
  • Experience with supporting Java/Kotlin apps will be a plus
  • Proficiency with AWS or other cloud-hosted services
  • Solid understanding of software design patterns
  • Software development skills and proficient in at least one language
  • Demonstrate ability to learn quickly and pick up new technologies
  • Highly analytical with a passion for finding solutions to tough problems
  • Responsible, proactive and honest
  • Excellent communication and collaboration skills


Site Reliability Engineer (SRE) Related jobs

Other jobs at Unit Group

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.