
Cloud Site Reliability Engineer

work from anywhere - fully flexible
Remote: Full Remote
Contract: 
Experience: Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

  • Proficiency in Linux and the command line
  • Experience with Kubernetes and Helm
  • Understanding of monitoring tools
  • Scripting expertise in Bash
  • Knowledge of AWS and CI/CD tools

Key responsibilities:

  • Support cloud systems and 24x7 monitoring
  • Automate processes and maintain documentation
  • Collaborate with teams for service delivery
  • Enhance monitoring tools and provide support
  • Contribute to process improvement and team knowledge
Namecheap, Inc
Industry: Information Technology & Services
Company size: Large (1001 - 5000 Employees)
Website: https://www.namecheap.com/

Job description

As a Cloud Site Reliability Engineer, you’ll be at the forefront of innovation, working on our cloud products platform to ensure stability and optimal performance.


Where you’ll do it: This role is 100% remote as long as you’re within +/- 2 hours of the EET or Central European time zone.

The Interview Process: The 4-week process has 4 stages: a 45-minute HR chat ➡ a 45-minute Values & Technical chat ➡ a Home Task ➡ a 45-minute Technical interview.

Technologies: Linux, Kubernetes, CI/CD, Prometheus, Helm, Bash

Reporting to: Cloud SRE Team Leader

Your team: You’ll join a team of 7 (Cloud SRE Lead, 5 Cloud SRE Engineers, and YOU!)


What will make your journey with us amazing?

A supportive manager who cares about your well-being and is invested in your professional growth.

A culture of continuous learning, with clear targets and feedback.

A global company with over 2600 employees located in more than 26 countries around the world, including offices in 3 countries: Ukraine, Portugal, and India.

What will you do?

The Cloud SRE team supports our cloud system, takes care of monitoring platforms, and provides 24x7 "Always On" support through on-call rotations. We automate manual processes, enhance monitoring tools, maintain documentation, and collaborate with other teams to ensure effective service delivery to customers.

What will you bring?

- Kind, empathetic, and collaborative personality, willing to learn and share knowledge openly.

- Proficiency in command-line interfaces, *nix systems (Linux, Ubuntu), and Git.

- Experience working with Kubernetes clusters, both Docker- and CRI-O-based, and familiarity with Helm charts.

- Understanding of monitoring tools such as Prometheus, Grafana, and Alertmanager.

- Demonstrated expertise in scripting (Bash).

- A proactive approach to taking ownership, supporting new ideas, and following through from ideation to post-release support.

- An autonomous and flexible working style, able to contribute independently and collaboratively, with strong research and analytical skills for informed decision-making.

- And as a bonus, we value a good sense of humor!

Will be a plus:

- Knowledge of AWS

- Experience with CI/CD tools like ArgoCD and FluxCD

- Experience with Ansible, Terraform, and New Relic

- Knowledge of programming languages like Python, Go, and PHP

What’s in it for you?


Embrace a 100% remote lifestyle with this opportunity!

Work with flexibility in a supportive environment where you have the autonomy to manage your time, while also staying connected with the team through daily check-ins and shared office hours. We value collaboration and commitment to team goals, balancing independence with structured support to ensure we all succeed together.


- Invest in your growth with dedicated learning resources and support.

- Thrive in a culture rooted in truth, trust, and transparency.

- Unleash your creativity and explore new ideas with 2 dedicated R&D days each month!

- Stay ahead of the curve with weekly team knowledge-sharing sessions.

- Escape the meeting marathon with 3 meeting-free days per week.

- Enjoy generous vacation policies to recharge when you need it.

- Be a part of a unique team, not just another "cloud-shop" - we run our own infrastructure!


#NamecheapCareers

#HackYourCareer

#equalopportunity

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry: Information Technology & Services
Spoken language(s): English

Other Skills

  • Analytical Skills
  • Decision Making
  • Collaboration
