Senior Site Reliability Engineer (100% remote-friendly within Spain)

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Experience with monitoring tools like DataDog, OTEL, or Prometheus., Strong investigative skills for troubleshooting complex issues., Familiarity with .NET and AWS services., Practical experience with Kubernetes and understanding of microservices architecture..

Key responsibilities:

  • Ensure system reliability and availability through monitoring and incident response.
  • Investigate incidents and implement long-term fixes based on root cause analysis.
  • Define and maintain service level objectives to drive service quality.
  • Collaborate with developers to enhance system scalability and automate operational tasks.

Doctoralia logo
Doctoralia XLarge https://www.doctoralia.es/
201 - 500 Employees
See all jobs

Job description

Company Description

Welcome to the good side of tech đź‘‹

You might have heard about us, but with a different name: Znanylekarz. It all started 10 years ago when we asked ourselves: is anyone in healthcare thinking about patients? We jumped in and we empowered patients by giving them access to leave and read reviews about their visit. We then provided doctors with the technology to manage bookings easily and save time, so they could devote themselves to what they always wanted: treating patients. And today is the day in which we ask you: wanna join us in the next step of making the healthcare experience more human?

 Docplanner at scale

We are leaders in 13 countries so far, and more than 90 million patients trust us every month. 280.000 specialists believe in us and our product, and so do leading venture capital funds such as Point Nine Capital, Goldman Sachs Asset Management, and One Peak Partners. And yet, employing over 2.900 people all over the globe, we managed to keep the startup mindset we started with over 10 years ago.

How does Docplanner Tech fit here?

At Docplanner Tech we are a diverse group of ~325 people working in teams that include engineers, designers, product managers, data and research. We are responsible for building the product for all locations. Taking care about talent matters to us, that's why the engineering team has an amazing balance of new team members and people that have been in DP for years.

We could tell you about us, but we will let our reviews on Glassdoor speak for themselves. In case you’d like to see how it feels to be 100% yourself at work, here’s a video of us

And why should you join us?

Because it feels good to tell your family and your friends how you made the world a little bit better. You go to bed knowing that what you do matters, and that your talents align with your beliefs.

We want to make the healthcare experience more human, and that starts with you being you. We believe that taking the diversity of human experience into account makes a better healthcare experience for all. We’re not just different: we embrace diversity. We will encourage you to come to work your whole self, and that includes not coming to the office at all if you prefer not to, as we're 100% remote-friendly.

Job Description

At Docplanner, we love building software that makes a real difference. Our site reliability engineers (SREs) play a key role in making sure our users get powerful features, fast performance, and rock-solid reliability — so they can focus on what matters most to them. As more and more customers rely on our platform, we’re looking for an experienced SRE to help us build great foundations. We’re after someone who brings fresh ideas, a unique perspective, and things like an owner — just like us — to build practical solutions and great user experiences every step of the way.

Objectives of this role

  • Operate production environments by monitoring availability and taking a holistic view of system health.

  • Measure and optimize system performance to stay ahead of customer needs and drive continuous innovation.

  • Improve reliability, quality, and time-to-market of our suite of software solutions.

  • Provide primary operational support and engineering expertise for multiple large-scale, distributed software applications.

Responsibilities

  • Ensure reliability and availability of systems through monitoring, alerting, and incident response.

  • Investigate and resolve incidents, perform root cause analysis, and implement long-term fixes.

  • Define and maintain SLOs/SLIs to measure and drive service quality.

  • Continuously improve performance and optimize infrastructure cost and resource usage.

  • Collaborate with developers to build scalable, fault-tolerant systems and improve deployment practices.

  • Automate operational tasks to reduce manual toil and improve efficiency.

Qualifications

What will help you thrive?

  • Monitoring and observability - Experience with monitoring stack like DataDog / OTEL / Prometheus.

  • Detective mindset - Strong investigative mindset with a detective-like approach to troubleshooting and resolving complex issues.

  • .NET experience - Familiar with .NET environment and ability to code.

  • AWS experience - Experience working with AWS services and cloud-native architectures.

  • Kubernetes - Practical experience deploying, managing, and troubleshooting applications in Kubernetes; understanding of containers, Helm, and scaling strategies.

  • Think like an owner - Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.

  • Communicator – Equally fluent when talking to humans or machines; clear, effective communication across teams and tools.

Nice to have

  • Proficiency in scripting or programming with languages such as Python or Go – to support automation and tooling development.

  • Hands-on experience in Site Reliability Engineering practices – including incident management and service-level objectives.

  • Understanding of microservices architecture – with experience in designing, observing, and troubleshooting distributed systems.

Additional Information

Let’s talk money

  • salary adequate to your experience and skills. The range is broad so that we can accommodate our roles for all levels of experience, but we will show you the career ladder to explain where we see your skills and impact within the company". Your salary will be, now and always, 100% transparent to you;
  • Flexible remuneration and benefits system via Flexohwhich includes: restaurant card, transportation card, kindergarten, and training tax savings;
  • Share options plan after 6 months of working with us.

True flexibility and work-life balance

  • Remote or hybrid work model with our hub in Barcelona;
  • Flexible working hours (fully flexible, as in most cases you only have to be on a couple of meetings weekly);
  • Summer intensive schedule during July and August (work 7 hours, finish earlier);
  • 23 paid holidays, with exchangeable local bank holidays;
  • Additional paid holiday on your birthday or work anniversary (you choose what you want to celebrate).

Health comes first 

  • Private healthcare plan with Adeslas for you and subsidized for your family (medical and dental);
  • Access to hundreds of gyms for a symbolic fee in partnership for you and your family with Wellhub;
  • Access to iFeel, a technological platform for mental wellness offering online psychological support and counseling. 

 

We promote and embrace equal opportunities in our hiring process, and also every day at work. When you apply for our roles you receive equal treatment regardless of age, disabilities, gender reassignment, marital or civil partner status, pregnancy or parental status, race, colour, nationality, ethnic or national origin, religion or belief, sex, sexual orientation or any other dimension of human difference.  If you require additional support in your recruitment process, we kindly encourage you to let us know. Behind those words you’re reading, there’s a person (hi!) who already helped a candidate by adapting the interviews, and now we’re lucky to have this person with us. So, even if you’ve never asked for it before, may this serve as a sign that, now, you can do so. We can only truly be equal if we adapt to each other.

“We believe all humans, in all their beautiful diversity, should have equal rights, dignity and respect. Period.” Mariusz Gralewski,  CEO

 

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs