Logo for Moniepoint Group

Site Reliability Engineer

Roles & Responsibilities

  • Minimum 3 years of experience supporting enterprise applications as an SRE or in a similar role, with proficiency in Java, Go, or Python.
  • Strong understanding of distributed systems, microservices architecture, and software design patterns.
  • Hands-on experience with Kubernetes and managing applications on major cloud providers (GCP, AWS, or Azure).
  • Experience setting up dashboards in Grafana and using APM tools (Datadog, New Relic, Signoz) with solid knowledge of metrics, logs, and traces; proficiency in SQL (PostgreSQL/MySQL).

Requirements:

  • Participate in on-call rotations to detect and triage service and reliability issues; serve as Incident Commander during major incidents, coordinating cross-functional teams and communicating status to stakeholders.
  • Create and maintain dashboards and alerts; work with development teams to instrument code for visibility.
  • Develop automation to eliminate manual and repetitive operational tasks (toil) across applications and infrastructure.
  • Implement and track SLIs and SLOs defined by engineering leadership; investigate and resolve customer complaints related to performance and reliability.

Job description

Who We Are

Moniepoint is an all-in-one financial services platform for emerging markets and the second-fastest growing company in Africa.

Since 2019, Moniepoint’s technology has powered over 3 million people, offering personal and business banking, payment, credit and business management tools to help them succeed. Moniepoint processed $182 billion in 2023, and currently processes the majority of the POS transactions in Nigeria.

What We Do

At Moniepoint, we are a customer-focused community, dedicated to crafting solutions that redefine our industry. We have several products that provide essential services for businesses, such as credit, overdrafts, etc. We leverage artificial intelligence and data to make our decisions, but also have the technology and data-driven best practices used to support our businesses.

Curious about what makes Moniepoint an incredible place to work? Check out posts on how we cultivate a culture of innovation, teamwork, and growth.

 

Job Summary

We are seeking a Site Reliability Engineer (SRE) responsible for ensuring our systems run smoothly and efficiently while engineering solutions to improve visibility, eliminate repetitive tasks, and increase system resilience. The ideal candidate will balance real-time on-call responsibilities with strategic engineering work to achieve sustainable and scalable service reliability.

 

Responsibilities

  • Participate in on-call rotations to detect and triage service and reliability issues across all environments. Act as the Incident Commander during major incidents: initiating war room or bridge calls, coordinating cross-functional teams, providing timely and clear status updates to all stakeholders.
  • Create and maintain meaningful dashboards and alerts. Work with development teams to instrument their code to ensure visibility.
  • Develop automation to eliminate manual and repetitive operational tasks (toil) related to reliability across both applications and infrastructure.
  • Implement and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) defined by the engineering leadership.
  • Investigate and resolve customer complaints escalated beyond L1 and L2 support, especially those involving performance, reliability, or complex system behavior.

Requirements

  • Minimum of 3 years of experience supporting enterprise applications as an SRE or similar role with proficiency in writing code in Java, Go or Python
  • Good understanding of distributed systems concepts, microservices architecture and software design patterns.
  • Hands-on experience with Kubernetes. You have managed applications on a major cloud provider (GCP, AWS, or Azure), and can troubleshoot common container issues.
  • Experience setting up dashboards in Grafana and using APM tools like Datadog, New Relic, Signoz.You have a  Solid understanding of metrics, logs, and traces.
  • Proficiency in SQL (e.g., PostgreSQL, MySQL). Ability to write complex queries to debug data issues and a basic understanding of database performance.

 

What we can offer you

  • Culture - We put our people first and prioritize the well-being of every team member. We’ve built a company where all opinions carry weight and where all voices are heard. We value and respect each other and always look out for one another. Above all, we are human.
  • Learning - We have a learning and development-focused environment with an emphasis on knowledge sharing, training, and regular internal technical talks.
  • Compensation - You’ll receive an attractive salary, pension, health insurance, annual bonus, plus other benefits.

What to expect in the hiring process

  • A preliminary phone call with the recruiter
  • A technical interview with the Hiring Manager
  • A behavioural and technical interview with a member of the Executive team. 

Moniepoint is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees and candidates.

 

 

 

Site Reliability Engineer (SRE) Related jobs

Other jobs at Moniepoint Group

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.