Site Reliability Engineer

extra holidays - extra parental leave
Remote: Full Remote

Offer summary

Qualifications:

  • At least 5 years of experience in SRE, DevOps, or engineering roles.
  • Strong understanding of SRE and DevOps methodologies, including the build and deployment cycle.
  • Experience with observability tools such as Grafana, Loki, and Prometheus.
  • Proficiency in programming languages like Java, Python, or Node.js.

Key responsibilities:

  • Ensure the stability, resilience, and scalability of services through automation and infrastructure engineering.
  • Collaborate with feature teams to implement monitoring and ensure safe service changes.
  • Lead troubleshooting efforts for complex incidents and maintain observability across services.
  • Contribute to strategic goals by maximizing developer velocity and ensuring product reliability.

WorldRemit Financial Services Large https://www.worldremit.com
1001 - 5000 Employees

Job description

About Zepz

Zepz Group is the group powering leading global remittance brands: WorldRemit and Sendwave. Zepz Group has been disrupting an industry previously dominated by offline legacy players by reducing the barriers to finance and increasing safety and convenience for users. Every day, Zepz Group and its brands work towards unlocking the prosperity of cross-border communities through finance and technology - driven by the vision of a world that celebrates migrants’ impact on prosperity, at home and abroad. Zepz has served over 9 million users through its presence in over 4,600 corridors, with over 40 send countries and 90 receive countries.

Come join us!

Zepz.io

Our Commitments:
  1. We act like owners - We are relentlessly delivering for our users and spending money thoughtfully. 
  2. We embrace embarrassing honesty - We function best when we're open and honest with one another — especially about our challenges and doubts. 
  3. We have a bias to action - We get to first outcomes quickly, iterate and learn. 
  4. We strive to be better - We may make mistakes, but always learn from them.
  5. We are inclusive - to better reflect and serve our users.
About the role

Working in the Site Reliability Engineering team, you’ll be helping ensure the stability, resilience and scale of our services through automation, observability and infrastructure engineering. The work is varied: from helping engineering teams deploy monitoring to designing and implementing new SRE tools and techniques, our team is proactive and always involved. We are a fast-moving team operating in a growing Fintech company, supporting engineers on three continents. We use a modern DevOps and SRE tech stack (GitHub Actions, K8s, ArgoCD, Grafana, AWS, Terraform) and Agile working practices to get the job done. As a member of Zepz’s SRE team you will aim high, embrace challenges and always do what’s right, acting with integrity and building trust as you contribute to the company’s technical direction and long-term decision making.

Reporting to the SRE Manager you will:
  • Use code to solve problems. Configuration, infrastructure, tooling, and automation: everything must be solved by writing high-quality code that performs and scales.
  • Use best practices and standards for observability, monitoring, alerting, capacity planning, availability, performance/latency, change, and troubleshooting across all our Tech services.
  • Work closely with feature teams to ensure that services are correctly monitored, change is delivered in a safe and secure way, resilience is built into our product, and our standards and best practices are adopted.
  • Lead or be involved in the troubleshooting of complex incidents and problems.
  • Have visibility of the end-to-end service to our customers and ensure their journey is stable and consistent across all the microservices and third-party dependencies, using the observability tooling you will have implemented with the Engineering teams.
  • Help the team meet its strategic goals: to maintain the highest level of observability, maximize developer velocity while keeping our product reliable, and ensure that we can deliver the highest quality experience to our customers.
  • Grow together. You’ll review others' work and happily seek feedback on yours to ensure we build a better codebase and sharpen each other's skills.
What we’re looking for from you
  • A skilled Engineer. At least 5 years in an SRE, DevOps or Engineering role, with a keen interest in solving problems using automation.
  • Understanding of SRE and DevOps methodologies. You understand the build and deployment cycle of an application, and how to operate a resilient system.
  • A focus on observability. Observability is key to operating a truly reliable and scalable system. We are looking for engineers who can "Monitor Everything & Measure Everything", driving a culture of observability. Experience with Grafana, Loki and Prometheus.
  • Holistic view of application delivery. You understand how many systems (monitoring, logging, alerting, and scaling) combine to build a robust platform that can respond to varying demands from both external sources (traffic) and internal sources (feature team delivery) in a safe and controlled manner. You have experience supporting or developing applications written in Java, Python or Node.js.
  • Systematic problem-solving approach. You should have an understanding of how to analyze and troubleshoot large-scale distributed systems.
  • Happy in the Clouds. Our Cloud Native platform is hosted on AWS. You’ll be comfortable working with a system that supports users from around the world, at scale. 
  • Bias for action. You see a problem, you fix a problem. You get buy-in for your solutions and keep tickets moving. We’re always looking for ways to ship at pace.   
  • Growth mindset. A willingness to use your skills and experience to mentor less-experienced engineers. A desire to learn from others and make yourself better every day. 
  • Agile outlook. You need to be excited about working in a fast-changing environment. Products, tools, frameworks and processes change; we evolve and take the best bits with us. The teams drive the evolution.
  • Disciplined and self-managed. You need to own your role and be disciplined about adhering to protocols and processes. As a senior engineer, you will always ensure you are bringing value to the team and driving tasks to completion without being actively managed.
Bonus points if you:
  • Have experience working in a FinTech space
  • Have experience working in a distributed team across different geographies and timezones

What you’ll get from us 

Please note that the benefits below will apply to permanent roles.  

We have five core benefits specifically for our talent in the US, UK, Philippines, Poland, and South Africa:

  • Unlimited Annual Leave: Feel free to make the most of your time off and maintain a healthy work-life balance! 
  • Private Medical Cover: You can opt in to a Private Medical Insurance scheme. This provides you with access to thorough medical coverage, so you can feel confident in your health and well-being.
  • Retirement: We offer pension schemes to help you plan for and secure your future. 
  • Life Assurance: Life assurance is available to give you peace of mind and protect your loved ones in case of the unexpected.
  • Parental Leave: We offer competitive parental leave schemes to ensure you are spending as much quality time with your new bundle of joy as possible. 

We are also remote-first as an organisation, offering flexibility for you to work where you need to be most productive. In addition to the above, you will discover that we have a range of secondary perks (such as the cycle-to-work scheme and employee discounts) depending on your location, to help you thrive at Zepz!  

Why choose Zepz? 
  • Our team of over 1,000 employees is fully distributed across the world. We work from coffee shops, homes, and co-working spaces, making us one of the larger fully distributed growth-stage startups in the world. We also offer workspace in our talent cluster locations: spaces where we can meet, collaborate and connect.
  • We are proud parents, community organizers, farmers, band members, yoga teachers, YouTube influencers, former Olympians, and serial entrepreneurs.
  • We collectively speak over twenty languages, including Akuapem, Amharic, Bengali, Ewe, Fante, Ga, Igbo, Kalenjin, Luganda, Oromo, Somali, Swahili, Wolof, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Irish, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish and Swedish.
  • At Zepz, embodying our commitments binds us together. We are collectively passionate about striving to achieve our vision and purpose -  to continue to provide the best service to our users.
Ready to Apply?

Applications will be reviewed on a rolling basis. If interested, please submit your resume along with a cover letter (optional), highlighting why your experience demonstrates you meet the requirements of the role. Please also indicate the countries in which you have work authorization.

Confidence can sometimes hold us back from applying for a job. But we'll let you in on a secret: there's no such thing as a 'perfect' candidate. Zepz is a place where everyone can thrive. 

So however you identify and whatever background you bring with you, give us a shot by applying. If you need any form of support to make the process as comfortable as possible, please let us know. We want you to be excited to wake up to make an impact every day.



Required profile

Experience

Industry :
Financial Services
Spoken language(s):
English

Other Skills

  • Communication
  • Teamwork
  • Growth Mindedness
  • Problem Solving
