Intermediate Site Reliability Engineer, Database Operations

unlimited holidays - work from home - work from anywhere
Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Experience managing PostgreSQL in large-scale production environments., Strong understanding of SQL and PL/pgSQL., Hands-on experience with infrastructure automation tools like Ansible, Terraform, or Chef., Proactive attitude with excellent communication skills in English..

Key responsibilities:

  • Automate operational tasks such as updates and configuration changes.
  • Respond to platform emergencies and support escalations.
  • Develop and maintain automated observability and capacity planning systems.
  • Collaborate with engineering teams to optimize database performance and reliability.

GitLab logo
GitLab Information Technology & Services Large https://about.gitlab.com/
1001 - 5000 Employees
See all jobs

Job description

GitLab is an opencore software company that develops the most comprehensive AIpowered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and cocreate the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining whats possible in software development. Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC.

The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our highperformance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems. Cocreate the future with us as we build technology that transforms how the world develops software.

Int. Site Reliability Engineer: Database Operations


An overview of this role

Site Reliability Engineers (SREs) are responsible for keeping all userfacing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our environments and the GitLab codebase. We specialize in systems, whether it be networking, the Linux kernel, or some more specific interest in scaling, algorithms, or distributed systems.

The Database Operations team’s mission is to build, run, own and evolve the entire lifecycle of the PostgreSQL database engine for GitLab.com. The team is focused on owning the reliability, scalability, evolution, performance & security of the database engine and its supporting services. The team should be seeking to build their services on top of Reliability::Foundations services and cloud vendor managed products, where appropriate, to reduce complexity, improve efficiency and deliver new capabilities quicker.

GitLab.com is a unique site and it brings unique challenges–it’s the biggest GitLab instance in existence. In fact, it’s one of the largest singletenancy opensource SaaS sites on the internet. The experience of our team feeds back into other engineering groups within the company, as well as to GitLab customers running selfmanaged installations

Responsibilities

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Mentorship
  • Collaboration
  • Communication
  • Problem Solving

Site Reliability Engineer (SRE) Related jobs