Match score not available

Staff Reliability Engineer

Remote: 
Full Remote
Contract: 
Salary: 
10 - 1800K yearly
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

3-5 years experience in SRE/DBRE, Proficient with MySQL and Cloud AWS, Strong skills in Kafka cluster management, Experience with infrastructure as code tools, Python programming knowledge.

Key responsabilities:

  • Lead improvement projects for datastores.
  • Maintain infrastructure uptime and scaling.
  • Ensure compliance with security standards.
  • Manage day-to-day support tasks and issues.
  • Provide coaching to new hires.
Udemy logo
Udemy E-learning Large http://www.udemy.com/
1001 - 5000 Employees
See more Udemy offers

Job description

About us

At Udemy, we’re on a mission to transform lives through learning. Through our intelligent skills platform and a global community of instructors, we’ve helped over 70 million learners and 16,000 organizations achieve their goals. Come join us in ensuring everyone, everywhere has access to the skills they need to unlock their potential and create possibilities for themselves and others.

 
About you

You’re an analytical problem-solver ready to put your skills toward purposeful work that has a global impact. You want to lead the way in innovation, exploring the latest technologies and finding new solutions. You thrive in a collaborative environment and are eager to work with and learn alongside the best in Product, Design, and Engineering.

 

About this role 

As part of Udemy's Platform team, the Datastore Infrastructure (DSI) team is responsible for overseeing all aspects of Databases (MySQL, Aurora, DynamoDB), Message Queues (RabbitMQ), Streaming (Kafka), and Caching (Redis, Memcache) in our infrastructure. This includes ensuring uptime, security and compliance, observability, performance,  improving developers' productivity and developing future growth strategies. The team is split between EU and US regions. You will play a vital role in overseeing day-to-day activities and engineering strategies of DSI, ensuring that millions of students worldwide achieve greater learning and career outcomes on Udemy. We value teamwork, a good sense of humor, strong ownership, technological curiosity, and a desire to learn.

To be successful in this role, you will collaborate closely with engineering, product, and a diverse set of stakeholders around the world. You are not just interested in maintaining systems but also writing the software that maintains them. You strongly believe in a no-blame culture and advocate for humane on-call practices. You constantly seek opportunities for improvement and thrive in an environment where you can drive positive change.

What you'll be doing
  • Lead improvement projects for our datastores and platform teams to align with the company’s long term objectives.
  • Maintain Infrastructure Uptime, monitor performance, and ensure infrastructure continues scaling as we grow.
  • Ensure adherence to PCI and ISO27001 compliance as well as SOC 2 security requirements, modifying CI/CD processes when necessary, and upholding policies and standards.
  • Advocate for and implement positive changes in tools and processes through healthy discussions.
  • Participate in the on-call rotation, demonstrating a systematic approach to incident management.
  • Participate in day-to-day activities, support requests, and project-related tasks for the team.
  • Contribute to documentation, maintain ticketing queues, provide project support, troubleshoot, and offer after-hours assistance as required
  • Provide coaching and mentorship to new hires, fostering their technical growth and integration into the team. Maintain close communication with team members throughout their tenure.
What you’ll have
  • 3-5 years of professional experience working in an SRE/DBRE team with Infrastructure responsibilities in managing large production workloads.
  • Proficiency with managing MySQL at scale (Horizontal Scaling, sharding, InnoDB optimizations, Query Optimization, HA/DR, Monitoring, Backups Strategy, Security, Automations).
  • Strong understanding in running Production Workloads in Kubernetes
  • Proficiency with tools like Terraform, Ansible, Git and how to work with Infrastructure as Code, and automated provisioning.
  • Strong experience in Kafka cluster management, topic configuration, performance tuning, and ensuring high availability and fault tolerance. Experience with MSK is also good.
  • Experience with  Message Queues (MQ/SQS) and Caching (Redis, Memcache) or similar products
  • Experience in Python.
  • Knowledge of configuration management tools, monitoring systems (Datadog or similar) for database infrastructure, and scaling strategies for handling increased data volumes.
  • Strong troubleshooting skills to diagnose complex database issues.
  • Hands-on experience with AWS cloud infrastructure and a grasp of security best practices.
  • Adaptability and comfort working in a fast-paced, hands-on environment.

Nice to have :  

  • Experience with any additional Programming Languages (Golang, Kotlin, Java)
  • Experience in implementing CDC pipelines for reliable data replication and synchronization
  • Experience with Vitess Operator running MySQL on Kubernetes.
  • Experience with Writing Kubernetes Helm Charts.
  • Experience with tools like ArgoCD/Argo Workflows, or similar alternatives in various combinations.
  • Knowledge of security standards, vulnerability patching, TLS/SSL and related..
  • Any additional experience or familiarity with related technologies would be advantageous.

We understand that not everyone will match each of the above qualifications. However, we also realize that everyone has unique experiences that can add value to our company. Even if you think your background might not perfectly align, we'd love to hear from you!

Life at Udemy

We aspire to be as vibrant and dynamic as the communities we serve, as inquisitive as those who use our platform, and as revolutionary as the future we strive to open for everyone. Here are some of the things we love about life at Udemy:

  •  We’re invested in creating an inclusive environment that welcomes a diverse range of backgrounds and experiences. From creating employee resource groups, ensuring we’re a Fair Pay Workplace, and building a flexible work culture, our belonging, equity, diversity, and inclusion (BEDI) initiatives always put our people first. We want you to be able to bring your authentic self to work because when we all do, we’re better for it.

  • Learning is what we do – inside and out. Our Learning & Development team is second to none, helping ensure your journey is one of continuous progression. You’ll also have unlimited access to Udemy courses, monthly UDays (meeting-free professional development days), and a generous annual professional development stipend.

  • Our reason to exist is to revolutionize learning – that calls for taking risks and learning from failures. Whether it’s our hackathons (a company-wide effort to envision new possibilities for our product) or sharing our prototypes, we see experimentation as a crucial step on the path to success.

  • We’re committed to creating world-class employee experiences and are proud of the recognition of this by Great Place to Work. 

Of course, the best thing about being part of Udemy is knowing your work makes a difference for people and organizations around the world. You’ve got the skills; why not use them to help others develop theirs?

At Udemy, we value diversity and inclusion and consider qualified applicants without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, or disability. We will consider for employment qualified applicants with arrest and conviction records.

Udemy Mexico Benefits

Our benefits are designed to support you and your family with the protection and care you need, ensuring easy access to the right coverage when it matters most. Here’s a snapshot of the key benefits for full-time Udemates based in Mexico:

Core Benefits:

  • Medical, Dental, and Vision Coverage: Includes maternity, private hospital, dental, and vision benefits for employees and eligible dependents.
  • Life Insurance: 12x monthly base salary, with a Free Cover Limit of MXN 7.5 million.

Financial Benefits:

  • Grocery Vouchers: Monthly vouchers valued at $2,200 MXN.
  • Teleworking Allowance: $800 MXN monthly.
  • Savings Fund: Both you and Udemy contribute 13% of your monthly salary, capped to 1.3 times the UMA value.
  • Christmas Bonus: Equivalent to 30 days of your current salary.
  • Vacation Bonus: 25% premium on your vacation days.

Time Off Benefits:

  • Holidays: Udemy observes 12 public holidays in Mexico.
  • Annual Leave: 12 days of annual leave per year.
  • Sick Leave: 10 days of paid sick leave per year.
  • Family Bonding: 12 weeks of maternity leave or partner leave for bonding after the birth or adoption of a child.
  • Bereavement Leave: Up to 20 working days for the loss of a loved one.

Wellbeing Benefits:

  • Maven (Reproductive Health): Unlimited virtual appointments and $25,000 lifetime benefit for fertility, adoption, and surrogacy.
  • Modern Health (Therapy and Coaching): 10 sessions with therapists and 10 sessions with coaches per year.
  • Origin (Financial Planning): Free access to Certified Financial Planners for you and your spouse.
  • Workplace Options (EAP): 24/7 support for family, stress, caregiving, financial, and legal issues.

Additional Perks:

  • Udemy Courses: Free and unlimited access to Udemy’s Marketplace and Udemy Business.

Home Office Reimbursement: $500 one-time reimbursement and $35 monthly for work-from-home needs.

Information regarding data privacy is available within the Udemy Careers Privacy Notice.

At Udemy, we strive to be transparent around compensation. Actual compensation for this role is based on several factors, including but not limited to job-related skills, qualifications, experience, and specific work location due to differences in the cost of labor. In addition to a base salary, this role is also eligible for benefits and equity.
Hiring Compensation Range
$1,440,000$1,800,000 MXN

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
E-learning
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication
  • Troubleshooting (Problem Solving)
  • Adaptability
  • Analytical Thinking

Site Reliability Engineer (SRE) Related jobs