Key Facts

Remote From:

United Kingdom, France, Colorado (USA), United States

Full time

English, French

Other Skills

• Collaboration
• Communication
• Teamwork
• Problem Solving

Requirements:

3+ years of DevOps/SRE experience (5+ years preferred)
Strong knowledge of AWS (ECS/Fargate), Docker, Terraform, and PostgreSQL at scale
Datadog expertise and experience improving observability and monitoring
Experience with CI/CD, production incident management, and on-call rotations

Roles & Responsibilities

Design, implement, and maintain highly available, scalable, and secure cloud infrastructure for the Sweep Data platform and AI workloads using Infrastructure as Code; enhance observability with Datadog and build infrastructure for ML model training, deployment, and monitoring
Support critical infrastructure scaling projects and contribute to high-traffic system design; establish runbooks, workflows, and documentation
Collaborate with engineers and AI/ML teams to optimize and scale pipelines; participate in the SRE guild and share best practices
Manage day-to-day operations including on-call duties, capacity planning, proactive health monitoring, security/data protection measures, and compliance with SOC 2 Type 2 and ISO 27001

About SWEEP

Sweep is the sustainability data management platform. Easily track your carbon emissions and ESG performance in one place. Its market-leading, AI-powered software helps organizations understand all extra-financial data across their business and value chain to manage increasing disclosure requirements and take action to meet sustainable business goals. Co-founded by Rachel Delacour, Yannick Chaze and Raphael Guller, Sweep partners with enterprise and midmarket companies and financial institutions across the world, with customers including L’Oreal, Lacoste, and Hewlett Packard. Sweep is B Corp certified and a member of the World Bank’s Carbon Pricing Leadership Coalition, France Invest and The International Emissions Trading Association.

Founded: 2018

Company size: 51 - 200


Job description

Sweep is hiring a Site Reliability Engineer (SRE) to join our SRE & infrastructure team and help us ensure the reliability, scalability, and performance of our systems.

This role is ideal for someone with a solid DevOps background, expertise in AWS and high-traffic systems, and a commitment to fostering a collaborative, inclusive environment within our SRE guild and broader engineering culture.

Climate change is the defining issue of our time. By empowering companies with technology that helps them manage their climate impact, we believe Sweep can make a meaningful contribution to a better future for all of us.

Ok, sounds promising. What will I be doing?

As a key player in our Engineering team, you will collaborate with engineering teams to design and implement cutting-edge, automated infrastructure to support both our core platform and AI-driven solutions.

1. Contribute to the team's ownership of technical infrastructure 🛠️

Design, implement, and maintain highly available, scalable, and secure cloud infrastructure for the Sweep Data platform and AI workloads using Infrastructure as Code practices.
Improve and expand our observability strategy, working with Datadog to enhance metrics, dashboards, and alerting across our Rails application and AI workloads.
Develop scalable infrastructure to support machine learning model training, deployment, and monitoring.
Participate in incident response and post-mortem reviews as part of the Oak team.

2. Support critical scaling initiatives 📈

Support critical infrastructure scaling projects.
Contribute to high-traffic systems design.
Help establish team processes including runbooks, workflows, and documentation.

3. Facilitate collaboration 🤝

Work closely with engineers who have elevated infrastructure privileges within our DevOps culture.
Collaborate within the SRE guild and contribute to best practices across the engineering team.
Collaborate with AI/ML teams to optimize and scale AI/ML pipelines and workloads.

4. Manage day-to-day operations 🔧

Manage day-to-day operations including on-call duties, capacity planning, and proactive system health monitoring.
Implement security measures and data protection protocols.
Support enterprise customer security requirements including BYOK implementation and data sovereignty compliance.
Maintain and contribute to Sweep's strong level of compliance, including SOC 2 Type 2, ISO 27001, and more.

5. Continuously improve and learn 🚀

Use a proactive approach to problem-solving and a commitment to building fault-tolerant systems.
Stay up-to-date with the latest industry trends and technologies to ensure we're always building on solid foundations.

That sounds just right for me. What do I need to bring?

Glad you asked. This is who we’re looking for:

Qualifications 🏆

Engineering degree in computer science or 3+ years of DevOps/SRE experience (5+ years preferred)
Good knowledge of AWS (including ECS/Fargate), Docker, Terraform, and PostgreSQL at scale (experience with sharding, clustering, or high-volume scenarios preferred)
Datadog expertise strongly preferred
Experience with continuous integration and continuous deployment
Experience with high-traffic, multi-tenant systems and database scaling strategies
Knowledge and experience in data modeling, database design, and data management
Strong operational mindset with experience in day-to-day production operations
Experience with on-call rotations and production incident management
Experience improving observability and monitoring systems
Understanding of clean code and clean infrastructure practices
You speak English fluently; French is a plus

Technical bonuses 💡

Ruby on Rails experience is a plus
Snowflake experience is a plus
Change Data Capture and data pipeline experience is valuable
Familiarity with high-traffic systems
Experience with ARC (Actions Runner Controller) and Kubernetes is a plus

Qualities 🧠

Autonomous and self-structured
Willing to imagine and implement processes to ease developers' lives
Passionate about solving problems and developing solutions
A team player who values collaboration and feedback

Copy that. And what’s in it for me?

By joining Sweep, you'll be part of an exciting startup with a vision to change the world. We're ready to hit the ground running, and joining us at this early stage allows you the unique opportunity to help shape our journey.

Our flexible work model allows you to balance personal and professional commitments while staying connected with your global colleagues. Even though our hubs are in France, the UK and the US, we're committed to fostering a connected and engaged remote work culture.

As a B Corporation, we're dedicated to creating successful businesses that benefit everyone, including society and the planet.

Ready for the most exciting chapter of your career? Come join us on this extraordinary ride!
