Match score not available

Director Engineering, Cloud Reliability

extra holidays - extra parental leave - work from home - fully flexible
Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

15+ years of experience in software development and 8+ years in management., Bachelor’s or Master's degree in Computer Science, Engineering, or equivalent., Experience with architecting distributed cloud systems at massive scale with a focus on resiliency and reliability., Background in Business Continuity Planning (BCP), Disaster Recovery (DR), and managing FedRAMP cloud operations..

Key responsabilities:

  • Own the overall vision and execution for the Cloud Reliability team.
  • Define strategy and roadmap for production change control, incident response, and operational standards.
  • Drive initiatives related to reliability risk management and customer trust.
  • Engage with customers and account teams to discuss reliability objectives and practices.

Confluent logo
Confluent Computer Software / SaaS Large http://confluent.io/
1001 - 5000 Employees
See all jobs

Job description

Position at Confluent Inc

With Confluent, organizations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better every day – we're creating an entirely new category within data infrastructure - data streaming. This technology will allow every organization to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.

One Confluent. One team. One Data Streaming Platform.

Data Connects Us.

About the Role:

The CAR (Cloud Architecture and Reliability) team at Confluent is dedicated to ensuring the dependable functionality and management of Confluent Cloud. The team owns a combination of centralized initiatives as well as broad, strategic, cross-engineering objectives and KPIs related to securing customer trust through delivering a reliable and resilient product offering. As Confluent grows and requires increasingly complex architecture, the CAR team is at the center of the conversation, bringing together scalability, efficiency, and resiliency concerns across all of our cloud pillars to ensure the best customer experience on our streaming data platform. You will be responsible for further defining the vision of the Cloud Reliability charter, growing and coaching the team, and planning and driving cross-team efforts to solve complex distributed system challenges at scale. 

What You Will Do:

  • Own the overall vision and execution for the Cloud Reliability team
  • Define the strategy and roadmap for critical initiatives in production change control, release management and safety, incident response and supportability, and operational standards.
  • Own and manage the operations of our FedRAMP cloud offering
  • Drive initiatives related to BCP/DR and reliability risk management
  • Work with account teams and customers to directly discuss reliability objectives and practices to build customer trust
  • Deliver high impact to the business by driving important technical initiatives in areas comprising security, reliability, multi-tenancy, architectural direction, and major component refactor across organizational boundaries
  • Influence the overall domain health and operational hygiene for Confluent Cloud, including reductions in critical KPIs such as MTTR/MTTD and incident volume
  • Partner with our Global Support team to align self-service objectives with administrative interfaces and tooling
  • Make pragmatic tradeoffs and recommendations, focused on ROI-based investments and objective

What You Will Bring:

  • 15+ years of experience in software development and 8+ years in management
  • Experience with architecting distributed cloud systems at massive scale with a proactive focus towards resiliency and reliability
  • Ability to influence domain health and operational hygiene by driving improvements in key metrics (e.g., MTT*, availability, incident volumes)
  • Background in driving Business Continuity Planning (BCP), Disaster Recovery (DR), and reliability risk mitigation efforts for large multi-tenant and multi-cloud SaaS offerings
  • Ability to own and manage the operations of a FedRAMP cloud offering, ensuring compliance and security best practices
  • A great track record of timely shipping features and have a sense of urgency, an aggressive mindset towards achieving results, and excellent prioritization skills
  • Experience driving broad, cross-cutting horizontal initiatives requiring alignment on goals, resourcing, delivery timeframes, etc 
  • Experience engaging directly with customers and account teams to discuss reliability objectives and practices, fostering trust and alignment
  • Ability to hire while ensuring a high hiring bar, keep engineers motivated, coach/mentor, and handle performance management
  • Bachelor’s or Master's degree in Computer Science, Engineering, or equivalent 

 

Come As You Are

At Confluent, equality is a core tenet of our culture. We are committed to building an inclusive global team that represents a variety of backgrounds, perspectives, beliefs, and experiences. The more diverse we are, the richer our community and the broader our impact. Employment decisions are made on the basis of job-related criteria without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by applicable law.

At Confluent, we are committed to providing competitive pay and benefits that are in line with industry standards. We analyze and carefully consider several factors when determining compensation, including work history, education, professional experience, and location. This position has an annual estimated salary of $297,000 - $356,400 and a competitive equity package. The actual pay may vary depending on your skills, qualifications, experience, and work location. In addition, Confluent offers a wide range of employee benefits. To learn more about our benefits click HERE.

Click HERE to review our Candidate Privacy Notice which describes how and when Confluent, Inc., and its group companies, collects, uses, and shares certain personal information of California job applicants and prospective employees.
#LI-Remote

Required profile

Experience

Industry :
Computer Software / SaaS
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Prioritization
  • Communication
  • Problem Solving

Cloud Engineer Related jobs