
Sr. Engineer - Boundary Reliability Engineering

extra holidays - extra parental leave - work from anywhere - fully flexible
Remote: Full Remote

Offer summary

Qualifications:

  • 5-7 years of experience with production applications at scale, particularly in backend applications using Golang or similar technologies.
  • Proficiency in PostgreSQL or any RDBMS, along with observability and AWS primitives.
  • Strong communication skills, with an emphasis on empathy and kindness.
  • A willingness to learn and reflect on experiences, along with experience debugging live production services.

Key responsibilities:

  • Develop a deep understanding of customer interactions with Boundary Cloud to enhance reliability and user experience.
  • Implement best practices for high availability, disaster recovery, scalability, and fault tolerance.
  • Design and build internal tools for monitoring and diagnosing reliability issues, while leading incident management processes.
  • Collaborate with cross-functional teams and participate in a 24/7 on-call rotation to support mission-critical services.

HashiCorp | Information Technology & Services | Large | http://www.hashicorp.com
1001 - 5000 Employees

Job description

About the team

HashiCorp Boundary aims to give customers seamless, just-in-time remote access to their infrastructure and other web applications without having to worry about passwords, certificates, or other credentials. Boundary is offered as a cloud platform, and this role will be part of the Boundary Enterprise Enablement team, whose primary focus is scale and reliability to enable hypergrowth among medium and large enterprises.

 

What you’ll do (responsibilities)

Senior Engineer – Boundary Reliability Engineering

As a Senior Engineer on the Boundary Reliability Engineering team, you will take on the following key responsibilities:

  • Develop a deep understanding of how customers interact with Boundary Cloud and continuously improve reliability and user experience.
  • Implement and advocate for best practices in high availability, disaster recovery, scalability, and fault tolerance.
  • Design and build internal developer tools to proactively detect, diagnose, and remediate reliability issues.
  • Lead and refine incident management processes to minimize downtime and directly improve customer satisfaction.
  • Enhance service reliability by developing monitoring and observability tooling using SLIs, SLOs, and SLAs.
  • Deploy, manage, and monitor large-scale Boundary Cloud deployments to ensure optimal performance.
  • Anticipate potential failures and take proactive steps to mitigate risks before they impact users.
  • Collaborate with cross-functional teams to refine tools and processes based on real-world production insights.
  • Participate in a 24/7 on-call rotation, supporting mission-critical production services.

 

What you’ll need (basic qualifications)

  • 5-7 years of experience handling production applications at scale: backend applications written in Golang or similar, PostgreSQL (or any RDBMS), observability, and AWS primitives
  • A commitment to quality through maintainable code and comprehensive testing, from development to deployment
  • Clear communication skills while remaining empathetic and kind
  • An eagerness to learn through humility and reflection
  • Experience debugging live production services

 

What's nice to have (preferred qualifications)

  • Working knowledge of industry best practices related to information security
  • Working knowledge of AWS Aurora or PostgreSQL, Nomad or other orchestration platforms, and Traefik or other load-balancing technologies
  • Experience conceiving, documenting, and advocating for best practices, or a willingness to do so


 

Individual pay within the range will be determined based on job-related factors such as skills, experience, and education or training.

The base pay range for this role in the SF Bay Area / NYC area is:
$176,000 - $207,000 USD
The base pay range for this role in California (excluding SF Bay Area), New York (excluding NYC), Seattle Metro, Denver / Boulder Metro, Washington D.C., or Maryland is:
$161,300 - $189,800 USD
The base pay range for this role in Colorado (excluding Denver / Boulder Metro), Illinois, Minnesota, or Washington (excluding Seattle Metro) is:
$146,600 - $172,500 USD

“HashiCorp is an IBM subsidiary which has been acquired by IBM and will be integrated into the IBM organization. HashiCorp will be the hiring entity. By proceeding with this application you understand that HashiCorp will share your personal information with other IBM subsidiaries involved in your recruitment process, wherever these are located. More information on how IBM protects your personal information, including the safeguards in case of cross-border data transfer, is available here: link to IBM privacy statement.”

Required profile

Experience

Industry:
Information Technology & Services
Spoken language(s):
English

Other Skills

  • Empathy
  • Willingness To Learn
  • Communication
