Customer Reliability Engineer, Infrastructure

unlimited holidays - work from anywhere
Work set-up: Full Remote
Salary: 140 - 140K yearly
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • At least 4 years of professional experience in relevant fields.
  • Proficiency with Linux operating systems.
  • Experience with Kubernetes, Docker, or container technologies.
  • Knowledge of major cloud providers such as AWS, GCP, or Azure.

Key responsibilities:

  • Operate, monitor, and maintain the Airflow platform to ensure reliability.
  • Build expertise in cloud infrastructure and Kubernetes across multiple cloud providers.
  • Engage with customers to understand and meet their reliability goals.
  • Contribute to open-source projects and internal monitoring systems.

Astronomer Scaleup https://www.astronomer.io
201 - 500 Employees

Job description

Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and power data-driven applications. Trusted by more than 700 of the world’s leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io.

Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having bonus qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer haven’t followed traditional career paths, and we welcome it if yours hasn’t either.

About this role:

The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers’ usage of our managed Airflow service. CREs operate, monitor, and maintain the platform so that production environments are available, predictable, and reliable for our customers. As an infrastructure specialist within the team, you will learn to become an expert on the reliability of Kubernetes and the underlying cloud infrastructure on all three public clouds (AWS, Azure, and GCP). You will create strong relationships with customers and help them achieve their reliability goals.

When you learn a new piece of technology, do you aim not just to get started but to become the expert? Do you listen to the plumber when they tell you what was wrong with the pipes? Do you know how your router works? Are you the kind of person who takes an MIT OpenCourseWare course and actually finishes it? Then this role could be for you.

This position requires you to work from 12 PM to 6 PM Pacific Time (US), Monday to Friday. Your remaining work time is flexible.

What you get to do:
  • Learn and build expertise across several software engineering disciplines, including:
    • Kubernetes
    • Cloud engineering
    • Cloud networking
  • Gain exposure to the big picture; learn about product, engineering, customer relationship management, and more.
  • Spend up to 20% of your time on side projects that contribute to Astronomer’s overall success, such as contributing to the open-source Airflow repository or developing Astronomer’s internal monitoring and alerting systems built on Airflow.
  • Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems. Gain depth and breadth of learning!
  • Work directly with our customers’ data engineers, system admins, DevOps teams, and management.
  • Provide feedback from your experience that can shape the direction of Astronomer’s products.
  • Own the customer experience, working directly with customers to prioritize and solve issues and meet SLAs.
  • Participate remotely within a fully distributed team, with approximately 2-4 in-person events per year.
  • Help maintain 24x7 coverage through a specified 6-hour pager period during your work day.
  • Participate in a paid on-call rotation for weekend coverage.

What you bring to the role:
  • Motivation to learn
  • Commitment to excellence
  • Problem-solving and troubleshooting abilities
  • Willingness to identify and own problems through the full lifecycle, from vague problem to delivered solution
  • Excellent written and verbal communication for connecting with our customers over our ticketing system and through Zoom
  • Demonstrable Linux familiarity
  • 4 years of professional experience
  • Experience with Kubernetes, Docker, or other container technologies
  • Experience with any major cloud provider (AWS, GCP, Azure)

Bonus points if you have:
  • Previous experience working directly with customers (internal or external)
  • Experience with DevOps
  • Contributions to open-source projects
  • Experience with Splunk or Prometheus

The salary for this role is $140,000-$150,000, depending on experience level, along with an equity component.

At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Communication
  • Troubleshooting (Problem Solving)
  • Self-Motivation
  • Problem Solving
