Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and power data-driven applications. Trusted by more than 700 of the world's leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io.
The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers' usage of our managed Airflow service. CREs operate, monitor, and maintain the platform so that production environments are available, predictable, and reliable for our customers. As an infrastructure specialist within the team, you will become an expert on the reliability of Kubernetes and the underlying cloud infrastructure on all three public clouds (AWS, Azure, and GCP). You will create strong relationships with customers and help them achieve their reliability goals.
When you learn a new piece of technology, do you aim not just to get started but to become the expert? Do you listen to the plumber when they tell you what was wrong with the pipes? Do you know how your router works? Are you the kind of person who takes an MIT OpenCourseWare course and actually finishes it? Then this role could be for you.
This position requires you to work from 12 PM to 6 PM US Pacific time, Monday to Friday. Your remaining work time is flexible.
Learn and build expertise across several software engineering disciplines, including:
Kubernetes
Cloud engineering
Cloud networking
Gain exposure to the big picture; learn about product, engineering, customer relationship management, and more.
Spend up to 20% of your time on side projects that contribute to Astronomer’s overall success, such as contributing to the open-source Airflow repository or developing Astronomer’s internal monitoring and alerting systems built on Airflow.
Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems. Gain depth and breadth of learning!
Work directly with our customers’ data engineers, system admins, DevOps teams, and management.
Provide feedback from your experience that can shape the direction of Astronomer’s products.
Own the customer experience, working directly with customers to prioritize and solve issues and meet SLAs.
Participate remotely within a fully distributed team. Approximately 2-4 in-person events per year.
Help maintain 24x7 coverage through a specified 6-hour pager period during your work day.
Participate in a paid on-call rotation for weekend coverage.
Motivation to learn
Commitment to excellence
Problem-solving and troubleshooting abilities
Willingness to identify and own problems through the full lifecycle, from vague problem to delivered solution
Excellent written and verbal communication for connecting with our customers over our ticketing system and through Zoom
Demonstrable Linux familiarity
4 years of professional experience
Experience with Kubernetes/Docker/containers
Experience with any major cloud provider (AWS, GCP, Azure)
Previous experience working directly with customers (internal or external)
Experience with DevOps
Contributions to open-source projects
Experience with Splunk or Prometheus
The salary for this role is $140,000 to $150,000, depending on experience level, along with an equity component.
At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.