Epsilon3 is a multi-product operations management platform revolutionizing the way teams build, launch, and operate spacecraft and other advanced hardware systems.
Launched in 2021, our company is led by engineers from SpaceX, Google, and NASA, who have experience supporting over 100 space missions. Innovative teams at Blue Origin, Rocket Lab, Axiom Space, Firefly Aerospace, and many others depend on our web-based (SaaS) solutions to plan and track high-stakes procedures. We raised a $15M Series A funding round led by Lux Capital, Y Combinator (YC S21), and other world-class investors.
This role is remote and can be based anywhere in the United States.
We are looking for a Site Reliability Engineer (SRE) who is interested in space exploration and passionate about building scalable, reliable, and secure software. You will be responsible for building and supporting complex infrastructure and deployment scenarios. We are currently using technologies such as React.JS, Node, Postgres, AWS GovCloud, Docker, and K8s, and our stack will evolve over time as we scale our solutions and approach.
The ideal candidate has years of experience using Kubernetes (K8s) and is proficient in JavaScript.
Some of the technical challenges we’re undertaking:Real-time synchronization of data and user interfaces across earth and spaceVisualization of many complex data fieldsIntegration of multiple high-bandwidth data streams for real-time processing and displayMultiple deployment environments including cloud and on-premisesMission-critical security and reliability requirementsSupporting complex workflows and detailed tracking while also maintaining simplicity and delightfulness of user experienceResponsibilities:Support and contribute to the entire lifecycle of our software, from inception and design, through to deployment, operation and refinementSupport our services in production and before they go live through system design, security considerations, capacity planning, and launch preparednessBuild processes and systems to continuously improve system reliability and performanceBuild processes and systems to continuously improve the productivity of the rest of the development teamScale systems sustainably through automation and continuous improvement in reliability and velocityPractice sustainable incident response and postmortemsContribute to the design, build, test, and release of our web-based operational dashboards, electronic procedure tools, and suite of specialized software solutions to support various missionsJoin and actively participate in customer discovery calls and technical demonstrationsAnalyze and enhance the security, efficiency, stability, and scalability of our software systemsSupport software QA and user testingSupport and facilitate security reviews and audits of our systems by customers and third partiesFacilitate compliance with cybersecurity certifications and contribute to improvements in our security policies and processesAssess third-party and open source software and develop integrationsContribute to the growth and refinement of our engineering culture, processes, and toolsQualifications:Bachelor’s Degree in Computer Science or related field5+ years of combined experience in site reliability and production software engineeringProficiency with Kubernetes (K8s) and JavaScript (JS) is required for this roleStrong foundation in computer science concepts (algorithms, data structures, object-oriented programming, design, testing, etc.)Self-starter and able to navigate ambiguity and assess rapidly evolving prioritiesStrong team player with great communication skills and collaborative work ethicLove of learning (technical and otherwise)Experience in fast-growing tech startups is a plusExperience with Lean Startup methodologies (agile software development) is a plusUS Citizenship (future security clearance may be required)Must be located in the United StatesSalary range: $120,000 - $175,000
This full-time role includes stock options, generous PTO, health insurance, and a 4% 401k match.
We meet in-person four times per year for hackathons and fun team bonding activities.
Epsilon3 is an equal opportunity employer committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws. This policy applies to all employment practices within our organization, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. Epsilon3 makes hiring decisions based solely on qualifications, merit, and business needs at the time.