What You'll Do
Support and enhance the reliability of in-house applications and systems across production and non-production environments.
Design and implement resilient infrastructure and automate operational processes using IaC tools and pipelines.
Improve observability by implementing monitoring, logging, alerting, and SLOs to detect and respond to issues proactively.
Lead and participate in incident response, root cause analysis, and the creation of actionable postmortems.
Collaborate with development, infrastructure, and support teams to drive reliability-focused architecture and tooling decisions.
Identify and automate manual tasks using scripts, runbooks, and self-healing solutions to reduce operational overhead.
Champion an engineering enablement model—empowering developers to own and operate what they build.
What You'll Bring
Proven experience supporting and improving distributed software systems in production.
Proficiency in scripting and automation (PowerShell, Python, Bash, Node.js, or similar).
Strong understanding of observability practices and tools (ELK Stack, OTel, AWS CloudWatch, Azure Application Insights).
Hands-on experience with one or more cloud platforms (AWS preferred, Azure/GCP acceptable).
Solid knowledge of CI/CD pipeline automation (GitHub Actions, Azure DevOps).
Experience with infrastructure as code (Terraform and Ansible).
Familiarity with incident response practices, SLAs/SLIs/SLOs, and service ownership models.
Excellent communication and collaboration skills with a mindset for continuous learning and improvement.
Pay Range: $82,925 - $110,525 Annually
This hiring range is a reasonable estimate of the base pay range for this position at the time of posting. Pay is based on a number of factors which may include job-related knowledge, skills, experience, business requirements, and geographic location.
#ST2
** Note that the following statements only apply to candidates who will be working from an unincorporated area within Los Angeles County. **
First American will consider for employment all qualified applicants, including those with arrest or conviction records, in a manner consistent with the requirements of applicable state and local laws (e.g., the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act).
First American intends to conduct a review of an applicant’s criminal history in connection with a conditional offer. First American reasonably believes that a criminal history may have a direct, adverse and negative relationship with the following material job duties for this position potentially resulting in the withdrawal of the conditional offer of employment: handling of confidential, proprietary or trade secret information belonging to First American or its customers, administrating or facilitating financial transactions, and the ability to meet customer-imposed criminal history requirements.
Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan.Jobber
InOrg Global
Shutterfly
Addepar
Tailor