Senior Site Reliability Engineer (SRE)

Remote: Full Remote
Contract:
Salary: 175 - 192K yearly
Experience: Senior (5-10 years)
Work from:

Offer summary

Qualifications:

7+ years of experience as a software engineer, with 5 years in SRE. Strong record of automation and AWS cloud infrastructure experience.

Key responsibilities:

  • Design and implement automated systems for infrastructure and apps
  • Maintain monitoring to ensure system health and performance
  • Collaborate with software teams for resilient and scalable systems
  • Improve infrastructure design for high uptime and simplicity
  • Develop disaster recovery plans, participate in on-call rotations
Business Wire SME https://services.businesswire.com/
501 - 1000 Employees

Job description

Business Wire, a Berkshire Hathaway company, is the global market leader in press release distribution and regulatory disclosure. We are on a mission to redefine how organizations connect with their audiences - and that’s just the beginning!

Organizations, large and small, depend on us to accurately publicize market-moving news and multimedia, and generate social engagements that develop interactions with their target audiences.

About the Role
As a Senior Site Reliability Engineer (SRE) you will play a critical role in ensuring the availability, reliability, and scalability of our company’s infrastructure and applications. You will work closely with software engineering, architecture, and operations teams to design and implement highly automated systems and ensure the smooth operation of Business Wire services. This is a senior technical role that requires a deep understanding of cloud infrastructure, systems operations, network architecture, and software development.
 
You will be part of the team responsible for providing technical support across all of Business Wire’s SaaS-based applications and infrastructure. An ideal candidate is a lifelong learner with a passion for appraising environments and designing/implementing innovative solutions which lead to supportability/reliability improvements. Candidates should have an advanced understanding of Linux systems, Java application technology stacks, networking, and system/networking troubleshooting fundamentals. As part of a small team supporting mission-critical programs within the company, this position provides the candidate with a unique opportunity to make a significant impact in supporting our customers.

What You'll Do
  • Design and implement highly automated systems/services that ensure the availability, reliability, and scalability of infrastructure and applications.
  • Build and maintain monitoring and alerting to provide timely feedback on the performance and health of systems, network, and applications.
  • Continuously improve infrastructure and application design to ensure 99.99% uptime while removing architectural complexity.
  • Work with software development to design and implement systems/applications that are resilient to failure and highly scalable.
  • Achieve material application performance improvements based on insights from observability metrics.
  • Develop and maintain disaster recovery plans and procedures.
  • Participate in on-call rotations to ensure 24/7 application availability.
  • Triage incoming Web Support escalation requests.
  • Drive incident root cause analysis and service restoration, and serve as an incident commander during outage events.

What You'll Need
  • 7+ years of experience as a software engineer with 5 years as an SRE supporting Infrastructure, Networking, and Application Operations in a high availability, 24x7 hybrid environment (Colo/Cloud).
  • Strong record of automation (e.g., Python, Bash, Ansible, Terraform, CloudFormation).
  • Strong experience with AWS cloud infrastructure and container orchestration (Kubernetes, ArgoCD) operating in a GitOps framework.
  • Strong experience with application monitoring, observability, and alerting systems (e.g., New Relic, Grafana).
  • Strong experience with at least one programming language (Python, Java).
  • Advanced experience with Linux system administration, Java-based applications, and network architecture.
  • Ability to participate in architecture reviews.
  • AWS-related certifications (Architecture, DevSecOps, Cloud Engineer) are a plus.

Business Wire will not sponsor a new applicant for employment authorization for this position.

What We Offer
The base salary range for this position is $175K to $185K/year. Offered salary will be determined by several factors, including but not limited to: applicant’s education, experience, knowledge, skills and abilities, as well as internal equity and alignment with geographic market data. Business Wire reserves the right to modify this salary range at any time.

Business Wire’s total rewards include:
  • Ability to work remotely
  • Excellent health benefits that begin on your first day of employment
  • $100 monthly fitness allotment, a tuition reimbursement program, and enhanced mental health resources
  • 401(k) plan with generous company match and an annual profit-sharing contribution (subject to company performance)
  • PTO, Floating Holidays, Wellness Day Off, Birthday Day Off, and more!

A pre-employment background check will be required after the acceptance of an offer.

Business Wire is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. Pursuant to the San Francisco Fair Chance Ordinance and other similar state laws and local ordinances, and its internal policy, Business Wire will also consider for employment qualified applicants with arrest and conviction records.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Communication
  • Problem Solving
