Site Reliability Engineer Ireland

extra holidays - extra parental leave
Work set-up: Full Remote
Experience: Senior (5-10 years)

Offer summary

Qualifications:

  • Bachelor's or Master's degree in Computer Science or a related field.
  • At least 4 years of software engineering experience.
  • Experience with deploying and managing distributed database systems or large-scale SaaS applications.
  • Knowledge of cloud infrastructure, automation, and security practices.

Key responsibilities:

  • Manage and operate the global CloudVision service fleet.
  • Develop and improve CI/CD pipelines and automation processes.
  • Monitor key service indicators for capacity planning and incident response.
  • Lead security design and disaster recovery efforts for cloud-based applications.

Arista Networks Large http://www.arista.com
1001 - 5000 Employees

Job description

Company Description

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world. Our solutions are designed to not only meet the current demands of the digital landscape but to also anticipate and adapt to future challenges.

At Arista we value the diversity of thought and perspectives that each employee brings to the table. We believe that fostering an inclusive environment, where individuals from various backgrounds and experiences feel welcome, is essential for driving creativity and innovation.

Our commitment to excellence has earned us several prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

Job Description

Who You’ll Work With

SREs at Arista combine strong software and systems engineering with a passion for operating production systems at scale. As an SRE you’ll be part of the team responsible for our global service fleet.

What You’ll Do
As an SRE you’ll be responsible for our global CloudVision service fleet. This includes:

  • Building the CI/CD lifecycle for services, from inception and design to deployment and scaling
  • Improving operational processes through automation
  • Identifying key service indicators to be used in capacity planning
  • Owning disaster recovery and management
  • Driving infrastructure and cloud-based application security design
  • Leading sustainable incident response and blameless postmortems
  • Being an active member of our globally distributed on-call team

Arista’s CloudVision is an enterprise network management and streaming telemetry SaaS offering. CloudVision is deployed on Kubernetes across global regions using Spinnaker for our CI/CD pipeline. Our tech stack runs on GKE, using HBase/Hadoop as the main distributed database and storage layer, ElasticSearch for powering search data, ClickHouse for fast real-time queries of flow data, our own Kafka-based distributed real-time stream processing layer for analytics, and TensorFlow for ML analysis. Our monitoring system is built on top of Prometheus, Grafana, Loki, and other OSS tools.

Qualifications

  • BS/MS degree in Computer Science or a related subject.
  • 4+ years of software engineering experience.
  • Experience developing or managing deployments of distributed database systems or scale-out applications for a SaaS environment.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s): English

Other Skills

  • Teamwork
  • Problem Solving
