Logo for Extreme Networks

Staff Cloud Operations Engineer

Roles & Responsibilities

  • 5+ years in cloud infrastructure engineering with deep expertise in at least one major cloud provider (AWS preferred).
  • Strong Kubernetes experience: cluster design, operators, controllers, and multi-cluster management.
  • Proficiency with Infrastructure as Code: Terraform, CloudFormation, or similar.
  • GitOps expertise: ArgoCD, Flux, or similar; experience with ApplicationSets and complex deployment patterns.

Requirements:

  • Architect Scale Infrastructure: design and implement multi-cluster, multi-region Kubernetes deployments across AWS, GCP, Azure and on-prem; build scalable infrastructure across regions and providers.
  • Own Production Systems: end-to-end ownership of production infrastructure; drive incident response, postmortems, and improvements to prevent recurrence.
  • GitOps & IaC Excellence: build and maintain Terraform modules; manage thousands of configuration files across clusters using GitOps; design ArgoCD ApplicationSets and Helm architectures for safe automated releases.
  • Observability, Performance & Security: implement monitoring with Prometheus, Grafana, Loki; optimize performance, capacity planning, autoscaling; ensure security controls and compliance across cloud infrastructure.

Job description

Job Teaser Summary:

Extreme’s Cloud Operations team is a group of talented engineers passionate about building highly reliable, scalable and secure solutions in public/private cloud environments. We are looking to hire a highly motivated Cloud Operations engineer with strong working experience in production operation and deployment automation. You will work with the team to design, develop and implement deployment automation solutions end-to-end. You will also be expected to participate in continuous cloud service operation, troubleshoot and resolve complex issues in production. We will work together to design, develop and implement the best public / private / local cloud solutions for our customers. Extreme Networks is the right place to be and now is the right time to join us and be part of our spectacular growth and success. We're looking for the best and the brightest 'A' players who want to make a difference doing a job they love.
 
About the Role:
 
We want you to help lead infrastructure engineering for ExtremeCloud, a multi-cloud SaaS platform. Design, build, and operate large-scale, multi-region Kubernetes environments across AWS, GCP, and Azure and on-prem. Drive reliability, scalability, and operational excellence for a platform serving global customers.

What You'll Do:
  • Architect & Scale Infrastructure: Design and implement multi-cluster, multi-region Kubernetes deployments using EKS, GKE, and AKS. Build infrastructure that scales across regions and cloud providers.
  • Own Production Systems: Take end-to-end ownership of production infrastructure. Drive incident response, postmortems, and improvements to prevent recurrence.
  • Infrastructure as Code at Scale: Build and maintain Terraform modules for complex infrastructure patterns. Manage thousands of configuration files across clusters, regions, and environments using GitOps principles.
  • GitOps & Deployment Excellence: Design and optimize ArgoCD ApplicationSets and Helm chart architectures. Build deployment pipelines that enable safe, automated releases across hundreds of microservices.
  • Performance & Reliability Engineering: Analyze system performance, identify bottlenecks, and implement optimizations. Improve SLOs through capacity planning, autoscaling, and architectural improvements.
  • Observability & Monitoring: Build and enhance monitoring, alerting, and observability using Prometheus, Grafana, Loki, and custom tooling. Drive visibility into complex distributed systems.
  • Security & Compliance: Implement security controls, compliance frameworks, and best practices across cloud infrastructure. Design secure multi-tenant architectures.
  • Technical Leadership: Mentor engineers, establish best practices, and drive technical decisions. Collaborate with platform, SRE, and product teams to deliver reliable infrastructure.

  • What We're Looking For:
  • 5+ years in cloud infrastructure engineering, with deep expertise in at least one major cloud provider (AWS preferred)
  • Strong Kubernetes experience: cluster design, operators, controllers, and multi-cluster management
  • Proficiency with Infrastructure as Code: Terraform, CloudFormation, or similar
  • GitOps expertise: ArgoCD, Flux, or similar; experience with ApplicationSets and complex deployment patterns
  • Deep Linux and networking knowledge
  • Experience with distributed systems: Elasticsearch, PostgreSQL, Redis, Kafka, RabbitMQ
  • Monitoring and observability: Prometheus, Grafana, ELK stack, or similar
  • Strong problem-solving skills and experience debugging complex distributed systems
  • Experience with cloud security, compliance (SOC2, ISO27001), and secure-by-design practices
  • Excellent communication skills for working across time zones and with distributed teams
  • Self-directed with a track record of owning problems end-to-end

  • Nice to Have:
  • Experience with multi-cloud architectures and cloud-agnostic patterns
  • Contributions to open-source infrastructure projects
  • Experience with service mesh technologies (Istio, Linkerd)
  • Knowledge of chaos engineering and reliability testing
  • Experience with cost optimization and FinOps practices

  • Why This Role:
  • Work on infrastructure at scale: hundreds of clusters, thousands of services, global reach
  • Deep technical ownership: design, build, and operate critical systems
  • Modern stack: Kubernetes, GitOps, Infrastructure as Code, cloud-native tools
  • Impact: infrastructure decisions affect millions of users
  • Growth: work with experienced engineers and tackle complex challenges
  • Extreme Networks, Inc. (EXTR) is the industry’s first cloud-driven, end-to-end enterprise networking company. Our best-of-breed technology solutions, from the wireless and IoT edge to the data center, are flexible, agile, and secure to accelerate the digital transformation of our customers and provide them with the fastest path to the autonomous enterprise. Our 100% in-sourced services and support are number one in the industry. Even with 50,000 customers globally, including half of the Fortune 50 and some of the world's leading names in business, hospitality, retail, transportation and logistics, education, government, healthcare, and manufacturing, we remain nimble and responsive to ensure customer and partner success. We call this Customer-Driven Networking™. Founded in 1996, Extreme is headquartered in San Jose, California. For more information, visit Extreme's website or call 1-888-257-3000.
     
    Extreme Networks provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, pregnancy, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

    Cloud Engineer Related jobs

    Other jobs at Extreme Networks

    We help you get seen. Not ignored.

    We help you get seen faster — by the right people.

    🚀

    Auto-Apply

    We apply for you — automatically and instantly.

    Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

    AI Match Feedback

    Know your real match before you apply.

    Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

    Upgrade to Premium. Apply smarter and get noticed.

    Upgrade to Premium

    Join thousands of professionals who got noticed and hired faster.