Logo for Visa

Sr Site Reliability Engineer

Job description

Company Description

Founded by experienced entrepreneurs and engineers in 2016, Pismo is a technology company that provides a comprehensive processing platform for banking, card issuing and financial market infrastructure and helps customers innovate and build the next generation of banking and payment solutions. Pismo joined Visa in 2024.

Leveraging Visa’s solutions, our core platform, and an expanding suite of capabilities, Pismo addresses the technological challenges that large banks, marketplaces, and fintech companies face in migrating from legacy systems to more advanced technology in the market. Pismo’s cloud-based platform empowers firms to build and launch financial products rapidly, scaling as they grow to have a broader audience while keeping high security and availability standards.

Pismo’s 500+ employees are located in more than 10 countries around the world.

Job Description

Join Pismo’s Platform squad within the SRE Tribe, dedicated to owning and evolving the containerized platform that underpins critical workloads. You’ll work cross‑functionally to ensure our platform is reliable, scalable, secure, and easy to operate, focusing on Kubernetes at scale and cloud architecture.

What You’ll Do

Own the end‑to‑end lifecycle (design, provisioning, upgrades, maintenance, and decommissioning) of core platform components, including:

  • Cloud infrastructure primitives
  • Kubernetes clusters and cluster services
  • Networking, ingress, and service discovery
  • Service Mesh and supporting data‑plane components

Design platform components to be resilient by default, applying SRE principles such as:

  • Fault isolation and graceful degradation
  • Capacity planning and saturation control
  • Reduced operational toil and clear failure modes

Lead the design and implementation of infrastructure bootstrap orchestration, including:

  • Automated cluster and environment provisioning
  • Deterministic, repeatable platform bring‑up and teardown
  • Dependency‑aware orchestration across cloud, network, and Kubernetes layers

Drive Infrastructure‑as‑Code and GitOps‑first practices to ensure:

  • Platform components are reproducible and auditable
  • Changes are automated, testable, and reversible
  • Manual intervention is minimized or eliminated
  • Identify automation gaps and lead initiatives that reduce human effort, onboarding time, and operational risk.

Apply and promote SRE operational excellence practices, including:

  • Clear ownership and runbooks for platform components
  • Participation in on‑call rotation as a platform reliability escalation point
  • Incident response, post‑incident reviews, and problem management
  • Improve day‑2 operations by standardizing upgrade/rollback strategies and reducing MTTD/MTTR.
  • Ensure platform operations align with security, compliance, and internal control requirements.
  • Collaborate with engineering teams across the organization to influence platform adoption, reliability standards, and cloud‑native best practices.

This is a remote position. A remote position does not require job duties be performed within proximity of a Visa office location. Remote positions may be required to be present at a Visa office with scheduled notice. #LI-Remote

Qualifications

For this role, you must be based in Brazil.

Language Skills
Proficiency in English at B2 level or above (Upper-Intermediate)

Technical Skills

  • Strong hands‑on experience with public cloud platforms (AWS preferred, Azure also considered).
  • Proven experience operating and administering Kubernetes at scale in production environments.
  • Strong experience with container orchestration platforms and cloud architecture fundamentals (networking, IAM/security concepts, and reliability patterns).
  • Experience with Infrastructure as Code (Terraform preferred) and automation‑first workflows.
  • Familiarity with GitOps practices and CI/CD pipelines.
  • Strong troubleshooting skills for distributed systems, including root‑cause analysis and reliability improvements.
  • Experience with observability concepts and practices (monitoring, logging, alerting, tracing).

Preferred Qualifications

  • Experience with Service Mesh technologies (Istio preferred, App Mesh or Linkerd).
  • Experience working with critical or mission‑critical systems.
  • Strong background applying SRE principles (operational readiness, incident management, runbooks, toil reduction).
  • AWS certifications.

Additional Information

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.

Site Reliability Engineer (SRE) Related jobs

Other jobs at Visa

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.