Role overview

Qualifications

4+ years in SRE, DevOps, Platform/Infrastructure, or backend engineering with significant production operations ownership
Hands-on experience operating production services on Kubernetes and shipping infrastructure as code in a GitOps workflow
Solid PostgreSQL production experience including query planning, indexing decisions, online migrations on large tables, high availability/DR, and CDC pipelines
Cloud networking fundamentals (VPCs, routing, L4/L7 load balancing, DNS, TLS) with comfort debugging cross-service connectivity
Strong observability mindset and incident response experience
Proficiency in Go or Python

Responsibilities

Operate production day-to-day: on-call, incident response, postmortems, and close-the-loop follow-ups
Define and refine SLIs/SLOs and error budgets; help product teams operate within them
Strengthen observability and ship infrastructure as code in a GitOps workflow for cloud resources and Kubernetes workloads
Own PostgreSQL reliability: performance tuning, schema/migration reviews, online migrations on large tables, HA/DR, and CDC pipelines
Mentor engineers on reliability and database fundamentals

Key facts

Remote from: EMEA
Full time
Site Reliability Engineer (SRE)
2 - 2K yearly
English

Other skills

Non-Verbal Communication
Mentorship
Calmness Under Pressure
Teamwork
Problem Solving

About the company

Alpaca

Capital Markets & Securities

Alpaca builds an API-first stock and crypto brokerage platform. Trade with algorithms, connect with apps, and build services with our easy-to-use APIs. *Securities by Alpaca Securities LLC, Cryptocurrencies by Alpaca Crypto LLC (alpaca.markets)*

Company details

Company type: Scaleup

Industry: Capital Markets & Securities

Company size: 201 - 500

Job description

Who We Are:

Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series D funding round brought our total investment to over $320 million, fueling our ambitious vision.

Through our subsidiaries, Alpaca is a licensed financial services company serving hundreds of financial institutions across 40 countries with our institutional-grade APIs. These include broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, totalling over 9 million brokerage accounts.

Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet. We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it.

Alpaca is proudly backed by top-tier global investors, including Portage Ventures, Spark Capital, Tribe Capital, Social Leverage, Horizons Ventures, Unbound, SBI Group, Derayah Financial, Elefund, and Y Combinator.

Our Team Members:

We're a dynamic team of 380+ globally distributed members who thrive working from our favorite places around the world, with teammates spanning the USA, Canada, Japan, Hungary, Nigeria, Brazil, the UK, and beyond!

We're searching for passionate individuals eager to contribute to Alpaca's rapid growth. If you align with our core values—Stay Curious, Have Empathy, and Be Accountable—and are ready to make a significant impact, we encourage you to apply.

Your Role:

As a Site Reliability Engineer at Alpaca, you'll help keep our brokerage platform reliable, observable, and operable as we grow - working across our cloud infrastructure, Kubernetes platform, observability stack, messaging layer, and data layer. We're especially interested in candidates with strong PostgreSQL fundamentals who'd like to grow into deeper ownership of our database reliability posture: PostgreSQL sits on the trading-critical path, and we want this person to spend a meaningful share of their time leveling it up while still being a well-rounded SRE the rest of the week.

Things You Get To Do:

Operate production day-to-day - on-call, incident response, postmortems, and the follow-ups that actually close the loop.
Own reliability practice - define and refine SLIs/SLOs and error budgets, and help product teams live within them.
Strengthen our observability across metrics, logs, traces, and alerting.
Ship infrastructure through code in a GitOps workflow - cloud resources and Kubernetes workloads alike.
Look after PostgreSQL: performance tuning, schema and migration review, online migrations on large tables, HA/DR, and CDC pipelines.
Mentor engineers on reliability and database fundamentals through code review, design review, and pairing.

Who You Are (Must-Haves):

4+ years in SRE, DevOps, Platform/Infrastructure, or backend engineering with significant production operations ownership.
Hands-on experience operating production services on Kubernetes, and shipping infrastructure as code in a GitOps workflow.
Solid working knowledge of PostgreSQL in production — query plans, pg_stat_*, indexing and schema trade-offs, and what a safe online migration looks like on a non-trivial table.
Cloud networking fundamentals (VPCs, routing, L4/L7 load balancing, DNS, TLS) and comfort debugging cross-service connectivity.
Comfortable with a modern observability stack and proficient with Linux at the operator level.
Practiced in incident response - calm under pressure, structured debugging, postmortems that drive change.
At least working proficiency in Go or Python, plus strong written and verbal communication.
Genuine interest in databases and in growing your PostgreSQL/DBA expertise.

Who You Might Be (Nice-to-Haves):

Deeper PostgreSQL experience: large clusters at OLTP load, online migrations on big tables, HA/DR ownership, connection pooling at scale, or change-data-capture pipelines.
Experience with typed SQL access layers in Go (e.g. pgx, gorm, sqlc).
Production experience with messaging systems at scale (e.g. RabbitMQ, Kafka, Redpanda).
Security & compliance experience in a regulated environment (SOC 2, secrets management, audit logging).
Familiarity with trading, brokerage, or other regulated fintech domains.

How We Take Care of You:

Competitive Salary & Stock Options
Health Benefits
New Hire Home-Office Setup: One-time USD $500
Monthly Stipend: USD $150 per month via a Brex Card

Alpaca is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.
