Logo for LiveKit

Data Engineer

Roles & Responsibilities

  • 8+ years of experience in data engineering with strong Python and SQL expertise
  • Deep expertise in GCP, with hands-on experience in BigQuery, Dataflow, Cloud Storage, and related analytics services
  • Strong understanding of data modeling, transformation patterns, and working with dbt
  • AI-first development mindset with hands-on experience building AI-driven workflows and effectively using AI coding assistants

Requirements:

  • Own the Analytics Infrastructure: end-to-end ownership of the GCP-based data infrastructureβ€”from ingestion and movement to storage, security, and availabilityβ€”and build/operate reliable, scalable pipelines powering analytics; collaborate with Analytics on downstream transformations and BI.
  • Maximize the GCP Ecosystem: design cost-effective solutions using GCP-native services, and pragmatically decide when to extend with third-party tooling or homegrown solutions.
  • Contribute Across Data Infrastructure: apply data pipeline expertise to application data needs in collaboration with product engineering.
  • Engineering Standards & AI-First Development: follow PR reviews, automated testing, change management, and production-grade practices; work with AI coding assistants and adapt to startup pace.

Job description

LiveKit is building the infrastructure layer for the voice-driven era of computing. Our platform gives developers everything they need to build, test, deploy, scale, and observe agents in production. Founded in 2021, LiveKit powers voice AI applications for OpenAI, xAI, Salesforce, Coursera, Spotify, and thousands of others, collectively facilitating billions of calls each year.

You'll thrive at LiveKit if you:

  • obsess with crafting code that is fast, reliable and practical for the problem

  • are known as the go-to person for tackling tough technical problems

  • work hard and can build and ship fast

  • can clearly explain complex technical concepts to others

  • are a fast learner, frequently picking up new languages and tools

The best way to impress us is with thoughtful Issues and/or PRs on our Github repos 😊

About This Role:

As a Data Engineer at LiveKit, you'll own the analytics infrastructure that powers our business intelligence and data analysis capabilities. Working closely with the Head of Data and analytics peers, you'll design and implement scalable GCP-based data pipelines β€” from ingestion through transformation to delivery β€” maximizing the GCP ecosystem for cost-effective solutions while integrating additional services or homegrown tooling where appropriate. While analytics infrastructure is the core focus, you'll also engage with the broader application data infrastructure, contributing your data pipeline expertise to support product and engineering needs. This is a foundational IC role with significant ownership over the architecture and direction of our analytics stack as the team grows.

What You’ll Do:

Own the Analytics Infrastructure: You are the end-to-end owner of our GCP-based data infrastructure β€” including ingestion, movement, storage, security, and availability. You build and operate reliable, scalable pipelines that power analytics, and partner closely with the Analytics team on downstream transformation and BI.

Maximize the GCP Ecosystem: Build cost-effective solutions anchored in GCP-native services. Know when to extend with third-party tooling or homegrown solutions, and make pragmatic tradeoffs.

Contribute Across Data Infrastructure: While analytics is the primary focus, you'll bring broad data pipeline expertise to application data needs in collaboration with the product engineering team.

Managed Services First: Favor managed solutions over self-hosting. Evaluate build vs. buy with cost and operational burden in mind.

Engineering Standards: This role reports to the Head of Data within the Engineering org. Expect PR reviews, automated testing, proper change management, and production-grade standards.

AI-First Development: Work extensively with AI coding assistants and contribute to evolving our AI development workflows and infrastructure.

Startup Pace: Priorities shift quickly. Balance long-term architectural thinking with the tactical execution the moment requires.

Who You Are:

  • 8+ years of experience in data engineering with strong Python and SQL expertise

  • Deep expertise in GCP, with hands-on experience in BigQuery, Dataflow, Cloud Storage, and related analytics services

  • Proven ability to design and implement production-grade data pipelines and aggregation layers for BI and analysis

  • AI-first development mindset with hands-on experience building AI-driven workflows and effectively using AI coding assistants

  • Strong understanding of data modeling, transformation patterns, and working with dbt

  • Experience with data movement tools (Estuary, Airbyte, Fivetran, or similar)

  • Solid infrastructure and DevOps fundamentals: Terraform or similar IaC, CI/CD, Git workflows, and change management

  • Experience implementing observability and monitoring for data systems (DataDog, Grafana, or similar)

  • Strong communication skills and ability to work cross-functionally with engineering and business stakeholders

  • Self-directed and comfortable with ambiguity in a fast-paced startup environment

  • Located in the US or Canada

Bonus:

  • Experience coordinating with dbt and analytics engineering teams

  • Background with AI workflow tools (n8n or similar)

  • Background with AI coding assistants

  • Prior experience as an early infrastructure hire building from the ground up

Our Commitment to You:

  • An opportunity to build something truly impactful to the world

  • Contribute to open source alongside world-class engineers

  • Competitive salary and equity package

  • Health, dental, and vision benefits

  • Flexible vacation policy

Data Engineer Related jobs

Other jobs at LiveKit

We help you get seen. Not ignored.

We help you get seen faster β€” by the right people.

πŸš€

Auto-Apply

We apply for you β€” automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.