Key Facts

Remote From:

United States

Full time

Senior (5-10 years)

124 - 329K yearly

English

Hard Skills

Other Skills

•
Accountability
•
Communication
•
Teamwork
•
Analytical Thinking

Job description

About GitHub: GitHub is the world’s leading platform for agentic software development — powered by Copilot to build, scale, and deliver secure software. Over 180 million developers, including more than 90% of the Fortune 100 companies, use GitHub to collaborate, and more than 77,000 organisations have adopted GitHub Copilot.
Locations: In this role you can work from Remote, United States
Overview:

As a Senior Data Engineer on the Revenue Data & Analytics team at GitHub, you will build and maintain the governed data infrastructure that powers how GitHub understands its revenue. You will work at the intersection of data engineering, data modeling, and analytics engineering — designing pipelines, building dimensional models, and implementing data quality frameworks that serve as the single source of truth for Copilot and broader revenue analytics.

We are building a metadata-driven, medallion-architecture governance layer built on Microsoft Fabric, dbt, and Delta Lake. This role is foundational to that effort. You will be dedicated to transforming how GitHub governs, models, and delivers revenue data at scale.

We are looking for engineers who think in models, not just pipelines. People who understand that a well-governed dim_account is worth more than a thousand ad-hoc joins. If you care about data quality as much as data delivery, about SCD2 as much as SLAs, and about building things right — this is your role.

Locations

In this role you can work from Remote, United States

Responsibilities:

Design, build, and maintain dbt models across medallion layers (bronze/silver/gold) in Microsoft Fabric Lakehouse and Warehouse, following Kimball dimensional modeling patterns — including SCD2 dimensions, incremental CDC pipelines, and metadata-driven approaches to minimize code duplication

Author and enforce data quality checks and dbt tests across pipeline stages to catch anomalies before they reach downstream consumers; contribute to data cataloging and lineage to ensure governed datasets are discoverable and traceable

Develop and maintain Airflow DAGs for orchestration — scheduling, dependency management, error handling, and alerting

Containerize data workloads with Docker and deploy via GitHub Actions CI/CD pipelines, including automated testing, linting, and environment promotion (dev → staging → prod)

Manage and optimize ADLS Gen2 and Delta Lake storage — partitioning, compaction, retention policies, and cost management

Collaborate with analytics engineers, BI developers, and analysts to ensure gold-layer datasets serve Power BI, Trino, and downstream reporting needs

Participate in architecture reviews and contribute to ADRs; support migration from legacy patterns toward a governed, metadata-driven platform with pragmatism about transition paths

Own operational excellence across data pipelines — monitoring, alerting, incident response, and proactive detection of data drift, schema changes, and quality regressions

Qualifications:

Required Qualifications:

6+ years experience in Software Engineering, Computer Science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python

- OR Associate’s Degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 5+ years experience in Software Engineering, Computer Science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python

- OR Bachelor's Degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 4+ years experience in Software Engineering, Computer Science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python

- OR Master's Degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 2+ years experience in Software Engineering, Computer Science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python

- OR Doctorate in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related fie

- OR equivalent experience.

5+ years SQL experience.

Preferred Qualifications:

SQL fluency — window functions, CTEs, merge statements, query optimization; you think in sets, not loops

Hands-on dbt experience (Core or Cloud) — models, tests, macros, Jinja, incremental materializations; dimensional modeling (Kimball star schemas, SCD2, conformed dimensions) a strong plus

Orchestration experience (Airflow, Prefect, Dagster, or similar) for scheduling, dependencies, and error handling

Cloud data platform experience — Azure preferred (Fabric, ADLS, Synapse); AWS/GCP transfers; familiarity with Delta Lake, Apache Iceberg, or Spark a bonus

Docker, Git-based workflows, and CI/CD for data pipelines; Python or equivalent for engineering tasks

Data quality tooling (Soda, dbt Elementary) and catalog/lineage tools (Purview, Atlan, DataHub, or similar)

Familiarity with advanced patterns — medallion architecture, Data Vault 2.0, metadata-driven frameworks, or federated query engines (Trino/Presto)

Experience with revenue, finance, or billing data — ARR, consumption models, hierarchy attribution, and account ownership complexity

Compensation Range: The base salary range for this job is USD $124,000.00 - USD $329,200.00 /Yr.

These pay ranges are intended to cover roles based across the United States. An individual's base pay depends on various factors including geographical location and review of experience, knowledge, skills, abilities of the applicant. At GitHub certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role. In addition, certain roles also have the opportunity to earn sales incentives based on revenue or utilization, depending on the terms of the plan and the employee's role. GitHub Leadership Principles:

GitHub values

Customer-obsessed
Ship to learn
Growth mindset
Own the outcome
Better together
Diverse and inclusive

Manager fundamentals

Model
Coach
Care

Leadership principles

Create clarity
Generate energy
Deliver success

Who We Are: GitHub is the world’s leading AI-powered developer platform with 150 million developers and counting. We’re also home to the biggest open-source community on earth (and 99% of the world’s software has open-source code in its DNA). Many of the apps and programs you use every day are built on GitHub.
Our teams are dreamers, doers, and pioneers, leading the way in AI, driving humanitarian efforts around the globe, and even sending open source to Mars (and beyond!). At GitHub, our goal is to create the space you need to do your best work. We’re remote-first and offer competitive pay, generous learning and growth opportunities, and excellent benefits to support you, wherever you are—because we know that people flourish when they can work on their own terms.
Join us, and let’s change the world, together.
EEO Statement: GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!

Ready to apply?

APPLY

Share ·