5+ years of experience in data science, with 3+ years in Spark framework
Strong SQL and Python skills, with proven experience building ETL/ELT at scale
Experience with data pipeline orchestration (Airflow, Prefect, dbt, or similar)
Solid understanding of data modeling (Kimball, Data Vault, or hybrid)
Requirements:
Design, build, and optimize data pipelines and ETL workflows in Snowflake (Snowpark, Streams/Tasks, Snowpipe); develop scalable data models, user 360 views, churn prediction, and recommendation engine inputs; lead integration across data sources (MySQL, BigQuery, Redis, Kafka, GCP Storage, API Gateway); implement CI/CD and data quality checks
Mentor junior data engineers; partner with Data Science, ML, and Backend teams to productionize ML features in Snowflake; ensure data governance, privacy, and PII compliance; collaborate with stakeholders to translate requirements
Tune algorithm performance; establish data partitioning, clustering, and materialized views; build dashboards and monitors for pipeline health, job success, and data latency metrics
Establish naming conventions, data lineage, and metadata standards; lead code reviews and documentation; contribute to evolving data mesh and streaming architecture vision
Job description
ABOUT US
Xsolla is a global commerce company with robust tools and services to help developers solve the inherent challenges of the video game industry. From indie to AAA, companies partner with Xsolla to help them fund, distribute, market, and monetize their games. Grounded in the belief in the future of video games, Xsolla is resolute in the mission to bring opportunities together, and continually make new resources available to creators. Headquartered and incorporated in Los Angeles, California, Xsolla operates as the merchant of record and has helped over 1,500+ game developers to reach more players and grow their businesses around the world. With more paths to profits and ways to win, developers have all the things needed to enjoy the game.