Logo for Astreya

Data Scientist III - Lead Data Architect

Key Facts

Remote From: 
Category:  Data Scientist
Full time
Senior (5-10 years)
108 - 180K yearly
English

Other Skills

  • Collaboration
  • Problem Solving

Roles & Responsibilities

  • 8+ years in data modeling, data architecture, or analytics engineering
  • 3+ years of Utility/energy domain experience supporting electric, gas, and/or water utilities
  • Strong expertise in Dimensional modeling for analytics and data modeling for machine learning pipelines
  • Experience with SQL and data transformation frameworks

Requirements:

  • Design AI-ready data models to support machine learning and advanced analytics
  • Build and maintain feature-ready datasets for data science teams
  • Develop semantic and analytical data layers for BI and self-service analytics
  • Collaborate with data scientists to translate ML use cases into scalable data structures

Job description

Job Title: Data Modeling Expert – AI & Analytics

Location: California (Hybrid/Remote)

We are seeking a Data Modeling Expert with strong AI/Analytics focus to enable next-generation data platforms supporting predictive analytics, machine learning, and intelligent automation. This role will design and optimize data models that power use cases such as grid reliability, predictive maintenance, wildfire risk modeling, customer analytics, and AI-driven operations.

Key Responsibilities

  • Design AI-ready data models to support machine learning, advanced analytics, and real-time decisioning
  • Build and maintain feature-ready datasets for data science teams (feature engineering support)
  • Develop semantic and analytical data layers for BI, AI, and self-service analytics
  • Collaborate with data scientists to translate ML use cases into scalable data structures
  • Model and integrate high-volume time-series and IoT data (e.g., smart meters, sensors, grid telemetry)
  • Enable real-time / near-real-time data pipelines for AI-driven insights
  • Ensure data models support MLOps frameworks (model training, validation, deployment pipelines)
  • Implement data lineage, observability, and quality frameworks to support trusted AI outcomes
  • Optimize data structures for lakehouse architectures and distributed compute environments
  • Align with data governance, privacy, and regulatory compliance requirements

AI/Analytics Use Case Alignment

  • Predictive Maintenance: Asset failure prediction using sensor and maintenance data
  • Wildfire Risk Modeling: Environmental and grid data modeling for risk forecasting
  • Load Forecasting: Time-series modeling for energy demand prediction
  • Customer 360 Analytics: Behavioral segmentation and usage insights
  • Grid Intelligence: AI-driven outage prediction and response optimization
  • Generative AI Enablement: Structuring enterprise data for LLM-based insights and copilots

Required Qualifications

  • 8+ years in data modeling, data architecture, or analytics engineering
  • 3+ years of Utility/energy domain experience (smart grid, AMI, SCADA systems) supporting electric, gas, and/or water utilities.
  • Strong expertise in:
    • Dimensional modeling for analytics (Star/Snowflake schemas)
    • Data modeling for machine learning pipelines
    • SQL and data transformation frameworks (dbt preferred)
  • Experience designing data models for:
    • Data lakes / lakehouse architectures (Delta Lake, Iceberg, etc.)
    • Structured + semi-structured data (JSON, Parquet)
  • Proven experience supporting AI/ML workloads in production environments

Preferred Qualifications

  • Experience with cloud AI ecosystems:
    • AWS (SageMaker, Redshift)
    • Azure (Synapse, Azure ML)
    • GCP (BigQuery, Vertex AI)
  • Familiarity with time-series and streaming platforms (Kafka, Spark Streaming)
  • Knowledge of feature stores (Feast, Tecton)
  • Experience with MLOps tools (MLflow, Kubeflow)
  • Understanding of LLM data preparation, vector databases, and embeddings

Key Skills

  • AI/ML Data Modeling & Feature Engineering
  • Lakehouse & Modern Data Stack (dbt, Spark, Delta Lake)
  • Time-Series & Streaming Data Modeling
  • Data Governance for AI (quality, lineage, bias mitigation)
  • Performance Optimization for Analytics Workloads
  • Cross-functional collaboration (Data Science, Engineering, Business)

Salary Range

$108,000.00 - $180,000.00 USD (Salary)
  • Please note that the salary information provided herein is base pay only (gross); it does not include other forms of compensation which may or may not apply to this specific position, namely, performance-based bonuses, benefits-related payments, or other general incentives - none of which are guaranteed, may be subject to specific eligibility requirements, and are wholly within the discretion of Astreya to remit.
  • Further, the salary information noted above is a range that consists of a minimum and maximum rate of pay for this specific position. Where an applicant or employee is placed on this range will depend and be contingent on objective, documented work-related considerations like education, experience, certifications, licenses, preferred qualifications, among other factors.

Astreya offers comprehensive benefits to all Regular, Full-Time Employees, including:

  • Medical provided through UHC (PPO, HSA, Surest options) / Medical provided through Kaiser (HMO option only) for California employees only

  • Dental provided through UHC

  • Nationwide Vision provided by UHC

  • Flexible Spending Account for Health & Dependent Care

  • Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific)

  • Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera

  • Corporate Wellness Program provided by Goomi Group

  • Employee Assistance Program

  • Wellness Days

    401k Plan

  • Basic and Supplemental Life Insurance

  • Short Term & Long Term Disability

  • Critical Illness, Critical Hospital, and Voluntary Accident Insurance

  • Tuition Reimbursement (available 6 months after start date, capped)

  • Paid Time Off (accrued and prorated, maximum of 120 hours annually)

  • Paid Holidays

  • Any other statutory leaves, paid time, or other ancillary benefits required under state and federal law

Data Scientist Related jobs

Other jobs at Astreya

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.