10+ years of experience in data engineering and platform architecture
Strong expertise in cloud platforms (Azure, AWS, or GCP) and modern data ecosystems
Familiarity with AI/ML data pipelines, feature stores, and model lifecycle support
Experience with LLM data pipelines
Requirements:
Architect and build scalable, cloud-native data platforms supporting AI/ML and agent-based applications
Design pipelines to deliver AI-ready data (curated, labeled, contextualized, and feature-rich datasets)
Develop robust data ingestion, transformation, and serving layers (batch plus; real-time)
Enable semantic data models, knowledge graphs, and vector databases to power AI agents and LLMs
Job description
Job Title: Data engineer (AI Ready data platforms) Location: Remote (PST time zone)
Role Summary We are seeking a senior Data Engineer to design and build enterprise-scale data platforms that enable AI/ML and agentic systems. This role focuses on engineering AI-ready data foundationsβensuring data is high-quality, governed, and optimized for advanced analytics and autonomous AI agents.
Key Responsibilities
Architect and build scalable, cloud-native data platforms supporting AI/ML and agent-based applications
Design pipelines to deliver AI-ready data (curated, labeled, contextualized, and feature-rich datasets)
Develop robust data ingestion, transformation, and serving layers (batch + real-time)
Enable semantic data models, knowledge graphs, and vector databases to power AI agents and LLMs
Implement data quality, lineage, and governance frameworks to ensure trust and compliance
Collaborate with AI/ML teams to support feature engineering, model training, and inference pipelines
Optimize data architectures for performance, scalability, and cost efficiency
Mentor teams and establish best practices for AI-driven data engineering
Required Skills & Experience
10+ years of experience in data engineering and platform architecture
Strong expertise in cloud platforms (Azure, AWS, or GCP) and modern data ecosystems
Familiarity with AI/ML data pipelines, feature stores, and model lifecycle support
Experience with LLM data pipelines
Strong understanding of data governance, metadata management, and security frameworks
Preferred Qualifications
Experience building data platforms for AI agents / agentic workflows
Knowledge of RAG (Retrieval Augmented Generation) architectures and semantic search
Exposure to data mesh / domain-oriented data architectures
Experience in large-scale enterprise transformation programs