Data Architect

Work set-up: Full Remote
Experience: Senior (5-10 years)
Offer summary

Qualifications:

  • Bachelor's degree in Computer Science, Mathematics, Electrical Engineering, or a related field.
  • 7+ years of experience in data architecture or data engineering roles.
  • Expertise in vector databases and their application in Generative AI.
  • Proficiency in programming languages such as Python and Java, with hands-on experience in data streaming and real-time data pipelines.

Key responsibilities:

  • Design and implement scalable, high-performance data infrastructure for AI applications.
  • Develop and maintain event-driven data solutions using tools like Apache Kafka.
  • Manage and optimize various databases, including vector databases, for performance and scalability.
  • Collaborate with cross-functional teams to support AI and machine learning workloads.

Turtle Trax S.A. (startup, 2-10 employees): https://www.turtle-trax.com/

Job description

SUMMARY

As a hands-on Data Architect, you will be crucial in designing, building, and optimizing the data architecture for our next-generation SaaS platform. This position requires expertise in event-driven data architectures (e.g., Apache Kafka) and emerging technologies such as vector databases to support Generative AI applications. You will be deeply involved in implementing scalable, high-performance data systems that drive real-time analytics, AI applications, and dynamic data processing. Experience in the Utility industry and knowledge of AWS is preferred, as we seek to optimize data systems that cater to the unique demands of this sector.
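To make the event-driven pattern concrete, here is a minimal, broker-free sketch in Python of topic-based publish/subscribe, the core idea behind the Kafka-style pipelines described above. The `EventBus` class, topic name, and meter-reading payload are illustrative assumptions, not part of this role's actual stack; a production system would use a real broker with partitioning, consumer offsets, and durable storage.

```python
from collections import defaultdict
from typing import Any, Callable

# Illustrative in-memory event bus mimicking Kafka-style topics and consumers.
# A real deployment would use Apache Kafka (or similar) for durability,
# partitioning, and replay; this sketch only shows the publish/subscribe flow.
class EventBus:
    def __init__(self) -> None:
        self._subscribers: dict[str, list[Callable[[Any], None]]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Callable[[Any], None]) -> None:
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, event: Any) -> None:
        # Deliver the event to every handler subscribed to this topic.
        for handler in self._subscribers[topic]:
            handler(event)

bus = EventBus()
enriched: list[dict] = []

# Hypothetical pipeline step: enrich a raw utility meter reading in real time.
def enrich(reading: dict) -> None:
    reading["kwh"] = reading["wh"] / 1000
    enriched.append(reading)

bus.subscribe("meter.readings", enrich)
bus.publish("meter.readings", {"meter_id": "m-1", "wh": 2500})
```

The same consumer function could later be pointed at a real Kafka topic; only the transport changes, not the processing logic.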

The development organization leverages Java, Spring Boot, AWS RDS (Postgres, SQL Server), Oracle, AWS Serverless technologies (Lambda, SQS), REST, JavaScript, and Mobile development with React Native, hosted in AWS and using Atlassian tools (Jira, Bitbucket, and Confluence).

Dealbreakers:

Hands-on experience with Python and/or Java is required. Must have practical experience with data streaming use cases. Strong verbal and written communication skills are essential for engaging with stakeholders in technical roles.

Highlight Responsibilities:

This person is directly responsible for maintaining a consistent focus on data concerns and for collaborating with business and technical stakeholders to implement a robust set of data capabilities that meet our functional and non-functional requirements.

JOB FUNCTIONS

Duties and Responsibilities

  • Design & Build Data Infrastructure: Architect and implement scalable, high-performance data infrastructure with a focus on event-driven architectures, real-time data streaming, and advanced AI-driven applications.
  • Event-Driven Data Solutions: Develop event-driven systems leveraging tools like Apache Kafka or similar technologies to support real-time data processing and low-latency pipelines.
  • Hands-on Development: Actively develop and maintain data pipelines, ETL/ELT processes, and event-streaming solutions using Apache Kafka, Apache Flink, Apache Spark, or similar tools, as well as AI-specific data systems.
  • Database Management: Manage and optimize SQL, NoSQL, OLAP, and vector databases to ensure high availability, scalability, and performance. Apply deep knowledge of database internals, including partitioning, sharding, embeddings, distributed database systems, and change data capture (CDC) techniques, to drive efficiency and reliability across complex, large-scale environments.
  • Data Integration: Build real-time and batch data pipelines that integrate structured and unstructured data from various sources, including AI models and third-party data sources.
  • Performance Tuning: Continuously monitor and optimize data systems for performance, ensuring that AI workloads are supported by highly efficient data pipelines and storage solutions.
  • Collaboration: Work closely with product managers, software engineers, and data scientists to align event-driven architectures, vector databases, and data pipelines with the needs of AI and machine learning models.
  • Cloud Architecture: Architect and manage cloud-based data solutions (AWS preferred) that support distributed data processing, AI workloads, and real-time data streaming.
  • Vector Databases: Design and implement vector databases (e.g., Pinecone, pg_vector, Milvus) to support machine learning models, including Generative AI applications, efficiently handling high-dimensional data such as embeddings and unstructured data.
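The vector-database duties above center on similarity search over embeddings. The following Python sketch shows the core operation, cosine-similarity nearest-neighbor lookup, in plain Python rather than a real vector store; the toy 3-dimensional "embeddings" and document ids are invented for illustration. Engines like Pinecone, pg_vector, or Milvus perform this lookup over millions of high-dimensional vectors using approximate indexes instead of a brute-force scan.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    # Cosine of the angle between two vectors: 1.0 means identical direction.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def nearest(query: list[float], index: dict[str, list[float]]) -> str:
    # Brute-force scan: return the id of the most similar stored embedding.
    # Vector databases replace this with approximate indexes (e.g., HNSW, IVF).
    return max(index, key=lambda key: cosine_similarity(query, index[key]))

# Toy 3-dimensional embeddings; real embeddings have hundreds of dimensions.
index = {
    "doc-a": [1.0, 0.0, 0.0],
    "doc-b": [0.0, 1.0, 0.0],
    "doc-c": [0.7, 0.7, 0.0],
}
best = nearest([0.9, 0.1, 0.0], index)  # closest in direction to doc-a
```

In a Generative AI retrieval pipeline, the query vector would come from an embedding model and the result ids would identify documents fed back into the model's context.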
Requirements:

  • Bachelor's degree in Computer Science, Mathematics, Electrical Engineering, or equivalent knowledge and experience.
  • 7+ years' experience as a Data Architect or in a similar data engineering role, with direct involvement in designing and implementing event-driven architectures.
  • Expertise in vector databases (e.g., Pinecone, Weaviate, Milvus) and their application in Generative AI and other machine learning models, including managing high-dimensional data and embeddings.
  • Strong understanding of Generative AI applications and how to build data pipelines and infrastructure to support them.
  • Proficiency in programming (Python, Java, or similar languages), with the ability to write clean, efficient code for event-driven data pipelines and AI-driven data architectures.
  • Experience with real-time data streaming, ETL/ELT processes, and tools like Apache Kafka, Apache Flink, Kinesis, etc.
  • Extensive experience with cloud-based data architectures and distributed systems.
  • Deep understanding of database technologies (SQL, NoSQL, OLAP, vector) and performance optimization for AI workloads.
  • Strong problem-solving skills and a hands-on approach to addressing technical challenges.
  • Experience in the SaaS industry or building scalable data systems for AI-powered products.

Preferred Qualifications:

  • Experience in the Utility industry is a plus.
  • Familiarity with modern data visualization tools (e.g., Tableau, Looker) and BI platforms.

Production Support/On-Call Duties:

As a key member of our engineering team, you will address escalated production issues from customer support. Your responsibilities will include:

  • Participating in a rotational on-call schedule to handle significant production issues.
  • Rapidly diagnosing and resolving technical challenges that arise in production.
  • Collaborating with customer support and engineering teams for seamless issue resolution.
  • Maintaining clear communication and documentation during and after incidents.
  • Leveraging these experiences to contribute to continuous process improvement.

Compensatory Time for On-Call Work:

We value work-life balance and recognize the extra effort required during on-call rotations. For hours spent actively working on-call, compensatory time off is provided, unless the law requires otherwise. This ensures your commitment is appropriately acknowledged. Please coordinate with your manager regarding the approval and scheduling of compensatory time so that it aligns with team needs and workload.

Your contribution is essential in maintaining the smooth operation of our systems and in upholding high standards of customer satisfaction.

Required profile

Experience

Level of experience: Senior (5-10 years)
Spoken language(s):
English

Other Skills

  • Communication
  • Problem Solving
