Data Engineer

Requirements

  • Strong proficiency in PySpark, Spark SQL, and Databricks Jobs
  • Familiarity with Azure Data Factory, Azure Data Lake Storage Gen2, and Azure Synapse Analytics
  • Experience in building data models (e.g., star schema, snowflake schema)
  • Proficient in Python and SQL

Roles & Responsibilities

  • Design, build, and optimize robust ETL/ELT pipelines using Azure Databricks, Spark, and SQL
  • Develop data processing transformations with Python, PySpark, and SQL to clean, transform, and aggregate data for analytics
  • Manage Azure Data Lake Storage Gen2 and implement Delta Lake for ACID compliance and high-performance data lake operations
  • Integrate Databricks with Azure Data Factory, Synapse Analytics, and Azure Key Vault; monitor and optimize Spark jobs and Databricks clusters for cost efficiency, performance, security, and governance

Job description

This is a remote position.

• Data Pipeline Development: Design, build, and optimize robust ETL/ELT pipelines using Azure Databricks, Spark, and SQL.
• Data Processing & Transformation: Utilize Python, PySpark, and SQL to clean, transform, and aggregate complex data for analytics.
• Azure Data Lake Management: Manage and optimize data storage and retrieval in Azure Data Lake Storage (ADLS) Gen2.
• Delta Lake Implementation: Implement Delta Lake for ACID compliance, data versioning, and high-performance data lake operations.
• Integration with Azure Services: Integrate Databricks with Azure Data Factory for orchestration, Azure Synapse Analytics for warehousing, and Azure Key Vault for security.
• Performance Optimization: Monitor, troubleshoot, and optimize Databricks clusters and Spark jobs to manage costs and performance.
• Data Security & Governance: Implement Role-Based Access Control (RBAC), data encryption, and data lineage tracking.
• Collaboration: Work with data scientists and analysts to support machine learning models and business intelligence (BI) reporting.


Requirements

• Databricks & Spark: Strong proficiency in PySpark, Spark SQL, and Databricks Jobs.
• Azure Infrastructure: Familiarity with Azure Data Factory, ADLS, and Synapse Analytics.
• Data Modelling: Experience in building data models (e.g., star schema, snowflake schema).
• Programming: Proficient in Python and SQL.
• DevOps: Experience with Git and CI/CD tools.


Benefits

Diversity & Inclusion:

At Exavalu, we are committed to building a diverse and inclusive workforce. We welcome applications for employment from all qualified candidates, regardless of race, color, gender, national or ethnic origin, age, disability, religion, sexual orientation, gender identity or any other status protected by applicable law. We nurture a culture that embraces all individuals and promotes diverse perspectives, where you can make an impact and grow your career.

Exavalu also promotes flexibility, depending on the needs of employees, customers, and the business. This might mean part-time work, working outside normal 9-5 business hours, or working remotely. We also have a welcome-back program to help people return to mainstream work after a long break due to health or family reasons.


