Logo for Diverse Lynx

Lead Data Engineer

Roles & Responsibilities

  • 3+ years of experience designing and deploying big data applications and ETL jobs using PySpark APIs and SparkSQL
  • Strong experience with AWS services across multiple domains, including Kinesis, DMS; S3, RDS, Redshift, DynamoDB; Glue, EMR, Athena, SageMaker, Bedrock; EC2, Lambda, ECS; IAM, KMS, SSE
  • Proficiency in SQL and relational databases (Oracle, SQL Server, Teradata) with expert-level query tuning
  • Hands-on Python development, REST APIs (AWS API Gateway, Node.js), and CI/CD pipelines using GitHub

Requirements:

  • Architect and maintain data pipelines using AWS native services (Glue, Kinesis, Lambda, S3, Redshift)
  • Design and optimize data models on AWS Cloud leveraging Redshift, RDS, and S3
  • Implement ETL/ELT workflows and PySpark jobs for data ingestion, transformation, and storage
  • Mentor engineers on coding best practices, lead design reviews, deployment strategies, and ensure security/compliance; coordinate with System Architect and Scrum Master

Job description


Job Title: Lead Data Engineer

Location: Remote

Duration: Full Time Employment

Job Description:

Must Have Technical/Functional Skills

3+ years relevant experience in designing and deploying big data applications and ETL jobs using PySpark APIs/SparkSQL.

Strong experience with AWS services across multiple domains:

Collection: Kinesis, DMS

Storage: S3, RDS, Redshift, DynamoDB

Analytics & ML: Glue, EMR, Athena, SageMaker, Bedrock

Compute: EC2, Lambda, ECS

Security: IAM, KMS, SSE

Proficiency in SQL and relational databases (Oracle, SQL Server, Teradata); expert-level query tuning.

Hands-on experience with Python development, REST APIs (AWS API Gateway, Node.js), and CI/CD pipelines using GitHub.

Familiarity with file formats (JSON, Parquet, Avro) and Linux/Unix shell scripting.

Exposure to Docker/Kubernetes, Delta Lake APIs, and data quality frameworks.

AWS certification (Developer Associate or higher) preferred.

Roles & Responsibilities:

Architect and maintain data pipelines using AWS native services (Glue, Kinesis, Lambda, S3, Redshift).

Design and optimize data models on AWS Cloud leveraging Redshift, RDS, and S3.

Implement ETL/ELT workflows and PySpark jobs for data ingestion, transformation, and storage.

Operationalize self-service data preparation tools (e.g., Trifacta) on AWS.

Conduct performance engineering for large-scale data lakes in production environments.

Participate in design workshops, provide trade-offs and recommendations for solution architecture.

Mentor engineers on coding best practices, problem-solving, and AWS service utilization.

Define code review processes, deployment strategies, and ensure compliance with security standards.

Collaborate with System Architect and Scrum Master to manage dependencies, risks, and blockers. Support test strategy, defect resolution, and root cause analysis during warranty periods.

Maintain documentation in Confluence and ensure team alignment on standards and practices.

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.

Data Engineer Related jobs

Other jobs at Diverse Lynx

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

✨

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.