GSA Data Engineer

unlimited holidays - extra holidays - extra parental leave - long remote period allowed
Remote: 
Full Remote

Offer summary

Qualifications:

Candidate with 3 years of daily Python use; expertise in SQL database design and CI/CD; proficiency in Python, SQL, ORM tools, AWS services, and web APIs; experience in trunk-based development, cloud infrastructure management, and Apache Spark.

Key responsibilities:

  • Engineer data solutions for Sustainability reporting
  • Engage with teams to develop data solutions based on priorities
BizTek People, Inc. Human Resources, Staffing & Recruiting SME https://www.biztekpeople.com
201 - 500 Employees

Job description

Job Duties

Engineer data solutions in support of Sustainability reporting and analytics initiatives

Engage with product owner, analysts, visualization developers, and business partners to understand capability requirements, and to develop and support data solutions based on product backlog priorities

 

Skills

Preference will be shown to candidates who can provide a link to their open-source code portfolio (a link to your profile on github.com, bitbucket.org, gitlab.com, or another public VCS host is sufficient).



Requirements

Required Skills

Candidates should have demonstrated, in a professional capacity, all of the competencies listed for each of the following three subject areas:

 

General Purpose Python Programming:

·        Python has been your primary coding language (daily use) for at least 3 years

·        You have authored distributable Python packages (packages which can be built, installed, and distributed using setuptools, pip, and twine)

·        You have a solid understanding of how pip dependency resolution works

·        You are proficient in authoring and automating unit and integration tests for Python packages using (minimally) unittest, pytest, and tox

·        You are meticulous about code quality, including readability, know your PEP 8 guidelines inside and out, and are capable of authoring code that passes validation by commonly used static analysis tools, including mypy and flake8

 

Database Design and SQL:

·        You are proficient in authoring readable, well-structured SQL SELECT statements using ISO/ANSI-standard SQL

·        You have hands-on professional experience in data warehouse design and modeling, including authoring DDL statements.

 

Version Control and CI/CD:

·        You have experience with trunk-based development (feature branching) using git for version control, with fully automated deployments (CI/CD).

 

Desired Technical Competencies

 

General Purpose Python Programming:

·        You have a deep understanding of Python’s standard library and Python internals. You understand Python memory management, how CPython implements built-in data structures, and which data structures are best suited for different scenarios

·        You understand and can compare/contrast CPython’s built-in concurrency models, when to use each, and what obstacles might prevent the use of each mechanism

 

Database Design, SQL, and Object Relational Models:

·        You are adept at performance-tuning SQL queries for both OLAP and OLTP databases

·        You understand and are prepared to discuss how and when/where to utilize more esoteric and/or modern SQL features such as window functions and common table expressions.

·        You understand and are prepared to discuss the performance implications of columnar (column-oriented) vs row-oriented relational databases.

·        You have hands-on experience in managing database schema migrations (ideally using SQLAlchemy’s ORM + Alembic).

 

Version Control and CI/CD:

·        You have experience with trunk-based development (feature branching) using git for version control, with fully automated deployments (CI/CD).

 

Cloud Infrastructure and Amazon Web Services:

·        You have hands-on experience using boto3 to interact with Amazon Web Services’ resource APIs, particularly Amazon S3 (Simple Storage Service).

·        You have hands-on experience authoring unit and integration tests utilizing LocalStack to emulate AWS resources.

·        You have hands-on experience using HashiCorp Terraform to manage cloud infrastructure.

·        You have hands-on experience developing serverless ASGI applications using AWS Lambda and AWS API Gateway

 

Web API Server and Client Development:

·        You have experience planning and executing the design and development of web APIs using a modern Python ASGI framework (preferably FastAPI).

·        You have authored, validated, and maintained OpenAPI documents describing your web APIs accurately.

·        You have experience developing and testing Python web API client libraries based on an OpenAPI document.

 

Distributed Computing and Apache Spark:

·        You have experience using Apache Spark for ingestion and manipulation of data sets which are too large to process efficiently in-memory.

·        You have hands-on experience translating algorithms and procedures designed by topical subject matter experts, having varying levels of engineering experience, into well-designed data pipelines.

·        You have experience configuring and tuning Spark clusters to optimize use of computing resources for varying workloads.

·        You understand and can discuss when and why to use distributed computing frameworks, such as Apache Spark, versus alternate concurrency models such as asyncio or multiprocessing.

 

Database Design, SQL, and Object Relational Models:

·        You have experience modeling databases using SQLAlchemy’s ORM framework.

·        You have experience managing database versions and schema migrations using SQLAlchemy with Alembic.

 

Required Soft Skills

Candidates should have demonstrated, in a professional capacity, all or most of the following skills:

You are proficient in communicating effectively and efficiently within a hybrid remote/in-person team structure:

·        You are meticulous about managing your calendar to accurately reflect your free/busy hours

·        You respect and seek to learn digital communications etiquette—including region-specific, industry-specific, and organization-specific etiquette

·        You proactively initiate constructive discussions while curating and targeting your communications with respect for your colleagues’ time and schedules

You are adept at discovering and navigating the complex bureaucratic resources of a large organization.


Required profile

Experience

Spoken language(s):
English

Other Skills

  • Open Mindset
