Job description
Job Name: Data Engineer - REMOTE
Job Type: Contract
Job Authorization: US Citizen / GC / EAD (H4/L2/TN) preferred. No 3rd parties. C2C resumes accepted.
Responsibilities:
Develop and automate large-scale, high-performance data processing systems (batch and/or streaming) to drive Airbnb business growth and improve the product experience
Build scalable Spark data pipelines leveraging the Airflow scheduler/executor framework
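To make the responsibilities above concrete, here is a minimal sketch of the extract/transform/load shape such a pipeline takes. It is plain Python for illustration only; in production the transform step would be a Spark (Scala) job reading from S3/Hive and the schedule an Airflow DAG, and the record fields used here (`listing_id`, `nightly_rate`) are hypothetical.

```python
# Hypothetical raw partition: one malformed row and one duplicate,
# the kinds of inconsistencies a production pipeline must handle.
RAW_EVENTS = [
    {"listing_id": 1, "nightly_rate": 120.0},
    {"listing_id": 2, "nightly_rate": 95.0},
    {"listing_id": 2, "nightly_rate": 95.0},   # duplicate record
    {"listing_id": 3, "nightly_rate": None},   # inconsistent: missing rate
]

def extract():
    """Stand-in for reading a partition from S3/Hive."""
    return list(RAW_EVENTS)

def transform(rows):
    """Drop malformed rows and deduplicate on listing_id."""
    seen, clean = set(), []
    for row in rows:
        if row["nightly_rate"] is None:
            continue
        if row["listing_id"] in seen:
            continue
        seen.add(row["listing_id"])
        clean.append(row)
    return clean

def load(rows, warehouse):
    """Stand-in for writing the cleaned partition to the warehouse."""
    warehouse.extend(rows)
    return len(rows)

warehouse = []
loaded = load(transform(extract()), warehouse)
```

The same three-stage structure maps directly onto an Airflow DAG, with each stage as a task and Spark doing the heavy lifting in `transform`.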
Minimum Requirements:
4+ years of relevant industry experience
Demonstrated ability to analyze large data sets to identify gaps and inconsistencies, provide data insights, and advance effective product solutions
Working knowledge of relational databases and query authoring (SQL)
Good communication skills, both written and verbal
Strong experience using ETL frameworks (e.g., Airflow, Flume, Oozie) to build and deploy production-quality ETL pipelines
Experience building batch data pipelines in Spark (Scala)
Strong understanding of distributed storage and compute (S3, Hive, Spark)
General software engineering skills (Java or Python, GitHub)
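As a sketch of the kind of SQL query authoring and gap analysis the requirements describe, the following uses an in-memory SQLite database to find missing days in a daily metrics table. The table and column names (`daily_bookings`, `day`, `bookings`) are hypothetical; a real check would run against Hive/Spark SQL at much larger scale.

```python
import sqlite3

# In-memory table of daily booking counts, with 2024-01-03 missing (a gap).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE daily_bookings (day TEXT PRIMARY KEY, bookings INTEGER)")
conn.executemany(
    "INSERT INTO daily_bookings VALUES (?, ?)",
    [("2024-01-01", 10), ("2024-01-02", 12), ("2024-01-04", 9)],
)

# Self-join each day to the next calendar day; a day whose successor has no
# row (and that is not the last day overall) sits just before a gap.
gaps = conn.execute(
    """
    SELECT d.day
    FROM daily_bookings AS d
    LEFT JOIN daily_bookings AS nxt
      ON nxt.day = date(d.day, '+1 day')
    WHERE nxt.day IS NULL
      AND d.day < (SELECT MAX(day) FROM daily_bookings)
    """
).fetchall()
```

Here `gaps` flags `2024-01-02`, the last day before the missing `2024-01-03`. The same self-join pattern translates directly to Hive or Spark SQL with its date arithmetic functions.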