Description:
- Data engineer with 4 to 6 years of hands-on experience working on Big Data platforms
- Experience building and optimizing Big Data pipelines and data sets, spanning data ingestion, processing, and data visualization
- Good experience writing and optimizing Spark jobs, Spark SQL, etc.; must have worked on both batch and streaming data processing
- Good experience in at least one programming language (Scala or Python; Python preferred)
- Experience writing and optimizing complex Hive and SQL queries to process huge data sets; good with UDFs, tables, joins, views, etc.
- Experience using Kafka or any other message broker
- Configuring, monitoring, and scheduling jobs using Oozie and/or Airflow
- Processing streaming data directly from Kafka using Spark jobs; experience with Spark Streaming is a must (see the sketches after this list)
- Should be able to handle different file formats (ORC, Avro, and Parquet) as well as unstructured data
- Should have experience with at least one NoSQL database or object store such as Amazon S3
- Should have worked with at least one data warehouse tool such as AWS Redshift, Snowflake, or BigQuery
- Work experience with at least one cloud: AWS, GCP, or Azure
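For illustration, here is a minimal PySpark sketch covering the batch and streaming skills above (Spark SQL over Parquet/ORC plus Structured Streaming from Kafka). The paths, broker address, topic name, and columns are hypothetical placeholders, not details from this posting; the Kafka source also assumes the spark-sql-kafka package is on the classpath.

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = (
        SparkSession.builder
        .appName("example-pipeline")
        .enableHiveSupport()  # lets Spark SQL query existing Hive tables
        .getOrCreate()
    )

    # Batch: read a Parquet data set and aggregate it with Spark SQL.
    orders = spark.read.parquet("s3a://example-bucket/orders/")  # hypothetical path
    orders.createOrReplaceTempView("orders")
    daily = spark.sql("""
        SELECT order_date, COUNT(*) AS order_count, SUM(amount) AS revenue
        FROM orders
        GROUP BY order_date
    """)
    daily.write.mode("overwrite").format("orc").save("s3a://example-bucket/daily/")

    # Streaming: consume events from Kafka with Structured Streaming.
    events = (
        spark.readStream
        .format("kafka")
        .option("kafka.bootstrap.servers", "broker:9092")  # hypothetical broker
        .option("subscribe", "orders")                     # hypothetical topic
        .load()
    )
    parsed = events.select(F.col("value").cast("string").alias("payload"))
    query = (
        parsed.writeStream
        .format("console")  # console sink keeps the sketch self-contained
        .outputMode("append")
        .start()
    )
    query.awaitTermination()

And, for the Oozie/Airflow bullet, a minimal Airflow 2.x sketch of scheduling a daily Spark job (the DAG id, schedule, and spark-submit path are likewise hypothetical):

    from datetime import datetime

    from airflow import DAG
    from airflow.operators.bash import BashOperator

    with DAG(
        dag_id="daily_orders_pipeline",  # hypothetical DAG id
        start_date=datetime(2024, 1, 1),
        schedule_interval="@daily",
        catchup=False,
    ) as dag:
        run_spark_job = BashOperator(
            task_id="run_spark_job",
            bash_command="spark-submit /opt/jobs/daily_orders.py",  # hypothetical path
        )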
Good to have skills:
- Experience with AWS cloud services such as EMR, S3, Redshift, and EKS/ECS
- Experience with GCP cloud services such as Dataproc and Google Cloud Storage
- Experience working with huge Big Data clusters holding millions of records
- Experience working with the ELK stack, especially Elasticsearch (see the sketch after this list)
- Experience with Hadoop MapReduce, Apache Flink, Kubernetes, etc.
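For the ELK item above, a minimal sketch of indexing and searching documents with the official Python client, assuming the elasticsearch-py 8.x API; the cluster URL, index name, and documents are hypothetical:

    from elasticsearch import Elasticsearch

    es = Elasticsearch("http://localhost:9200")  # hypothetical cluster URL

    # Index a single log document.
    es.index(index="app-logs", document={"level": "ERROR", "msg": "request timeout"})

    # Full-text search over the indexed documents.
    resp = es.search(index="app-logs", query={"match": {"msg": "timeout"}})
    for hit in resp["hits"]["hits"]:
        print(hit["_source"])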
What We Offer
Exciting Projects: We focus on industries like high tech, communications, media, healthcare, retail, and telecom. Our customer list is full of fantastic global brands and leaders who love what we build for them.
Collaborative Environment: You can expand your skills by collaborating with a diverse team of highly talented people in an open, laid-back environment, or even abroad in one of our global centers or client facilities!
Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules, opportunities to work from home, and paid time off and holidays.
Professional Development: Our dedicated Learning & Development team regularly organizes communication skills training (GL Vantage, Toastmasters), stress management programs, professional certifications, and technical and soft-skill trainings.
Excellent Benefits: We provide our employees with competitive salaries, family medical insurance, Group Term Life Insurance, Group Personal Accident Insurance, NPS (National Pension Scheme), periodic health awareness programs, extended maternity leave, annual performance bonuses, and referral bonuses.
Fun Perks: We want you to love where you work, which is why we host sports events and cultural activities, offer food at subsidized rates, and throw corporate parties. Our vibrant offices also include dedicated GL Zones, rooftop decks, and a GL Club where you can share coffee or tea with colleagues over a game of table tennis, and we offer discounts at popular stores and restaurants!
About GlobalLogic
GlobalLogic is a leader in digital engineering. We help brands across the globe design and build innovative products, platforms, and digital experiences for the modern world. By integrating experience design, complex engineering, and data expertise, we help our clients imagine what's possible and accelerate their transition into tomorrow's digital businesses. Headquartered in Silicon Valley, GlobalLogic operates design studios and engineering centers around the world, extending our deep expertise to customers in the automotive, communications, financial services, healthcare and life sciences, manufacturing, media and entertainment, semiconductor, and technology industries. GlobalLogic is a Hitachi Group Company operating under Hitachi, Ltd. (TSE: 6501), which contributes to a sustainable society with a higher quality of life by driving innovation through data and technology as the Social Innovation Business.