Python + Pyspark

Work set-up: 
Full Remote
Contract: 
Experience: 
Senior (5-10 years)
Work from: 

Offer summary

Qualifications:

Minimum 5 years of experience in system engineering or software development., At least 3 years of experience with ETL processes, databases, and Hadoop platforms., Proficiency in Python or Java, with experience in Spark and REST APIs., Strong SQL skills and familiarity with AWS services and security protocols..

Key responsibilities:

  • Develop and optimize ETL/ELT processes for large-scale data systems.
  • Launch and manage Spark jobs in client and cluster modes, ensuring performance.
  • Collaborate with DevOps teams following Agile and SDLC practices.
  • Maintain data security and user authorization protocols in Hadoop environments.

Sureminds Solutions Private Limited logo
Sureminds Solutions Private Limited Human Resources, Staffing & Recruiting SME https://www.sureminds.co.in/
501 - 1000 Employees
See all jobs

Job description

5+ years of experience in system engineering or software development
3+ years of experience in engineering with experience in ETL type work with databases and Hadoop platforms.
Skills
  • Hadoop GeneralDeep knowledge of distributed file system concepts, mapreduce principles and distributed computing. Knowledge of Spark and differences between Spark and MapReduce. Familiarity of encryption and security in a Hadoop cluster.
  • Data management data structuresMust be proficient in technical data management tasks, i.e. writing code to read, transform and store data
  • XMLJSON knowledge
  • Experience working with REST APIs
  • SparkExperience in launching spark jobs in client mode and cluster mode. Familiarity with the property settings of spark jobs and their implications to performance.
  • Application DevelopmentFamiliarity with HTML, CSS, and JavaScript and basic designvisual competency
  • SCCGitMust be experienced in the use of source code control systems such as Git
  • ETL Experience with developing ELTETL processes with experience in loading data from enterprise sized RDBMS systems such as Oracle, DB2, MySQL, etc.
  • AuthorizationBasic understanding of user authorization (Apache Ranger preferred)
  • Programming Must be at able to code in Python or expert in at least one high level language such as Java, C, Scala.
  • Must have experience in using REST APIs
  • SQL Must be an expert in manipulating database data using SQL. Familiarity with views, functions, stored procedures and exception handling.
  • AWS General knowledge of AWS Stack (EC2, S3, EBS, …)
  • IT Process ComplianceSDLC experience and formalized change controls
  • Working in DevOps teams, based on Agile principles (e.g. Scrum)
  • ITIL knowledge (especially incident, problem and change management)
  • Languages Fluent English skills

Required profile

Experience

Level of experience: Senior (5-10 years)
Industry :
Human Resources, Staffing & Recruiting
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Communication

Related jobs