Senior Big Data Engineer

Work set-up: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 

Offer summary

Qualifications:

Bachelor's or Master's degree in Computer Science, Software Engineering, or related fields., At least 5 years of experience in backend or ML engineering., Strong Python skills and experience with distributed systems like Spark, Dask, or similar., Proficiency in designing scalable ML infrastructure and understanding of ML workflows..

Key responsibilities:

  • Design and implement large-scale distributed ML training pipelines.
  • Build scalable infrastructure for data preprocessing, feature engineering, and model evaluation.
  • Lead the development of new ML systems from architecture to production.
  • Collaborate with cross-functional teams to deliver impactful solutions.

Globaldev Group logo
Globaldev Group Information Technology & Services Scaleup https://globaldev.tech/
201 - 500 Employees
See all jobs

Job description

We’re looking for a highly technical, independent, and visionary Big Data Engineer to take ownership of our nextgeneration distributed training pipelines and infrastructure. This is a handson, highimpact role in the core of our algorithmic decisionmaking systems shaping how models are trained and deployed at a scale across billions of data points in realtime AdTech environments.

You’ll be responsible for designing and building scalable ML systems from the ground up from data ingestion to model training to evaluation. Youll work closely with Algo researchers, data engineers, and production teams to drive innovation and performance improvements throughout the lifecycle.

Responsibilities
  • Design and implement largescale, distributed ML training pipelines.
  • Build scalable infrastructure for data preprocessing, feature engineering, and model evaluation.
  • Lead the technical design and development of new ML systems: from architecture to production.
  • Collaborate crossfunctionally with DS, infra teams, Product, BA and Engineering teams to define and deliver impactful solutions.
  • Own the full lifecycle of ML infra: tooling, versioning, monitoring, automation, measuring results and quickly responding to critical issues.
  • Continuously research and adopt bestinclass practices in MLOps, performance tuning, and distributed systems.
    • Requirements
      • B.Sc. or M.Sc. in Computer Science, Software Engineering, or other equivalents fields.
      • 5+ years of handson experience in backend or ML engineering.
      • Strong Python skills and experience working with distributed systems and parallel data processing frameworks such as Spark (using PySpark or Scala), Dask, or similar technologies. Familiarity with Scala is a strong advantage, especially in performance critical.
      • Proven track record in designing and scaling ML infrastructure.
      • Deep understanding of ML workflows and lifecycle management.
      • Experience in cloud environments (AWS, GCP, OCI) and containerized deployment (Kubernetes).
      • Understanding databases and SQL for data retrieval.
      • Strong communication skills and ability to drive initiatives independently.
      • A passion for clean code, elegant architecture, and measurable impact.
      • Monitoring and alerting tools (e.g. Grafana, Kibana).
      • Experience working with inmemory and NoSQL databases (e.g. Aerospike, Redis, Bigtable) to support ultrafast data access in productiongrade ML services.
        • What we offer:
          • Help and support from our caring HR team.
          • 20 days of vacation.
          • Purchase the necessary software.
          • The exchange of experience and work with talented colleagues.
          • Last but not least valuable compensation for your efforts.

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Industry :
Information Technology & Services
Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Teamwork
  • Communication

Data Engineer Related jobs